Speech/non-speech segments detection based on chaotic and prosodic features

Author(s):  
Soheil Shafiee ◽  
Farshad Almasganj ◽  
Ayyoob Jafari
2021 ◽  
pp. 325-336
Author(s):  
L. P. Prokofyeva ◽  
◽  
I. L. Plastun ◽  
N. V. Filippova ◽  
L. Yu. Matveeva ◽  
...  

The paper presents an experimental project of linguists, medical professionals, lawyers, com-puter security specialists dealing with emotions discernment by basic speech signal characteristics. The software has been created, and its first testing has been carried out in the social network VKontakte. The collected recordings of speech fragments of live spontaneous prox-imate-intermediated dialogical speech were analyzed at several levels. First, a complex lin-guistic analysis revealed lexico-semantic and prosodic features of emotionality. Then, a com-parison with the software results was carried out, and the data obtained were systematized. Also, conclusions on the leading role of prosody in revealing hidden types of emotional stress were made. Frequent agreements of digital meanings of prosodic elements in speech segments were found demonstrating emotions not fixed at the lexico-semantic level. Finally, the work-ing formulation for external and internal ways of emotionality expression in the live speech was offered.


2011 ◽  
Vol 52 (1-4) ◽  
pp. 373-392
Author(s):  
Damien Colas

To talk about the Frenchness of Le comte Ory might sounds like provocation. Being basically a rifacimento of his Viaggio a Reims, Rossini’s penultimate stage work belongs to the corpus of Italo-French operas. Yet there are three reasons for looking at Le comte Ory as an authentic French opera. Firstly, in the newly composed parts of the work, Rossini avoided the traditional features of the closed numbers typical of the Italian tradition by inserting recitatives inside the numbers and by merging closed numbers and subsequent recitatives, especially at the end of Act II. Secondly, the French lines written by Scribe to fit the already composed music follow poetic patterns from the Middle Ages, of which the prosodic features were closer to Italian than Classical French. Last, the very choice of the legend of Ory is typical of the troubadour style that had been fashionable in Paris since the last decades of the 18th century, and it turns out that this particular legend was extremely popular back then, as witnessed by the variety of local variants that were published in the 19th century.


2012 ◽  
Vol 34 (6) ◽  
pp. 1404-1407 ◽  
Author(s):  
Yan-xiong Li ◽  
Yong Wu ◽  
Qian-hua He

Author(s):  
Chunyan Ji ◽  
Thosini Bamunu Mudiyanselage ◽  
Yutong Gao ◽  
Yi Pan

AbstractThis paper reviews recent research works in infant cry signal analysis and classification tasks. A broad range of literatures are reviewed mainly from the aspects of data acquisition, cross domain signal processing techniques, and machine learning classification methods. We introduce pre-processing approaches and describe a diversity of features such as MFCC, spectrogram, and fundamental frequency, etc. Both acoustic features and prosodic features extracted from different domains can discriminate frame-based signals from one another and can be used to train machine learning classifiers. Together with traditional machine learning classifiers such as KNN, SVM, and GMM, newly developed neural network architectures such as CNN and RNN are applied in infant cry research. We present some significant experimental results on pathological cry identification, cry reason classification, and cry sound detection with some typical databases. This survey systematically studies the previous research in all relevant areas of infant cry and provides an insight on the current cutting-edge works in infant cry signal analysis and classification. We also propose future research directions in data processing, feature extraction, and neural network classification fields to better understand, interpret, and process infant cry signals.


2021 ◽  
Vol 7 (s4) ◽  
Author(s):  
Ajit Singh

Abstract This article investigates action plans not as mental but as situated and observable activities in social interactions. I argue that projections and action plans can be understood as complex embodied practices through which actors prepare and coordinate further actions as part of a trajectory of a “communicative project”. “Projections” within ‘talk-in-interaction’ are a central topic of conversation analysis (CA), e.g. for the micro analysis of the organization of turn-taking or for the identification of turn-constructional units. Aside from former CA-studies on syntactic and prosodic features, current research using CA from a multimodal perspective shows how embodied resources, such as gestures, serve as “premonitory components” of communicative actions. Using video data of communications in sports training in trampolining, I will show how communicatively situated “embodied action plans” are applied within pre-enactments and instructions for the production of embodied knowledge. Pre-enactments not only serve the production of an ideal imagination to corporally produce intersubjectivity. Pre-enactments are also used temporally for the multimodal and visibly situating of embodied action plans, to which actors can coordinate and orientate their current and prospective communicative actions.


Sign in / Sign up

Export Citation Format

Share Document