prosodic feature
Recently Published Documents



2022 ◽  
Vol 14 (2) ◽  
pp. 614
Author(s):  
Taniya Hasija ◽  
Virender Kadyan ◽  
Kalpna Guleria ◽  
Abdullah Alharbi ◽  
Hashem Alyami ◽  
...  

Speech recognition has been an active field of research in the last few decades since it facilitates better human–computer interaction. Native language automatic speech recognition (ASR) systems are still underdeveloped. Punjabi ASR systems are still in their infancy: most research has addressed adult speech only, and little work has been done on Punjabi children’s ASR. This research aimed to build a prosodic feature-based children’s automatic speech recognition system using discriminative modeling techniques. The corpus of Punjabi children’s speech poses various runtime challenges, such as acoustic variation across speakers of different ages. To overcome these issues, out-of-domain data augmentation was implemented using a Tacotron-based text-to-speech synthesizer. Prosodic features were extracted from the Punjabi children’s speech corpus, and selected prosodic features were then combined with Mel Frequency Cepstral Coefficient (MFCC) features before being submitted to an ASR framework. The system modeling process investigated several approaches, including Maximum Mutual Information (MMI), Boosted Maximum Mutual Information (bMMI), and feature-based Maximum Mutual Information (fMMI). After the corpus was extended through out-of-domain data augmentation, prosodic features were also extracted from the extended corpus, and experiments were conducted on both individual and integrated prosodic-based acoustic features. The fMMI technique exhibited a 20% to 25% relative improvement in word error rate compared with the MMI and bMMI techniques. Performance improved further with the augmented dataset and hybrid front-end features (MFCC + POV + F0 + voice quality), giving a relative improvement of 13% compared with the earlier baseline system.
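The two quantitative ideas in this abstract, combining prosodic values with MFCC vectors and reporting a relative improvement in word error rate, can be illustrated with a minimal sketch. It assumes that the features are combined by frame-wise concatenation and that relative improvement is the usual relative reduction, 100 × (baseline − new) / baseline; the abstract confirms neither detail. The function names and all numbers are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def concat_frame_features(mfcc: Sequence[Sequence[float]],
                          prosodic: Sequence[Sequence[float]]) -> List[List[float]]:
    """Append each frame's prosodic values to that frame's MFCC vector.

    Assumption: both streams were computed with the same frame rate,
    so frame i of one stream lines up with frame i of the other.
    """
    if len(mfcc) != len(prosodic):
        raise ValueError("both streams must have the same number of frames")
    return [list(m) + list(p) for m, p in zip(mfcc, prosodic)]


def relative_improvement(baseline_wer: float, new_wer: float) -> float:
    """Relative WER reduction in percent: 100 * (baseline - new) / baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


# Illustrative values only; none of these come from the paper.
mfcc_frames = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]   # 2 frames, 3 coefficients each
prosodic_frames = [[0.9, 180.0], [0.1, 0.0]]       # 2 frames: POV, F0 in Hz

hybrid = concat_frame_features(mfcc_frames, prosodic_frames)
improvement = relative_improvement(baseline_wer=20.0, new_wer=15.0)

print(hybrid[0])      # first hybrid frame: MFCCs followed by POV and F0
print(improvement)    # relative WER reduction in percent
```

With these made-up numbers, a drop in WER from 20% to 15% is a 5-point absolute reduction but a 25% relative improvement, which is the kind of figure the abstract reports.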


2021 ◽  
Vol 23 (1) ◽  
pp. 20-46
Author(s):  
Ling Zhang

Abstract Cantonese is a syllable-timed language: that is, the syllable is the isochronous unit of speech. However, Cantonese has a type of closed syllable with the stop codas [-p], [-t], or [-k] (i.e. syllables with the so-called “entering tones”) that sounds much shorter than other syllables. On the surface, the shorter duration of stop syllables seems to conflict with the general prosodic feature of syllable isochrony. This study conducted acoustic investigations of stop syllables in Cantonese in different contexts (i.e. in isolated form, in disyllabic words, and in disyllabic words located at the beginning, middle, and final positions of sentences). The results showed that stop syllables on their own are shorter than non-stop syllables in various contexts. However, in disyllabic words or in sentences, there is a supplementary lengthening effect immediately after the stop syllables: there is a longer acoustic blank, and in some circumstances the initial of the following syllable is lengthened. Therefore, we propose that the phonetic realization of syllable isochrony extends beyond the syllable itself in Cantonese. The results and discussions of this study may also shed light on the problem of the disappearance of “entering tones” from various Chinese dialects.


2021 ◽  
Author(s):  
David Escudero ◽  
Valentín Cardeñoso-Payo ◽  
Mario Corrales Astorgano ◽  
César González-Ferreras

2021 ◽  
pp. 22-30
Author(s):  
Anna V. Dunashova ◽  

This paper aims to study phonostylistic variation in the prosodic characteristics of a linguistic persona. The new aspect brought to the field is the focus not only on pitch and speech rate but also on the voice quality aspects of a linguistic persona's prosody. The subject was the world-famous British linguist David Crystal, whose recordings of a lecture and an interview were used as the material for this study. The data suggest wide variability in practically every prosodic feature. Among them, pitch minimum, pitch range, loudness median and shimmer values proved to be the most constant features of the linguistic persona. The other prosodic values changed due to the shift from the modal voice in the interview to a more arduous voice in the lecture, thus reflecting different pragmatic goals of communication. The prosodic variation range of the linguistic persona in question turned out to be wider than the average one.


2021 ◽  
Vol 29 (1) ◽  
pp. 183-197
Author(s):  
Shahid Ali Mahar ◽  
Mumtaz Hussain Mahar ◽  
Javed Ahmed Mahar ◽  
Mehedi Masud ◽  
Muneer Ahmad ◽  
...  

2020 ◽  
Vol 1 (3) ◽  
pp. 200-205
Author(s):  
Maftuna Gazieva ◽  
Mashhura Burxanova

This article discusses lexical stress, which is the unit of intonation. As a prosodic feature, stress is studied through experimental methods. The division of languages into dynamic, quantitative and musical-accented languages according to melodic, elongated and quantitative signs of stress is determined experimentally. Visual images of lexical stress are provided through experimental analysis carried out using modern technologies. Based on the results obtained, various views on the types of lexical stress in the Uzbek language are critically generalized.


Author(s):  
Paris Binos

Vocants are precursors to speech and are facially neutral. The presence of these speechlike vocalizations was evident during the precursors to mature phonology called “protophones”. The prosodic feature of duration of the nuclei plays a crucial role in the shift of prelexical to mature speech, since speech intelligibility is closely related to the control of duration. The aim of this work is to determine whether cochlear implants (CIs) positively trigger language acquisition and the development of verbal skills. Recent literature findings are compared and discussed with the performance of two Greek congenitally hearing-impaired infants who were matched with three normal-hearing (NH) infants. This work highlighted an important weakness of the prosodic abilities of young infants with CIs.


2019 ◽  
Vol 8 (2) ◽  
pp. 291-309 ◽  
Author(s):  
Mahmood Yenkimaleki ◽  
Vincent J. van Heuven

Abstract The present study investigates the effect of prosodic feature awareness training on the intelligibility of speech produced by Iranian interpreter trainees. Two groups of student interpreters were formed. All were native speakers of Farsi who studied English translation and interpreting at the BA level. Participants took a pretest of speaking skills before starting the program so that their speech intelligibility level could be rated. The control group listened to authentic audio tracks in English and discussed their contents, watched authentic English movies, and discussed issues in the movies in pairs in the classroom. The experimental group spent part of the time on theoretical explanation of, and practical exercises with, English prosody. Students then took a posttest in speaking skills so that the effect of treatment on the intelligibility of their speech could be assessed. The results show that the prosody awareness training significantly improved the students’ speech intelligibility.

