f0 contour Latest Research Papers

Effects of Fundamental Frequency Contours on Sentence Recognition in Mandarin-Speaking Children With Cochlear Implants

Journal of Speech Language and Hearing Research ◽

10.1044/2020_jslhr-20-00033 ◽

2020 ◽

Vol 63 (11) ◽

pp. 3855-3864

Author(s):

Wanting Huang ◽

Lena L. N. Wong ◽

Fei Chen ◽

Haihong Liu ◽

Wei Liang

Keyword(s):

Cochlear Implants ◽

Fundamental Frequency ◽

Signal To Noise Ratio ◽

Lexical Tone ◽

Signal To Noise ◽

Sentence Recognition ◽

Test Conditions ◽

Age Appropriate ◽

F0 Contour ◽

Appropriate Sentences

Purpose: Fundamental frequency (F0) is the primary acoustic cue for lexical tone perception in tonal languages but is processed in a limited way in cochlear implant (CI) systems. The aim of this study was to evaluate the importance of F0 contours in sentence recognition in Mandarin-speaking children with CIs and to determine whether it differs from that in age-matched normal-hearing (NH) peers. Method: Age-appropriate sentences, with F0 contours manipulated to be either natural or flattened, were randomly presented to preschool children with CIs and their age-matched peers with NH under three test conditions: in quiet, in white noise, and with competing sentences at 0 dB signal-to-noise ratio. Results: The neutralization of F0 contours resulted in a significant reduction in sentence recognition. While this was seen only in noise conditions among NH children, it was observed throughout all test conditions among children with CIs. Moreover, the F0 contour-induced accuracy reduction ratios (i.e., the reduction in sentence recognition resulting from the neutralization of F0 contours compared to the normal F0 condition) were significantly greater in children with CIs than in NH children in all test conditions. Conclusions: F0 contours play a major role in sentence recognition in both quiet and noise among pediatric implantees, and the contribution of the F0 contour is even more salient than that in age-matched NH children. These results also suggest that there may be differences between children with CIs and NH children in how F0 contours are processed.
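The two operations this abstract relies on, flattening an F0 contour and computing an accuracy reduction ratio, can be illustrated with a minimal sketch. It is not the authors' procedure: the helper names `flatten_f0` and `reduction_ratio` are hypothetical, flattening to the mean voiced F0 is an assumption, and the ratio is assumed to be the drop in accuracy divided by the accuracy in the natural-F0 condition.

```python
from statistics import mean


def flatten_f0(f0_hz):
    """Return a flattened copy of a frame-wise F0 track.

    Assumption: unvoiced frames are coded as 0, and every voiced frame
    is replaced by the mean F0 of the voiced frames.
    """
    voiced = [f for f in f0_hz if f > 0]
    if not voiced:
        return list(f0_hz)  # nothing to flatten
    level = mean(voiced)
    return [level if f > 0 else 0.0 for f in f0_hz]


def reduction_ratio(acc_natural, acc_flat):
    """Relative drop in recognition accuracy caused by flattening.

    Assumed definition: (natural - flattened) / natural.
    """
    if acc_natural <= 0:
        raise ValueError("accuracy in the natural condition must be positive")
    return (acc_natural - acc_flat) / acc_natural


# Hypothetical numbers, for illustration only (not data from the study).
track = [0, 200, 220, 0, 240]
print(flatten_f0(track))
print(reduction_ratio(80, 60))
```

Under this assumed definition, a larger ratio means that a listener group depends more heavily on the natural F0 contour.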

Tonal Contour Generation for Isarn Speech Synthesis Using Deep Learning and Sampling-Based F0 Representation

Applied Sciences ◽

10.3390/app10186381 ◽

2020 ◽

Vol 10 (18) ◽

pp. 6381 ◽

Cited By ~ 1

Author(s):

Pongsathon Janyoi ◽

Pusadee Seresangtakul

Keyword(s):

Neural Network ◽

Deep Learning ◽

Recurrent Neural Network ◽

Speech Synthesis ◽

Critical Factor ◽

Dynamic Features ◽

Linguistic Features ◽

Proposed Model ◽

F0 Contour ◽

Tonal Contour

The modeling of fundamental frequency (F0) in speech synthesis is a critical factor affecting the intelligibility and naturalness of synthesized speech. In this paper, we focus on improving the modeling of F0 for Isarn speech synthesis. We propose the F0 model for this based on a recurrent neural network (RNN). Sampled values of F0 are used at the syllable level of continuous Isarn speech combined with their dynamic features to represent supra-segmental properties of the F0 contour. Different architectures of the deep RNNs and different combinations of linguistic features are analyzed to obtain conditions for the best performance. To assess the proposed method, we compared it with several RNN-based baselines. The results of objective and subjective tests indicate that the proposed model significantly outperformed the baseline RNN model that predicts values of F0 at the frame level, and the baseline RNN model that represents the F0 contours of syllables by using discrete cosine transform.
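The idea of representing a syllable's F0 contour by a few sampled values plus dynamic features can be sketched as follows. This is an illustrative assumption, not the model from the paper: the function names, the use of linear interpolation, the number of sample points, and the definition of dynamic features as first differences are all hypothetical choices.

```python
def sample_contour(f0, n_points=5):
    """Pick n_points equally spaced values from a syllable's F0 track,
    using linear interpolation between neighbouring frames."""
    if n_points < 2 or len(f0) < 2:
        raise ValueError("need at least two frames and two sample points")
    last = len(f0) - 1
    samples = []
    for k in range(n_points):
        pos = k * last / (n_points - 1)  # fractional frame index
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        samples.append(f0[lo] * (1 - frac) + f0[hi] * frac)
    return samples


def deltas(samples):
    """Dynamic features, assumed here to be simple first differences."""
    return [b - a for a, b in zip(samples, samples[1:])]


# Hypothetical rising contour in Hz over five voiced frames.
syllable_f0 = [100, 110, 120, 130, 140]
static = sample_contour(syllable_f0, n_points=3)
print(static, deltas(static))
```

A fixed-length vector like this gives a syllable-level target of constant size regardless of syllable duration, which is one plausible reason to prefer it over frame-level prediction.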

Prediction of Voicing and the F0 Contour from Electromagnetic Articulography Data for Articulation-to-Speech Synthesis

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9053231 ◽

2020 ◽

Cited By ~ 1

Author(s):

Simon Stone ◽

Philipp Schmidt ◽

Peter Birkholz

Keyword(s):

Speech Synthesis ◽

Electromagnetic Articulography ◽

F0 Contour

Word segmentation in Persian continuous speech using F0 contour

Signal and Data Processing ◽

10.29252/jsdp.16.4.135 ◽

2020 ◽

Vol 16 (4) ◽

pp. 135-150

Author(s):

Vahid Sadeghi

Keyword(s):

Word Segmentation ◽

Continuous Speech ◽

F0 Contour

Information packaging correlates of semantic information structure categories

Bucharest Working Papers in Linguistics ◽

10.31178/bwpl.22.1.1 ◽

2020 ◽

Vol 22 (1) ◽

pp. 5-27

Author(s):

Doina Jitcă

Keyword(s):

Information Structure ◽

Semantic Information ◽

Contour Analysis ◽

Model Based ◽

Information Packaging ◽

Intonational Meaning ◽

F0 Contour ◽

Accent Identification ◽

The Relationship ◽

Model Based Analysis

This paper presents an Information Structure (IS) model at the information packaging (IPk) level and its usage in utterance partitioning and in explaining semantic IS category realizations at the pragmatic level. The IPk model proposes a hierarchical view of F0 contours that transforms utterances into binary contrast unit (CU) hierarchies. CUs have binary IPk partitions with two independent and overlapping structures and a nuclear element that projects its IPk functions onto the whole unit it belongs to. Two nuclear accent identification rules are formulated in this paper for decoding the IPk partition hierarchy by F0 contour analysis. In the second part of the paper, several intonational contours of English sentences with different semantic IS events are interpreted by correlating semantic IS analysis results with those of the IPk model-based analysis. By decoding IPk structure and functional constituents from F0 contours, we can advance our knowledge about the relationship between prosody and intonational meaning.

Approximations to the Voice of a Cochlear Implant: Explorations With Single-Sided Deaf Listeners

Trends in Hearing ◽

10.1177/2331216520920079 ◽

2020 ◽

Vol 24 ◽

pp. 233121652092007

Author(s):

Michael F. Dorman ◽

Sarah Cook Natale ◽

Leslie Baxter ◽

Daniel M. Zeitler ◽

Matthew L. Carlson ◽

...

Keyword(s):

Cochlear Implant ◽

Normal Hearing ◽

Large Individual ◽

Electrode Arrays ◽

Formant Frequencies ◽

Low Pass ◽

Small Set ◽

F0 Contour ◽

Spectral Smearing ◽

Electrode Insertion

Fourteen single-sided deaf listeners fit with an MED-EL cochlear implant (CI) judged the similarity of clean signals presented to their CI and modified signals presented to their normal-hearing ear. The signals to the normal-hearing ear were created by (a) filtering, (b) spectral smearing, (c) changing overall fundamental frequency (F0), (d) F0 contour flattening, (e) changing formant frequencies, (f) altering resonances and ring times to create a metallic sound quality, (g) using a noise vocoder, or (h) using a sine vocoder. The operations could be used singly or in any combination. On a scale of 1 to 10 where 10 was a complete match to the sound of the CI, the mean match score was 8.8. Over half of the matches were 9.0 or higher. The most common alterations to a clean signal were band-pass or low-pass filtering, spectral peak smearing, and F0 contour flattening. On average, 3.4 operations were used to create a match. Upshifts in formant frequencies were implemented most often for electrode insertion angles less than approximately 500°. A relatively small set of operations can produce signals that approximate the sound of the MED-EL CI. There are large individual differences in the combination of operations needed. The sound files in Supplemental Material approximate the sound of the MED-EL CI for patients fit with 28-mm electrode arrays.

Intonational Pitch Features of Interrogatives and Declaratives in Chengdu Dialect

Forum for Linguistic Studies ◽

10.18063/fls.v1i1.1082 ◽

2019 ◽

Vol 1 (1) ◽

Author(s):

Hongliu Jiang

Keyword(s):

Acoustic Analysis ◽

Lexical Tone ◽

F0 Contour ◽

Final Syllable ◽

Theoretical Results

As a representative of southwestern Mandarin, the Chengdu dialect has its own distinctive pitch features in the phonology of tone and intonation. Research on the pronunciation and lexical tone of the Chengdu dialect has a long history and has produced a body of theoretical results. However, research on the intonation of the Chengdu dialect is still rare. This paper provides an acoustic analysis of the intonational pitch features of interrogative and declarative sentences in the Chengdu dialect, examining the F0 contour at the final syllable (character) of each sentence to determine whether statement or question mood is carried by the edge tone, and to examine the pitch perturbation between lexical tone and intonation on that syllable. The results show that statement and question mood in the Chengdu dialect are carried by the final syllable of an intonational phrase, and that the coexistence of lexical tone and intonation perturbs the pitch of that final syllable (character).

The effect of F0 contour on the intelligibility of Mandarin Chinese for hearing-impaired listeners

The Journal of the Acoustical Society of America ◽

10.1121/1.5119264 ◽

2019 ◽

Vol 146 (2) ◽

pp. EL85-EL91 ◽

Cited By ~ 1

Author(s):

Yadong Niu ◽

Fei Chen ◽

Jing Chen

Keyword(s):

Mandarin Chinese ◽

Hearing Impaired ◽

F0 Contour

The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility, Communication Efficiency, and Perceived Naturalness of Synthetic Speech

American Journal of Speech-Language Pathology ◽

10.1044/2019_ajslp-msc18-18-0052 ◽

2019 ◽

Vol 28 (2S) ◽

pp. 875-886 ◽

Cited By ~ 1

Author(s):

Jennifer M. Vojtech ◽

Jacob P. Noordzij ◽

Gabriel J. Cler ◽

Cara E. Stepp

Keyword(s):

Fundamental Frequency ◽

Slow Rate ◽

Speech Synthesis ◽

Speech Rate ◽

Synthetic Speech ◽

Normal Rate ◽

Synthesized Speech ◽

Sentence Level ◽

Communication Efficiency ◽

F0 Contour

Purpose: This study investigated how modulating fundamental frequency (f0) and speech rate differentially impact the naturalness, intelligibility, and communication efficiency of synthetic speech. Method: Sixteen sentences of varying prosodic content were developed via a speech synthesizer. The f0 contour and speech rate of these sentences were altered to produce 4 stimulus sets: (a) normal rate with a fixed f0 level, (b) slow rate with a fixed f0 level, (c) normal rate with prosodically natural f0 variation, and (d) normal rate with prosodically unnatural f0 variation. Sixteen listeners provided orthographic transcriptions and judgments of naturalness for these stimuli. Results: Sentences with f0 variation were rated as more natural than those with a fixed f0 level. Conversely, sentences with a fixed f0 level demonstrated higher intelligibility than those with f0 variation. Speech rate did not affect the intelligibility of stimuli with a fixed f0 level. Communication efficiency was highest for sentences produced at a normal rate and a fixed f0 level. Conclusions: Sentence-level f0 variation increased naturalness ratings of synthesized speech, whether the variation was prosodically natural or not. However, these f0 variations reduced intelligibility. There is evidence of a trade-off in naturalness and intelligibility of synthesized speech, which may impact future speech synthesis designs. Supplemental Material: https://doi.org/10.23641/asha.8847833

F0 contour prediction for the Kazakh language

Proceedings of the 5th International Conference on Engineering and MIS - ICEMIS '19 ◽

10.1145/3330431.3330436 ◽

2019 ◽

Author(s):

Arman Kaliyev ◽

Yuri N. Matveev ◽

Elena E. Lyakso ◽

Sergey V. Rybin

Keyword(s):

F0 Contour
