Exploring Wavelet Transform Based Methodology for Cochlear Prosthesis Advanced Speech Processing Strategy

2014 ◽  
Vol 100 (2) ◽  
pp. 341-352
Author(s):  
Mohamed Ghorbel ◽  
Amira Derbel ◽  
Fathi Kallel ◽  
Mounir Samet ◽  
Ahmed Ben Hamida

1987 ◽  
Vol 96 (1_suppl) ◽  
pp. 132-134 ◽  
Author(s):  
R. C. Dowell ◽  
P. M. Seligman ◽  
P. J. Blamey ◽  
G. M. Clark

Initial results with the two-formant speech-processing strategy (F0F1F2) confirm the advantage of a multichannel cochlear prosthesis capable of stimulating at different sites within the cochlea. The successful presentation of two spectra] components by varying the place of stimulation leads to the possibility of presenting further spectral information in this manner. Because virtually all multichannel implant patients demonstrate good “place” (electrode site) discrimination, these more refined coding strategies should lead to benefits for the majority of implantees. Already, with the F0F1F2 strategy, we have a system that appears to provide some effective auditory-alone communication ability for the average patient.



2007 ◽  
Vol 2007 ◽  
pp. 1-5 ◽  
Author(s):  
Aïcha Bouzid ◽  
Noureddine Ellouze

This paper describes a multiscale product method (MPM) for open quotient measure in voiced speech. The method is based on determining the glottal closing and opening instants. The proposed approach consists of making the products of wavelet transform of speech signal at different scales in order to enhance the edge detection and parameter estimation. We show that the proposed method is effective and robust for detecting speech singularity. Accurate estimation of glottal closing instants (GCIs) and opening instants (GOIs) is important in a wide range of speech processing tasks. In this paper, accurate estimation of GCIs and GOIs is used to measure the local open quotient (Oq) which is the ratio of the open time by the pitch period. Multiscale product operates automatically on speech signal; the reference electroglottogram (EGG) signal is used for performance evaluation. The ratio of good GCI detection is 95.5% and that of GOI is 76%. The pitch period relative error is 2.6% and the open phase relative error is 5.6%. The relative error measured on open quotient reaches 3% for the whole Keele database.



2021 ◽  
Vol 4 (3) ◽  
pp. 37-41
Author(s):  
Sayora Ibragimova ◽  

This work deals with basic theory of wavelet transform and multi-scale analysis of speech signals, briefly reviewed the main differences between wavelet transform and Fourier transform in the analysis of speech signals. The possibilities to use the method of wavelet analysis to speech recognition systems and its main advantages. In most existing systems of recognition and analysis of speech sound considered as a stream of vectors whose elements are some frequency response. Therefore, the speech processing in real time using sequential algorithms requires computing resources with high performance. Examples of how this method can be used when processing speech signals and build standards for systems of recognition.Key words: digital signal processing, Fourier transform, wavelet analysis, speech signal, wavelet transform



2010 ◽  
Vol 21 (01) ◽  
pp. 052-065 ◽  
Author(s):  
Richard S. Tyler ◽  
Shelley A. Witt ◽  
Camille C. Dunn ◽  
Ann Perreau ◽  
Aaron J. Parkinson ◽  
...  

Objectives: The purpose of this investigation was to determine if adult bilateral cochlear implant recipients could benefit from using a speech processing strategy in which the input spectrum was interleaved among electrodes across the two implants. Design: Two separate experiments were conducted. In both experiments, subjects were tested using a control speech processing strategy and a strategy in which the full input spectrum was filtered so that only the output of half of the filters was audible to one implant, while the output of the alternative filters was audible to the other implant. The filters were interleaved in a way that created alternate frequency “holes” between the two cochlear implants. Results: In experiment one, four subjects were tested on consonant recognition. Results indicated that one of the four subjects performed better with the interleaved strategy, one subject received a binaural advantage with the interleaved strategy that they did not receive with the control strategy, and two subjects showed no decrement in performance when using the interleaved strategy. In the second experiment, 11 subjects were tested on word recognition, sentences in noise, and localization (it should be noted that not all subjects participated in all tests). Results showed that for speech perception testing one subject achieved significantly better scores with the interleaved strategy on all tests, and seven subjects showed a significant improvement with the interleaved strategy on at least one test. Only one subject showed a decrement in performance on all speech perception tests with the interleaved strategy. Out of nine subjects, one subject preferred the sound quality of the interleaved strategy. No one performed better on localization with the interleaved strategy. Conclusion: Data from this study indicate that some adult bilateral cochlear implant recipients can benefit from using a speech processing strategy in which the input spectrum is interleaved among electrodes across the two implants. It is possible that the subjects in this study who showed a significant improvement with the interleaved strategy did so because of less channel interaction; however, this hypothesis was not directly tested.



2005 ◽  
Vol 114 (11) ◽  
pp. 886-893 ◽  
Author(s):  
Li Xu ◽  
Teresa A. Zwolan ◽  
Catherine S. Thompson ◽  
Bryan E. Pfingst

Objectives: The present study was performed to evaluate the efficacy and clinical feasibility of using monopolar stimulation with the Clarion Simultaneous Analog Stimulation (SAS) strategy in patients with cochlear implants. Methods: Speech recognition by 10 Clarion cochlear implant users was evaluated by means of 4 different speech processing strategy/electrode configuration combinations; ie, SAS and Continuous Interleaved Sampling (CIS) strategies were each used with monopolar (MP) and bipolar (BP) electrode configurations. The test measures included consonants, vowels, consonant-nucleus-consonant words, and Hearing in Noise Test sentences with a +10 dB signal-to-noise ratio. Additionally, subjective judgments of sound quality were obtained for each strategy/configuration combination. Results: All subjects but 1 demonstrated open-set speech recognition with the SAS/MP combination. The group mean Hearing in Noise Test sentence score for the SAS/MP combination was 31.6% (range, 0% to 92%) correct, as compared to 25.0%, 46.7%, and 37.8% correct for the CIS/BP, CIS/MP, and SAS/BP combinations, respectively. Intersubject variability was high, and there were no significant differences in mean speech recognition scores or mean preference ratings among the 4 strategy/configuration combinations tested. Individually, the best speech recognition performance was with the subject's everyday strategy/configuration combination in 72% of the applicable cases. If the everyday strategy was excluded from the analysis, the subjects performed best with the SAS/MP combination in 37.5% of the remaining cases. Conclusions: The SAS processing strategy with an MP electrode configuration gave reasonable speech recognition in most subjects, even though subjects had minimal previous experience with this strategy/configuration combination. The SAS/MP combination might be particularly appropriate for patients for whom a full dynamic range of electrical hearing could not be achieved with a BP configuration.



Sign in / Sign up

Export Citation Format

Share Document