scholarly journals Speech Communication and Signal Processing

Sadhana ◽  
2011 ◽  
Vol 36 (5) ◽  
pp. 551-553
Author(s):  
B YEGNANARAYANA
2019 ◽  
Vol 2019 ◽  
pp. 1-19 ◽  
Author(s):  
Vlado Delić ◽  
Zoran Perić ◽  
Milan Sečujski ◽  
Nikša Jakovljević ◽  
Jelena Nikolić ◽  
...  

Speech technologies have been developed for decades as a typical signal processing area, while the last decade has brought a huge progress based on new machine learning paradigms. Owing not only to their intrinsic complexity but also to their relation with cognitive sciences, speech technologies are now viewed as a prime example of interdisciplinary knowledge area. This review article on speech signal analysis and processing, corresponding machine learning algorithms, and applied computational intelligence aims to give an insight into several fields, covering speech production and auditory perception, cognitive aspects of speech communication and language understanding, both speech recognition and text-to-speech synthesis in more details, and consequently the main directions in development of spoken dialogue systems. Additionally, the article discusses the concepts and recent advances in speech signal compression, coding, and transmission, including cognitive speech coding. To conclude, the main intention of this article is to highlight recent achievements and challenges based on new machine learning paradigms that, over the last decade, had an immense impact in the field of speech signal processing.


2016 ◽  
Vol 27 (07) ◽  
pp. 527-540 ◽  
Author(s):  
Florian Wolters ◽  
Karolina Smeds ◽  
Erik Schmidt ◽  
Eva Kümmel Christensen ◽  
Christian Norup

Background: Evaluation of hearing-device signal-processing features is performed for research and development purposes, but also in clinical settings. Most people agree that the benefit experienced in a hearing-device user’s daily life is most important, but laboratory tests are popular since they can be performed uniformly for all participants in a study using sensitive outcome measures. In order to design laboratory tests that have the potential of indicating real-life benefit, there is a need for more information about the acoustic environments and listening situations encountered by hearing-device users as well as by normal-hearing people. Purpose: To investigate the acoustic environments and listening situations people encounter, and to provide a structured framework of common sound scenarios (CoSS) that can be used for instance when designing realistic laboratory tests. Research Design: A literature search was conducted. Extracted acoustic environments and listening situations were categorized using a context-based approach. A set of common sound scenarios was established based on the findings from the literature. Data Collection: A number of publications providing data on encountered acoustic environments and listening situations were identified. Focus was on studies including informants who reported or recorded information in field trials. Nine relevant references were found. In combination with data collected at our laboratory, 187 examples of acoustic environments or listening situations were found. Results: Based on the extracted data, a categorization approach based on context (intentions and tasks) was used when creating CoSS. Three intention categories, “speech communication,” “focused listening,” and “nonspecific” were divided into seven task categories. In each task category, two sound scenarios were described, creating in total 14 common sound scenarios. The literature search showed a general lack of studies investigating acoustic environments and listening situations, in particular studies where normal-hearing informants are included and studies performed outside North America and Western Europe. Conclusions: A structured framework was developed. Intentions and tasks constitute the main categories in the framework, and 14 common sound scenarios were selected and described. The framework can for instance be used when developing hearing-device signal-processing features, in the evaluation of such features in realistic laboratory tests, and for demonstration of feature effects to hearing-device wearers.


Entropy ◽  
2021 ◽  
Vol 23 (10) ◽  
pp. 1246
Author(s):  
Candy Olivia Mawalim ◽  
Masashi Unoki

Speech watermarking has become a promising solution for protecting the security of speech communication systems. We propose a speech watermarking method that uses the McAdams coefficient, which is commonly used for frequency harmonics adjustment. The embedding process was conducted, using bit-inverse shifting. We also developed a random forest classifier, using features related to frequency harmonics for blind detection. An objective evaluation was conducted to analyze the performance of our method in terms of the inaudibility and robustness requirements. The results indicate that our method satisfies the speech watermarking requirements with a 16 bps payload under normal conditions and numerous non-malicious signal processing operations, e.g., conversion to Ogg or MP4 format.


2007 ◽  
Vol 121 (6) ◽  
pp. 3269
Author(s):  
Adoram Erell ◽  
Avi Kleinstein

2016 ◽  
Vol 81 ◽  
pp. 1-29 ◽  
Author(s):  
Pejman Mowlaee ◽  
Rahim Saeidi ◽  
Yannis Stylianou

Author(s):  
Jean-Luc Starck ◽  
Fionn Murtagh ◽  
Jalal Fadili
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document