Greek Verbs and User Friendliness in the Speech Recognition and the Speech Production Module of Dialog Systems for the Broad Public

Children with a cochlear implant: Characteristics and determinants of speech recognition, speech-recognition growth rate, and speech production

International Journal of Audiology ◽

10.1080/14992020601182891 ◽

2007 ◽

Vol 46 (5) ◽

pp. 232-243 ◽

Cited By ~ 41

Author(s):

Ona Bø Wie ◽

Eva-Signe Falkenberg ◽

Ole Tvete ◽

Bruce Tomblin

Keyword(s):

Growth Rate ◽

Speech Recognition ◽

Cochlear Implant ◽

Speech Production

Download Full-text

Real-time large vocabulary spontaneous speech recognition for spoken dialog systems

2011 4th International Congress on Image and Signal Processing ◽

10.1109/cisp.2011.6100773 ◽

2011 ◽

Cited By ~ 1

Author(s):

Jan Svec ◽

Lubos Smidl

Keyword(s):

Speech Recognition ◽

Real Time ◽

Spontaneous Speech ◽

Spoken Dialog Systems ◽

Large Vocabulary ◽

Dialog Systems

Download Full-text

Utilizing relationships between named entities to improve speech recognition in dialog systems

2010 IEEE Spoken Language Technology Workshop ◽

10.1109/slt.2010.5700822 ◽

2010 ◽

Author(s):

Shajith Ikbal ◽

Om D Deshmukh ◽

Karthik Visweswariah ◽

Ashish Verma

Keyword(s):

Speech Recognition ◽

Named Entities ◽

Dialog Systems

Download Full-text

Localization of speech recognition in spoken dialog systems: how machine translation can make our lives easier

10.21437/interspeech.2009-450 ◽

2009 ◽

Author(s):

David Suendermann ◽

Jackson Liscombe ◽

Krishna Dayanidhi ◽

Roberto Pieraccini

Keyword(s):

Speech Recognition ◽

Machine Translation ◽

Spoken Dialog Systems ◽

Dialog Systems

Download Full-text

Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study

10.21437/interspeech.2020-1508 ◽

2020 ◽

Author(s):

Karthik Gopalakrishnan ◽

Behnam Hedayatnia ◽

Longshaokan Wang ◽

Yang Liu ◽

Dilek Hakkani-Tür

Keyword(s):

Speech Recognition ◽

Empirical Study ◽

Open Domain ◽

Dialog Systems ◽

Recognition Errors

Download Full-text

Improved Speech Command Classification System for Sinhala Language based on Automatic Speech Recognition

International Journal of Asian Language Processing ◽

10.1142/s2717554520500095 ◽

2020 ◽

pp. 2050009

Author(s):

Lakshika Kavmini ◽

Thilini Dinushika ◽

Uthayasanker Thayasivam ◽

Sanath Jayasena

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Error Rate ◽

Classification System ◽

Value Added ◽

Gaussian Mixture ◽

Conversational Agents ◽

Language Resources ◽

Dialog Systems ◽

The Individual

The recent advancements in conversational Artificial Intelligence (AI) are fastly getting integrated with every realm of human lives. Conversational agents who can learn, understand human languages and mimic the human thinking process have already created a revolution in human lifestyle. Understanding the intention of a speaker from his natural speech is a significant step in conversational AI. A major challenge that hinders the efficacy of this process is the lack of language resources. In this research, we address this issue and develop a domain-specific speech command classification system for the Sinhala language, one of the low-resourced languages. An effective speech command classification system can be utilized in several value added applications such as speech dialog systems. Our speech command classification system is developed by integrating Automatic Speech Recognition (ASR) and Natural Language Understanding (NLU). The ASR engine is implemented using Gaussian Mixture Model-Hidden Markov Model (GMM-HMM) and it converts a Sinhala speech command into a corresponding text representation. The text classifier, which is implemented as an ensemble unit of several classifiers, predicts the intent of the speaker when provided with the above text output. In this paper, we discuss and evaluate various algorithms and techniques that can be utilized to optimize the performance of both the ASR and text classifier. As well, we present our novel Sinhala speech data corpus of 4.15[Formula: see text]h which is based on the banking domain. As the final outcome, our system reports its Sinhala speech command classification accuracy as 91.03%. It shows that our system outperforms the state-of-the-art speech-to-intent mapping systems developed for the Sinhala language. The individual evaluation on the ASR system reports a 9.91% Word Error Rate and a 19.95% Sentence Error Rate, suggesting the applicability of advanced speech recognition techniques despite the limited language resources. Finally, our findings deliver useful insights on further research in speech command classification in the low-resourced context.

Download Full-text

Spracherkennung in der manuellen Montage/Speech recognition in manual assembly

wt Werkstattstechnik online ◽

10.37544/1436-4980-2021-09-5 ◽

2021 ◽

Vol 111 (09) ◽

pp. 579-582

Author(s):

Daniel Schulte ◽

Martin Sudhoff ◽

Bernd Kuhlenkötter

Keyword(s):

Speech Recognition ◽

Data Acquisition ◽

Production Systems ◽

Assembly System ◽

Process Time ◽

User Friendliness ◽

Manual Assembly ◽

Design And Testing

In diesem Beitrag wird die Konzeption und Erprobung eines Systems zur Datenerfassung mittels Spracherkennung in der manuellen Montage beschrieben. Dieses wurde in einem realen Montagesystem in der Lern- und Forschungsfabrik (LFF) des Lehrstuhls für Produktionssysteme (LPS) zur Prozesszeitaufnahme eingesetzt. Anschließend wurde die Qualität der Daten sowie auf die Bedienerfreundlichkeit untersucht. Es konnte gezeigt werden, dass die Spracherkennung eine gute Ergänzung zur manuellen Datenerfassung darstellt.   This paper describes the design and testing of a system for data acquisition using speech recognition in manual assembly. This was used in a real assembly system in the Learning and Research Factory of the Chair of Production Systems for process time recording. Subsequently, the quality of the data as well as the user-friendliness were examined. It could be shown that speech recognition is a good complement to manual data acquisition.

Download Full-text

The Functional Anatomy of Speech Processing: From Auditory Cortex to Speech Recognition and Speech Production

fMRI ◽

10.1007/978-3-642-34342-1_9 ◽

2013 ◽

pp. 111-118

Author(s):

Gregory Hickok

Keyword(s):

Speech Recognition ◽

Auditory Cortex ◽

Speech Production ◽

Speech Processing ◽

Functional Anatomy

Download Full-text

Leveraging speech production knowledge for improved speech recognition

2009 IEEE Workshop on Automatic Speech Recognition & Understanding ◽

10.1109/asru.2009.5373368 ◽

2009 ◽

Cited By ~ 3

Author(s):

Abhijeet Sangwan ◽

John H.L. Hansen

Keyword(s):

Speech Recognition ◽

Speech Production ◽

Production Knowledge

Download Full-text

Speech recognition, speech production and speech intelligibility in children with hearing aids versus implanted children

International Journal of Pediatric Otorhinolaryngology ◽

10.1016/s0165-5876(98)00137-2 ◽

1999 ◽

Vol 47 (2) ◽

pp. 165-169 ◽

Cited By ~ 10

Author(s):

E Löhle ◽

S Frischmuth ◽

M Holm ◽

L Becker ◽

K Flamm ◽

...

Keyword(s):

Speech Recognition ◽

Speech Production ◽

Hearing Aids ◽

Speech Intelligibility

Download Full-text