Arabic Speech Recognition System Based on CMUSphinx

Phonetic dictionaries are pivotal components of speech recognition systems. The aim of speech recognition research is to build a machine that accurately recognizes normal human speech from any speaker. The literature identifies Arabic phonetics as one of the major problems in Arabic speech recognition. This paper therefore reviews previous studies that address the challenges of building an Arabic phonetic dictionary for Arabic speech recognition. The review finds that speech recognition systems have investigated areas of difference in Arabic phonetics. It also argues that an Arabic phonetic dictionary should be built in which the phonemes of Arabic vowels are treated as a component of the consonants' phonemes. Incorporating well-developed machine translation systems may further enhance the quality of the system. The paper concludes with the challenges that Arabic phonetic dictionaries still face.

Arabic Speech Recognition System Through VQLBG Algorithm Using Matlab

10.46514/1971-000-011-007 ◽

2020 ◽

pp. 1

Author(s):

Mowaffak O. A. Al Baraq

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Arabic Speech Recognition

The development of acoustic models for command and control Arabic speech recognition system

International Conference on Electrical, Electronic and Computer Engineering, 2004. ICEEC '04. ◽

10.1109/iceec.2004.1374575 ◽

2005 ◽

Cited By ~ 2

Author(s):

M. Nofal ◽

E. Abdel Reheem ◽

H. El Henawy ◽

N. Abdel Kader

Keyword(s):

Speech Recognition ◽

Command And Control ◽

Recognition System ◽

Speech Recognition System ◽

Acoustic Models ◽

Arabic Speech Recognition

Improved Arabic speech recognition system through the automatic generation of fine-grained phonetic transcriptions

Information Processing & Management ◽

10.1016/j.ipm.2017.07.002 ◽

2019 ◽

Vol 56 (2) ◽

pp. 343-353 ◽

Cited By ~ 5

Author(s):

Eiman Alsharhan ◽

Allan Ramsay

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Automatic Generation ◽

Speech Recognition System ◽

Fine Grained ◽

Arabic Speech Recognition

Constructing accurate and robust HMM/GMM models for an Arabic speech recognition system

International Journal of Speech Technology ◽

10.1007/s10772-017-9456-7 ◽

2017 ◽

Vol 20 (4) ◽

pp. 937-949 ◽

Cited By ~ 5

Author(s):

Mohamed O. M. Khelifa ◽

Yahya Mohamed Elhadj ◽

Yousfi Abdellah ◽

Mostafa Belkasmi

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Arabic Speech Recognition

Arabic Phonetic Dictionaries for Speech Recognition

Journal of Information Technology Research ◽

10.4018/jitr.2009062905 ◽

2009 ◽

Vol 2 (4) ◽

pp. 67-80 ◽

Cited By ~ 15

Author(s):

Mohamed Ali ◽

Moustafa Elshafei ◽

Mansour Al-Ghamdi ◽

Husni Al-Muhtaseb

Keyword(s):

Speech Recognition ◽

Language Model ◽

Recognition System ◽

Speech Recognition System ◽

Broadcast News ◽

Word Error Rate ◽

Large Vocabulary ◽

Essential Components ◽

News Corpus ◽

Arabic Speech Recognition

Phonetic dictionaries are essential components of large-vocabulary, speaker-independent speech recognition systems. This paper presents a rule-based technique to generate phonetic dictionaries for a large-vocabulary Arabic speech recognition system. The system used conventional Arabic pronunciation rules, common pronunciation rules of Modern Standard Arabic, and some common dialectal cases. The paper explains these rules in detail and gives their formal mathematical presentation. The rules were used to generate a dictionary for a 5.4-hour corpus of broadcast news. The rules and the phone set were tested and evaluated on an Arabic speech recognition system. The system was trained on 4.3 hours of the 5.4-hour Arabic broadcast news corpus and tested on the remaining 1.1 hours. The phonetic dictionary contains 23,841 definitions corresponding to about 14,232 words. The language model contains both bigrams and trigrams. The word error rate (WER) was 9.0%.
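
The abstract reports a word error rate but does not say how it was scored. As a minimal sketch, assuming the standard definition (substitutions plus deletions plus insertions, divided by the number of reference words), WER can be computed from a word-level edit distance. The function name `word_error_rate` and the sample sentences are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: whitespace tokenization and the standard WER definition;
    the paper's own scoring procedure is not described in the abstract.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical placeholder tokens: one substitution and one deletion
    # against a four-word reference.
    print(word_error_rate("w1 w2 w3 w4", "w1 wX w3"))
```

Under this definition, a WER of 9.0% corresponds to roughly nine word-level errors per hundred reference words in the 1.1-hour test set.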
