Speech Recognition Using Enhanced Features with Deep Belief Network for Real Time Application

Author(s):  
Gurpreet Kaur ◽  
Mohit Srivastava ◽  
Amod Kumar
Author(s):  
Jianmei Wang

The oral English teaching faces several common problems: the teaching method is very inefficient, and the learners are poor in oral English. The development of computer-aided language learning offers a possible solution to these problems. Based on techniques of speech recognition, cloud computing and deep learning, this paper applies the deep belief network (DBN) to recognize the speeches in oral English teaching, and establishes a multi-parameter evaluation model for the pronunciation quality of oral English among college students. The model combines the merits of subjective and objective evaluations, and assesses the pronunciation from four aspects: pitch, speech rate, rhythm and intonation. Finally, the proposed model was verified through speech recognition and pronunciation evaluation experiments on 26 non-English majors from a college. The results show that the proposed evaluation model output credible results, which are consistent with those of experts, as evidenced by consistency, neighbourhood consistency and Pearson correlation coefficient. The research provides a feasible way to evaluate the oral English proficiency of learners, laying the basis for improving the teaching and learning efficiency of oral English.


2013 ◽  
Vol 7 ◽  
Author(s):  
Peter O'Connor ◽  
Daniel Neil ◽  
Shih-Chii Liu ◽  
Tobi Delbruck ◽  
Michael Pfeiffer

Author(s):  
Kalamullah Ramli ◽  
Asril Jarin ◽  
Suryadi Suryadi

The performance of network-based speech recognition application is mainly determined by the availability of all speech data received on the server and also the realtimeness in delivering the recognition results from the server. On the basis of full-duplex speech recognition application this paper proposes a real-time application framework for web speech recognition using HTTP/2 protocol and Sender-Sent Events (SSE). A number of experiments were performed to compare the latency of both the application using HTTP/2 plus SSE and the full-duplex application using WebSocket. The results showed that the proposed framework offers better alternative for a web-based speech recognition than the framework using WebSocket.


Sign in / Sign up

Export Citation Format

Share Document