AISpeech-SJTU Accent Identification System for the Accented English Speech Recognition Challenge

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414292 ◽

2021 ◽

Author(s):

Houjun Huang ◽

Xu Xiang ◽

Yexin Yang ◽

Rao Ma ◽

Yanmin Qian

Keyword(s):

Speech Recognition ◽

Identification System ◽

Accented English ◽

Accent Identification

Download Full-text

Speech Recognition of Moroccan Dialect Using Hidden Markov Models

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v8.i1.pp7-13 ◽

2019 ◽

Vol 8 (1) ◽

pp. 7 ◽

Cited By ~ 1

Author(s):

Mouaz Bezoui

Keyword(s):

Speech Recognition ◽

Euclidean Distance ◽

Speaker Identification ◽

Arab World ◽

Training Session ◽

Arab Countries ◽

Identification System ◽

Training Phase ◽

Standard Arabic ◽

Testing Phase

<p>This paper addresses the development of an Automatic Speech Recognition (ASR) system for the Moroccan Dialect. Dialectal Arabic (DA) refers to the day-to-day vernaculars spoken in the Arab world. In fact, Moroccan Dialect is very different from the Modern Standard Arabic (MSA) because it is highly influenced by the French Language. It is observed throughout all Arab countries that standard Arabic widely written and used for official speech, news papers, public administration and school but not used in everyday conversation and dialect is widely spoken in everyday life but almost never written. we propose to use the Mel Frequency Cepstral Coefficient (MFCC) features to specify the best speaker identification system. The extracted speech features are quantized to a number of centroids using vector quantization algorithm. These centroids constitute the codebook of that speaker. MFCC’s are calculated in training phase and again in testing phase. Speakers uttered same words once in a training session and once in a testing session later. The Euclidean distance between the MFCC’s of each speaker in training phase to the centroids of individual speaker in testing phase is measured and the speaker is identified according to the minimum Euclidean distance. The code is developed in the MATLAB environment and performs the identification satisfactorily.</p>

Download Full-text

Automatic speech recognition of multiple accented English data

10.21437/interspeech.2010-477 ◽

2010 ◽

Author(s):

Dimitra Vergyri ◽

Lori Lamel ◽

Jean-Luc Gauvain

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Accented English

Download Full-text

Speech accent identification and speech recognition enhancement by speaker accent adaptation

10.22215/etd/2005-07797 ◽

2005 ◽

Author(s):

Mohammad Tanabian

Keyword(s):

Speech Recognition ◽

Accent Identification

Download Full-text

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9413386 ◽

2021 ◽

Author(s):

Xian Shi ◽

Fan Yu ◽

Yizhou Lu ◽

Yuhao Liang ◽

Qiangze Feng ◽

...

Keyword(s):

Speech Recognition ◽

Accented English ◽

Open Datasets

Download Full-text

INDIVIDUAL IDENTIFICATION SYSTEM DESIGN THROUGH VOICE USING LINEAR PREDICTIVE CODING METHOD AND K-NEAREST NEIGHBOR

Jurnal Teknik Informatika (Jutif) ◽

10.20884/1.jutif.2021.2.2.71 ◽

2021 ◽

Vol 2 (2) ◽

pp. 95-100

Author(s):

Davita Nadia Fadhilah ◽

Rita Magdalena ◽

Sofia Sa’idah

Keyword(s):

Speech Recognition ◽

Nearest Neighbor ◽

Predictive Coding ◽

Voice Recognition ◽

Individual Identification ◽

Identification System ◽

K Nearest Neighbor ◽

Linear Predictive Coding ◽

Distance Method ◽

K Value

Humans have a variety of characteristics that are different from one another. Characteristics possessed by humans are genuine which can be used as a differentiator between one individual and another, one of which is sound. Voice recognition is called speech recognition. In this study, it was developed as an individual voice recognition system using a combination of the Linear Predictive Coding (LPC) method of feature extraction and K-Nearest Neighbor (K-NN) classification in the speech recognition process. Testing is done by testing changes in several parameters, namely the LPC order value, the number of frames, the K value, and different distance methods. The results of the parameter combination test showed a fairly good presentation of 73.56321839% with the combination parameter or LPC 8, the number of frames 480, the value of K 5, with the distance method used by Chebychev.

Download Full-text

Dyn-Asr: Compact, Multilingual Speech Recognition Via Spoken Language And Accent Identification

10.1109/wf-iot51360.2021.9594961 ◽

2021 ◽

Author(s):

Sangeeta Ghangam Manepalli ◽

Daniel Whitenack ◽

Joshua Nemecek

Keyword(s):

Speech Recognition ◽

Spoken Language ◽

Multilingual Speech Recognition ◽

Accent Identification

Download Full-text

An End-to-End Dialect Identification System with Transfer Learning from a Multilingual Automatic Speech Recognition Model

10.21437/interspeech.2021-374 ◽

2021 ◽

Author(s):

Ding Wang ◽

Shuaishuai Ye ◽

Xinhui Hu ◽

Sheng Li ◽

Xinkang Xu

Keyword(s):

Speech Recognition ◽

Transfer Learning ◽

Automatic Speech Recognition ◽

Identification System ◽

Recognition Model ◽

End To End

Download Full-text

A Study of Machine Learning Algorithms in Speech Recognition and Language Identification System

Innovations in Computer Science and Engineering - Lecture Notes in Networks and Systems ◽

10.1007/978-981-33-4543-0_54 ◽

2021 ◽

pp. 503-513

Author(s):

Aakansha Mathur ◽

Razia Sultana

Keyword(s):

Machine Learning ◽

Speech Recognition ◽

Learning Algorithms ◽

Language Identification ◽

Machine Learning Algorithms ◽

Identification System

Download Full-text

AISpeech-SJTU ASR System for the Accented English Speech Recognition Challenge

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414471 ◽

2021 ◽

Author(s):

Tian Tan ◽

Yizhou Lu ◽

Rao Ma ◽

Sen Zhu ◽

Jiaqi Guo ◽

...

Keyword(s):

Speech Recognition ◽

Accented English ◽

Asr System

Download Full-text

Computational Approaches to Exploring Persian-Accented English

Research in Language ◽

10.1515/rela-2015-0012 ◽

2015 ◽

Vol 13 (1) ◽

pp. 51-60

Author(s):

Corey Miller

Keyword(s):

Speech Recognition ◽

Computational Approaches ◽

Accented English ◽

L2 Pronunciation

Methods involving phonetic speech recognition are discussed for detecting Persianaccented English. These methods offer promise for both the identification and mitigation of L2 pronunciation errors. Pronunciation errors, both segmental and suprasegmental, particular to Persian speakers of English are discussed.

Download Full-text