dialect recognition Latest Research Papers

Dialect recognition is one of the most attentive topics in the speech analysis area. Machine learning algorithms have been widely used to identify dialects. In this paper, a model that based on three different 1D Convolutional Neural Network (CNN) structures is developed for Kurdish dialect recognition. This model is evaluated, and CNN structures are compared to each other. The result shows that the proposed model has outperformed the state of the art. The model is evaluated on the experimental data that have been collected by the staff of department of computer science at the University of Halabja. Three dialects are involved in the dataset as the Kurdish language consists of three major dialects, namely Northern Kurdish (Badini variant), Central Kurdish (Sorani variant), and Hawrami. The advantage of the CNN model is not required to concern handcraft as the CNN model is featureless. According to the results, the 1 D CNN method can make predictions with an average accuracy of 95.53% on the Kurdish dialect classification. In this study, a new method is proposed to interpret the closeness of the Kurdish dialects by using a confusion matrix and a non-metric multi-dimensional visualization technique. The outcome demonstrates that it is straightforward to cluster given Kurdish dialects and linearly isolated from the neighboring dialects.

Download Full-text

Dialect recognition from Telugu speech utterances using spectral and prosodic features

International Journal of Speech Technology ◽

10.1007/s10772-021-09854-8 ◽

2021 ◽

Author(s):

S. Shivaprasad ◽

M. Sadanandam

Keyword(s):

Prosodic Features ◽

Dialect Recognition ◽

Spectral And Prosodic Features

Download Full-text

Klasifikasi Dialek Pengujar Bahasa Inggris Menggunakan Random Forest

JURNAL MEDIA INFORMATIKA BUDIDARMA ◽

10.30865/mib.v5i2.2754 ◽

2021 ◽

Vol 5 (2) ◽

pp. 439

Author(s):

Muhamad Azhar ◽

Hilman Ferdinandus Pardede

Keyword(s):

Machine Learning ◽

Speech Recognition ◽

Random Forest ◽

Recognition Performance ◽

Learning Technology ◽

Random Forest Algorithm ◽

Research Fields ◽

Grid Search Method ◽

Dialect Recognition ◽

Archive Data

Speech recognition is one of the important research fields which is currently widely used for various applications. However, speech recognition performance is affected by the dialect of the speaker. Therefore, dialect recognition is often used as an additional feature in speech recognition. The process of recognizing dialects is not easy. Currently, Machine Learning technology is widely applied in dialect recognition. One of the challenges in the introduction of machine learning-based dialects is the imbalance of classes and overlaps in a wide variety of classification techniques. This study applies Random Forest-based oversampling technology for dialect recognition. For hyper-parameter optimization of the random forest algorithm, we apply the Grid Search method. Experiments on Speech Accent Archive data using the MFCC feature resulted in an accuracy of 0.91 and AUC of 0.95

Download Full-text

Sounding like a father: The influence of regional dialect on perceptions of masculinity and fatherhood

Language in Society ◽

10.1017/s0047404520000925 ◽

2021 ◽

pp. 1-24

Author(s):

Sara King ◽

Yi Ren ◽

Kaori Idemaru ◽

Cindi Sturtzsreetharan

Keyword(s):

Native Speakers ◽

Visual Media ◽

Japanese Language ◽

Language Variety ◽

Recognition Experiment ◽

The Social ◽

Social Meanings ◽

Regional Dialect ◽

Western Regional ◽

Dialect Recognition

Abstract Previous work on the Osaka dialect (OD) collectively suggests that this western regional variant of Japanese is associated with informality, masculinity, and affective fatherhood—social meanings that can be recruited in the construction of audio-visual media personas. This study examines the use of OD by one protagonist in the film Soshite chichi ni naru/Like father, like son, as well as the social meanings that listeners attribute to this variety of Japanese. Specifically, we ask two questions: (i) to what extent is the production of OD in the film recognizable to native speakers of Japanese, and (ii) what qualities do Japanese language users attribute to OD? A dialect recognition experiment found low recognizability of OD but high recognizability of a general ‘nonstandard Japanese’ language variety. Qualitative data revealed that Japanese language users perceived OD to index various characteristics including that of a masculine, affective father. (Perception, dialect, fatherhood, Osaka dialect, indexicality)*

Download Full-text

Kurdish Spoken Dialect Recognition Using X-Vector Speaker Embedding

10.1007/978-3-030-87802-3_5 ◽

2021 ◽

pp. 50-57

Author(s):

Arash Amani ◽

Mohammad Mohammadamini ◽

Hadi Veisi

Keyword(s):

Dialect Recognition

Download Full-text

Chinese Dialect Recognition Using X-vector with Multi-task Learning

Journal of Physics Conference Series ◽

10.1088/1742-6596/1624/3/032007 ◽

2020 ◽

Vol 1624 ◽

pp. 032007

Author(s):

Jingzhe Yang ◽

Tongrui Yu

Keyword(s):

Task Learning ◽

Chinese Dialect ◽

Dialect Recognition

Download Full-text

Spoken Arabic dialect recognition using X-vectors

Natural Language Engineering ◽

10.1017/s1351324920000091 ◽

2020 ◽

Vol 26 (6) ◽

pp. 691-700

Author(s):

Abualsoud Hanani ◽

Rabee Naser

Keyword(s):

Speaker Recognition ◽

Training And Development ◽

Vector System ◽

Identification System ◽

Shared Task ◽

Testing Dataset ◽

Phonetic Features ◽

Dialect Recognition ◽

Spoken Arabic ◽

Modern Standard

AbstractThis paper describes our automatic dialect identification system for recognizing four major Arabic dialects, as well as Modern Standard Arabic. We adapted the X-vector framework, which was originally developed for speaker recognition, to the task of Arabic dialect identification (ADI). The training and development ADI VarDial 2018 and VarDial 2017 were used to train and test all of our ADI systems. In addition to the introduced X-vectors, other systems use the traditional i-vectors, bottleneck features, phonetic features, words transcriptions, and GMM-tokens. X-vectors achieved good performance (0.687) on the ADI 2018 Discriminating between Similar Languages shared task testing dataset, outperforming other systems. The performance of the X-vector system is slightly improved (0.697) when fused with i-vectors, bottleneck features, and word uni-gram features.

Download Full-text