Bangladeshi Bangla Speech Corpus for Automatic Speech Recognition Research

This paper introduces a speech corpus which is developed for Myanmar Automatic Speech Recognition (ASR) research. Automatic Speech Recognition (ASR) research has been conducted by the researchers around the world to improve their language technologies. Speech corpora are important in developing the ASR and the creation of the corpora is necessary especially for low-resourced languages. Myanmar language can be regarded as a low-resourced language because of lack of pre-created resources for speech processing research. In this work, a speech corpus named UCSY-SC1 (University of Computer Studies Yangon - Speech Corpus1) is created for Myanmar ASR research. The corpus consists of two types of domain: news and daily conversations. The total size of the speech corpus is over 42 hrs. There are 25 hrs of web news and 17 hrs of conversational recorded data.<br />The corpus was collected from 177 females and 84 males for the news data and 42 females and 4 males for conversational domain. This corpus was used as training data for developing Myanmar ASR. Three different types of acoustic models such as Gaussian Mixture Model (GMM) - Hidden Markov Model (HMM), Deep Neural Network (DNN), and Convolutional Neural Network (CNN) models were built and compared their results. Experiments were conducted on different data sizes and evaluation is done by two test sets: TestSet1, web news and TestSet2, recorded conversational data. It showed that the performance of Myanmar ASRs using this corpus gave satisfiable results on both test sets. The Myanmar ASR using this corpus leading to word error rates of 15.61% on TestSet1 and 24.43% on TestSet2.<br /><br />

Download Full-text

Chhattisgarhi speech corpus for research and development in automatic speech recognition

International Journal of Speech Technology ◽

10.1007/s10772-018-9496-7 ◽

2018 ◽

Vol 21 (2) ◽

pp. 193-210 ◽

Cited By ~ 2

Author(s):

Narendra D. Londhe ◽

Ghanahshyam B. Kshirsagar

Keyword(s):

Speech Recognition ◽

Research And Development ◽

Automatic Speech Recognition ◽

Speech Corpus

Download Full-text

Creation of Marathi speech corpus for automatic speech recognition

2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE) ◽

10.1109/icsda.2013.6709893 ◽

2013 ◽

Cited By ~ 6

Author(s):

Santosh Gaikwad ◽

Bharti Gawali ◽

Suresh Mehrotra

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Speech Corpus

Download Full-text

Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System

International Journal of Computer Applications ◽

10.5120/12929-9841 ◽

2013 ◽

Vol 74 (11) ◽

pp. 20-22 ◽

Cited By ~ 1

Author(s):

Aaron M.Oirere ◽

Ratnadeep R. Deshmukh ◽

Pukhraj P. Shrishrimal

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Speech Corpus ◽

Automatic Speech Recognition System

Download Full-text

Indonesian audio-visual speech corpus for multimodal automatic speech recognition

2017 International Conference on Advanced Computer Science and Information Systems (ICACSIS) ◽

10.1109/icacsis.2017.8355062 ◽

2017 ◽

Cited By ~ 3

Author(s):

Muhammad Rizki Aulia Rahman Maulana ◽

Mohamad Ivan Fanany

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Visual Speech ◽

Speech Corpus

Download Full-text

Towards a continuous speech corpus for banking domain automatic speech recognition

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) ◽

10.1109/sped.2017.7990436 ◽

2017 ◽

Cited By ~ 1

Author(s):

George Suciu ◽

Stefan-Adrian Toma ◽

Romulus Cheveresan

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Continuous Speech ◽

Speech Corpus

Download Full-text

An automatic speech recognition system for spontaneous Punjabi speech corpus

International Journal of Speech Technology ◽

10.1007/s10772-017-9408-2 ◽

2017 ◽

Vol 20 (2) ◽

pp. 297-303 ◽

Cited By ~ 7

Author(s):

Yogesh Kumar ◽

Navdeep Singh

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Speech Corpus ◽

Automatic Speech Recognition System

Download Full-text

Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights

10.18653/v1/2021.findings-acl.447 ◽

2021 ◽

Author(s):

Devaraja Adiga ◽

Rishabh Kumar ◽

Amrith Krishna ◽

Preethi Jyothi ◽

Ganesh Ramakrishnan ◽

...

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Speech Corpus

Download Full-text

Improving Deep Learning based Automatic Speech Recognition for Gujarati

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3483446 ◽

2022 ◽

Vol 21 (3) ◽

pp. 1-18

Author(s):

Deepang Raval ◽

Vyom Pathak ◽

Muktan Patel ◽

Brijesh Bhatt

Keyword(s):

Deep Learning ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Short Term Memory ◽

Language Model ◽

Recognition System ◽

Processing Technique ◽

Speech Corpus ◽

Novel Approach ◽

Asr System

We present a novel approach for improving the performance of an End-to-End speech recognition system for the Gujarati language. We follow a deep learning-based approach that includes Convolutional Neural Network, Bi-directional Long Short Term Memory layers, Dense layers, and Connectionist Temporal Classification as a loss function. To improve the performance of the system with the limited size of the dataset, we present a combined language model (Word-level language Model and Character-level language model)-based prefix decoding technique and Bidirectional Encoder Representations from Transformers-based post-processing technique. To gain key insights from our Automatic Speech Recognition (ASR) system, we used the inferences from the system and proposed different analysis methods. These insights help us in understanding and improving the ASR system as well as provide intuition into the language used for the ASR system. We have trained the model on the Microsoft Speech Corpus, and we observe a 5.87% decrease in Word Error Rate (WER) with respect to base-model WER.

Download Full-text

CTIMIT: a speech corpus for the cellular environment with applications to automatic speech recognition

1995 International Conference on Acoustics, Speech, and Signal Processing ◽

10.1109/icassp.1995.479284 ◽

2002 ◽

Cited By ~ 14

Author(s):

K.L. Brown ◽

E.B. George

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Speech Corpus ◽

Cellular Environment

Download Full-text

Bangladeshi Bangla Speech Corpus for Automatic Speech Recognition Research

UCSY-SC1: A Myanmar speech corpus for automatic speech recognition

Chhattisgarhi speech corpus for research and development in automatic speech recognition

Creation of Marathi speech corpus for automatic speech recognition

Development of Isolated Numeric Speech Corpus for Swahili Language for Development of Automatic Speech Recognition System

Indonesian audio-visual speech corpus for multimodal automatic speech recognition

Towards a continuous speech corpus for banking domain automatic speech recognition

An automatic speech recognition system for spontaneous Punjabi speech corpus

Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights

Improving Deep Learning based Automatic Speech Recognition for Gujarati

CTIMIT: a speech corpus for the cellular environment with applications to automatic speech recognition

Export Citation Format