Development and analysis of Punjabi ASR system for mobile phones under different acoustic models

2019 ◽  
Vol 22 (1) ◽  
pp. 219-230 ◽  
Author(s):  
Puneet Mittal ◽  
Navdeep Singh
Author(s):  
Ankit Kumar ◽  
Rajesh Kumar Aggarwal

Background: In India, thousands of languages or dialects are in use. Most Indian dialects are low asset dialects. A well-performing Automatic Speech Recognition (ASR) system for Indian languages is unavailable due to a lack of resources. Hindi is one of them as large vocabulary Hindi speech datasets are not freely available. We have only a few hours of transcribed Hindi speech dataset. There is a lot of time and money involved in creating a well-transcribed speech dataset. Thus, developing a real-time ASR system with a few hours of the training dataset is the most challenging task. The different techniques like data augmentation, semi-supervised training, multilingual architecture, and transfer learning, have been reported in the past to tackle the fewer speech data issues. In this paper, we examine the effect of multilingual acoustic modeling in ASR systems for the Hindi language. Objective: This article’s objective is to develop a high accuracy Hindi ASR system with a reasonable computational load and high accuracy using a few hours of training data. Method: To achieve this goal we used Multilingual training with Time Delay Neural Network- Bidirectional Long Short Term Memory (TDNN-BLSTM) acoustic modeling. Multilingual acoustic modeling has significantly improved the ASR system's performance for low and limited resource languages. The common practice is to train the acoustic model by merging data from similar languages. In this work, we use three Indian languages, namely Hindi, Marathi, and Bengali. Hindi with 2.5 hours of training data and Marathi with 5.5 hours of training data and Bengali with 28.5 hours of transcribed data, was used in this work to train the proposed model. Results: The Kaldi toolkit was used to perform all the experiments. The paper is investigated over three main points. First, we present the monolingual ASR system using various Neural Network (NN) based acoustic models. Second, we show that Recurrent Neural Network (RNN) language modeling helps to improve the ASR performance further. Finally, we show that a multilingual ASR system significantly reduces the Word Error Rate (WER) (absolute 2% WER reduction for Hindi and 3% for the Marathi language). In all the three languages, the proposed TDNN-BLSTM-A multilingual acoustic models help to get the lowest WER. Conclusion: The multilingual hybrid TDNN-BLSTM-A architecture shows a 13.67% relative improvement over the monolingual Hindi ASR system. The best WER of 8.65% was recorded for Hindi ASR. For Marathi and Bengali, the proposed TDNN-BLSTM-A acoustic model reports the best WER of 30.40% and 10.85%.


Pathology ◽  
2001 ◽  
Vol 33 (3) ◽  
pp. 269-270
Author(s):  
Clive G. Harper ◽  
Victor K. Lee
Keyword(s):  

2014 ◽  
Vol 35 (3) ◽  
pp. 158-165 ◽  
Author(s):  
Christian Montag ◽  
Konrad Błaszkiewicz ◽  
Bernd Lachmann ◽  
Ionut Andone ◽  
Rayna Sariyska ◽  
...  

In the present study we link self-report-data on personality to behavior recorded on the mobile phone. This new approach from Psychoinformatics collects data from humans in everyday life. It demonstrates the fruitful collaboration between psychology and computer science, combining Big Data with psychological variables. Given the large number of variables, which can be tracked on a smartphone, the present study focuses on the traditional features of mobile phones – namely incoming and outgoing calls and SMS. We observed N = 49 participants with respect to the telephone/SMS usage via our custom developed mobile phone app for 5 weeks. Extraversion was positively associated with nearly all related telephone call variables. In particular, Extraverts directly reach out to their social network via voice calls.


2011 ◽  
Author(s):  
Christopher S. Walsh ◽  
Tom Power
Keyword(s):  

2006 ◽  
Author(s):  
Alessandra Preziosa ◽  
Marta Bassi ◽  
Daniela Villani ◽  
Andrea Gaggioli ◽  
Giuseppe Riva

Author(s):  
Huyen Thi Nguyen ◽  
Ngoc Minh Nguyen

The purpose of this study is to examine the effect of prestige sensitivity on mobile phone customer’s price acceptance in Vietnam and the mediating role of product knowledge and price mavenism on this relationship. We used the convenience sampling method for data collection via questionnaires with a sample of 605 consumers who purchased mobile phones. The collected data was analysed by applying a structural equation modelling method. The result indicates that prestige sensitivity has both direct and indirect effects on price acceptance via product knowledge and price mavenism. The findings suggest that prestige sensitivity can be used as a market segmentation criterion for mobile phones when making price decisions and providing customers with adequate information could improve price acceptance.


Sign in / Sign up

Export Citation Format

Share Document