A Benchmark System for Indian Language Text Recognition

This paper presents a novel technique for context based numeral reading in Indian language text to speech systems. The model uses a set of rules to determine the context of the numeral pronunciation and is being integrated with the waveform concatenation technique to produce speech out of the input text in Indian languages. For this purpose, the three Indian languages Odia, Hindi and Bengali are considered. To analyze the performance of the proposed technique, a set of experiments are performed considering different context of numeral pronunciations and the results are compared with existing syllable-based technique. The results obtained from different experiments shows the effectiveness of the proposed technique in producing intelligible speech out of the entered text utterances compared to the existing technique even with very less storage and execution time.

Download Full-text

Indian Language Text Representation and Categorization Using Supervised Learning Algorithm

2014 International Conference on Intelligent Computing Applications ◽

10.1109/icica.2014.89 ◽

2014 ◽

Author(s):

M. Narayana Swamy ◽

M. Hanumanthappa ◽

N.M. Jyothi

Keyword(s):

Supervised Learning ◽

Learning Algorithm ◽

Text Representation ◽

Indian Language ◽

Language Text

Download Full-text

A speech enabled Indian language text to Braille transliteration system

2009 International Conference on Information and Communication Technologies and Development (ICTD) ◽

10.1109/ictd.2009.5426698 ◽

2009 ◽

Cited By ~ 5

Author(s):

Tirthankar Dasgupta ◽

Anupam Basu

Keyword(s):

Indian Language ◽

Language Text

Download Full-text

A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems

Advances in Intelligent Systems and Computing - Intelligent Computing, Communication and Devices ◽

10.1007/978-81-322-2009-1_59 ◽

2014 ◽

pp. 523-531 ◽

Cited By ~ 1

Author(s):

Soumya Priyadarsini Panda ◽

Ajit Kumar Nayak

Keyword(s):

Speech Synthesis ◽

Text To Speech ◽

Indian Language ◽

Rule Based ◽

Language Text

Download Full-text

A Unified Parser for Developing Indian Language Text to Speech Synthesizers

Text, Speech, and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-319-45510-5_59 ◽

2016 ◽

pp. 514-521 ◽

Cited By ~ 3

Author(s):

Arun Baby ◽

Nishanthi N.L. ◽

Anju Leela Thomas ◽

Hema A. Murthy

Keyword(s):

Text To Speech ◽

Indian Language ◽

Language Text

Download Full-text

Criteria and Algorithm for the Russian Language Text Recognition Based on the Frequency Characteristics Set

2018 XIV International Scientific-Technical Conference on Actual Problems of Electronics Instrument Engineering (APEIE) ◽

10.1109/apeie.2018.8545877 ◽

2018 ◽

Author(s):

Yuri A. Kotov ◽

Olga V. Sanina

Keyword(s):

Frequency Characteristics ◽

Russian Language ◽

Text Recognition ◽

The Russian Language ◽

Language Text

Download Full-text

Indian Language Text Representation and Categorization Using Supervised Learning Algorithm

International Journal of Web Technology ◽

10.20894/ijwt.104.002.002.004 ◽

2013 ◽

Vol 002 (002) ◽

pp. 40-44

Author(s):

M Narayana Swamy ◽

◽

M. Hanumanthappa ◽

Keyword(s):

Supervised Learning ◽

Learning Algorithm ◽

Text Representation ◽

Indian Language ◽

Language Text

Download Full-text

Machine Learning Techniques for Sentiment Analysis of Indian Languages

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1456.0982s1119 ◽

2019 ◽

Vol 8 (2S11) ◽

pp. 3630-3636

Keyword(s):

Machine Learning ◽

Natural Language Processing ◽

Sentiment Analysis ◽

Language Processing ◽

Machine Learning Techniques ◽

Indian Languages ◽

Indian Language ◽

Learning Techniques ◽

Textual Data ◽

Language Text

Sentiment Analysis is the domain of automatically understanding the emotions, feelings, opinions in a textual data. It is a way of understating how a product, brand, service, idea or an event is viewed by common people, customers and stakeholders. Sentiment Analysis Systems are used by politicians, business leaders, developers and researchers to infer useful information as per their specific needs. It is used in business decision making process to value the views of the customers. Sentiment analysis has become a hot topic of scientific and market research in the field of natural Language Processing. India is a large populated country and the number of Internet users is also huge. Most people share their experience in English. However, during the last decade, due to the accessibility of Internet and evolution in language modelling people express their views in their own native Indian language. With the increase in Indian language text, researchers find it quite fascinating to infer valuable information from this unstructured text data. A number of machine learning techniques have been applied on this textual data set. Basic concepts of Sentiment analysis shall be discussed with focus on Indian language text in this paper. Due to on availability of rich lexicon resources for unsupervised learning techniques and better evaluation measures for the Supervised learning techniques, the later become the first choice for researchers in the field of Natural Language Processing. A comparative analysis shall be made for various supervised machine learning techniques in the context of Indian languages.

Download Full-text

Sindhi Handwritten Text Recognition Using SVM

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/201032021 ◽

2021 ◽

Vol 10 (3) ◽

pp. 1627-1631

Keyword(s):

Feature Extraction ◽

Complex Problem ◽

Training Data ◽

Text Recognition ◽

Support Vector ◽

Text Data ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Text Feature ◽

Language Text

In Sindhi Language, handwritten text feature extraction is such a challenging task for all scholars, because different people write in different styles or manners, to analyze each text is such a complex problem. Feature extraction of text segmentation, classifying each character and labelling for training data to recognize text for different handwritings and testing for analyzing features of providing handwritten text data .In this research, SVM (support vector machine) is used for analyzing and tokenizing each character or word of Sindhi Language text and transform into suitable information with efficiency & accuracy. The research is not only useful for improving the knowledge of Sindhi Handwritten Text Recognition but it can be beneficial for other recognition systems

Download Full-text