Smart Reader for Visually Challenged Using Optical Character Recognition and Text-To-Speech

Assistive technology uses assistive, adaptive and rehabilitative devices for people with disabilities. It’s assessed there are about 36 million people with visual impairment in the world and a further 216 million who lead life with moderate to severe visual impairments. Leveraging technology has helped the visually challenged in carrying out tasks on par with the people blessed with vision particularly in the activities of reading and writing. In the proposed work, an image scanning device attached to a microcontroller is designed. This device is designed in the form of hand gloves for ease of usage. The glove with the camera at the fingertip, when rolled over lines of text, scans the information and converts it into digital text with Optical Character Recognition (OCR). The converted digital text is finally read aloud using Text-to-speech synthesis. The results obtained were accurate and met the standards of operability.

Download Full-text

Implementation of Optical Character Recognition Using Raspberry Pi for Visually Challenged Person

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i3.34.18718 ◽

2018 ◽

Vol 7 (3.34) ◽

pp. 65 ◽

Cited By ~ 2

Author(s):

S Thiyagarajan ◽

Dr G.Saravana Kumar ◽

E Praveen Kumar ◽

G Sakana

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Speech Synthesis ◽

Native Speaker ◽

Image Data ◽

Raspberry Pi ◽

Spoken Discourse ◽

Optical Character ◽

Digital Format ◽

Visually Challenged

Blind people are unable to perform visual tasks. The majority of published printed works does not include Braille or audio versions, and digital versions are still a minority. In this project, the technology of optical character recognition (OCR) enables the recognition of texts from image data. The system is constituted by the raspberry pi, HD camera and Bluetooth headset. This technology has been widely used in scanned or photographed documents, converting them into electronic copies. The technology of speech synthesis (TTS) enables a text in digital format to be synthesized into human voice and played through an audio system. The objective of the TTS is the automatic conversion of sentences, without restrictions, into spoken discourse in a natural language, resembling the spoken form of the same text, by a native speaker of the language.

Download Full-text

Optical Character Reader & Text To Speech Conversion using Correlations & Speech Synthesis

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.j7619.0891020 ◽

2020 ◽

Vol 9 (10) ◽

pp. 478-483

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Speech Synthesis ◽

Figurative Language ◽

Text To Speech ◽

Rate System ◽

Written Text ◽

Optical Character ◽

Effective Manner ◽

High Level

In the modern era of image processing, recognizing content or information from an image is process of electronic conversion into machine encoded text. Advanced systems that are capable of producing high accuracy for multi-font recognition are now becoming commonplace, and with the support of digital consent formatting. Some programs are able to retrieve formats that are very close to the original page including images, columns, and other non-text items. Proposed system is able to recognize text from an image and convert it into editable text along with speech conversion. System uses Correlation model for OCR (Optical Character Recognition) and Speech Synthesis for TTS (Text To Speech) conversion. Correlation is a measurement of the similarities between two similar objects such as the predefined alphabets and recognizing a combination of those alphabets from an image. Speech synthesis is an artificial expression of human speech. The computer program that has been used this feature is called a speech computer as well as speech synthesizer that can be implemented on the basis of software or hardware primitives. The text-to-speech system (TTS) converts a standard language text into a speech; some programs provide figurative language presentations such as typed text in speech. System is capable enough to acquire high level of accuracy with less false recognition. It is required to built an effective text scanner that can recognize text from an image with less error rate. System has been implemented in MATLAB and various pre-processing filters have been applied for better enhancement and extraction. Hand written text can also be recognized with an effective manner.

Download Full-text

Smart Glass for Visually Challenged Peoples to Read the Books using Raspberry Pi

International Journal of Advanced Research in Science, Communication and Technology ◽

10.48175/ijarsct-1842 ◽

2021 ◽

pp. 258-262

Author(s):

Anitha D B ◽

Jyothi T M ◽

Pooja R ◽

Sahana N

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Computer Software ◽

Raspberry Pi ◽

Text To Speech ◽

Smart Glasses ◽

Optical Character ◽

Text Reading ◽

Visually Challenged ◽

Audio Output

The objective of this paper is to presents new design on assistive smart glasses for visually impaired. The objective is to assist in multiple daily tasks using the advantage of wearable design format. The proposed method is a camera based assistive text reading to help to blind in person in reading the text present on the text labels, printed notes and products in their own respective languages. It combines the concept of Optical Character Recognition (OCR), text to Speech Synthesizer (TTS) and translator in Raspberry pi. Optical character recognition (OCR) is the identification of printed characters using photoelectric devices and computer software. It converts images of typed, handwritten or printed text into machine encoded text from scanned document or from subtitle text superimposed on an image. Text-to-Speech conversion is a method that scans and reads any language letters and numbers that are in the image using OCR technique and then translates it into any desired language and at last it gives audio output of the translated text. The audio output is heard through the raspberry pi's audio jack using speakers or earphones.

Download Full-text

Model for Converting PDF to Audio Format (Listen Your Book)

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.36522 ◽

2021 ◽

Vol 9 (VII) ◽

pp. 3203-3206

Author(s):

Shailendra Singh

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Efficient Technique ◽

Text To Speech ◽

Multiple Methods ◽

Optical Character ◽

Visually Impaired People ◽

The People ◽

Textual Content ◽

Text Images

The present paper has introduced an innovative and efficient technique that enables user to hear the contents of text images instead of reading through them. In the current world, there is a great increase in the utilization of digital technology and multiple methods are available for the people to capture images. such images may contain important textual content that the user may need to edit or store digitally. It merges the concept of Optical Character Recognition (OCR) and Text to Speech Synthesizer (TTS). This can be done using Optical Character Recognition with the use of Tesseract OCR Engine. OCR is a branch of AI that is used in applications to recognize text from scanned documents or images. The analyzed text can also be converted to audio format to help visually impaired people hear the content that they wish to know. Text-to-Speech conversion is a method that scans and reads alphabets and numbers that are in the image using OCR technique and convert it into voices. The aim is to study and compare the multiple methods used for STT conversions and to figure out the most efficient technique that can be adapted for the conversion processes. As a result, based on review study it is found that HMM is a statistical model which is most suitable for TTS conversions.

Download Full-text

Development of a Text-to-Speech Scanner for Visually Impaired People

Advances in Medical Technologies and Clinical Practice - Design and Development of Affordable Healthcare Technologies ◽

10.4018/978-1-5225-4969-7.ch010 ◽

2018 ◽

pp. 218-238

Author(s):

Minerva Sarma ◽

Anuskha Kumar ◽

Aditi Joshi ◽

Suraj Kumar Nayak ◽

Biswajeet Champaty

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Speech Synthesis ◽

Low Cost ◽

Raspberry Pi ◽

Text To Speech ◽

Visually Impaired People ◽

Text To Speech Synthesis ◽

Cost Efficient ◽

Blind Persons

In this chapter, a low-cost, efficient, and real-time wearable text-to-speech scanner has been proposed that can enable blind persons to hear the contents of a text material. The device captures the images of the text and converts them to speech. The hardware of the device has been realized using Raspberry Pi 3, Pi camera, and an earphone. Optical character recognition (OCR) and text-to-speech synthesis (TTS) have been implemented using Raspberry Pi 3 to accomplish the working of the device. OCR technology converted the captured text images to editable text, whereas the TTS technology scanned the alphanumeric characters in the processed image and converted them to speech. The proposed technology imitates the ability of the human sensory organs and the nervous system, where the camera mimics human eye and the image processing in Raspberry Pi 3 substitutes the human brain. This proposed device can also help people suffering from diseases like dyslexia and nyctalopia, and inability to see in dim light or at night.

Download Full-text

Smart Approach to Optical Character Recognition and Ubiquitous Speech Synthesis Using Real-Time Deep Learning Algorithms

Algorithms for Intelligent Systems - Applications of Artificial Intelligence in Engineering ◽

10.1007/978-981-33-4604-8_8 ◽

2021 ◽

pp. 107-118

Author(s):

Bhargav Goradiya ◽

Yagnik Mehta ◽

Nisarg Patel ◽

Neel Macwan ◽

Vatsal Shah

Keyword(s):

Deep Learning ◽

Real Time ◽

Character Recognition ◽

Optical Character Recognition ◽

Speech Synthesis ◽

Learning Algorithms ◽

Optical Character

Download Full-text

Text to Speech Conversion using Optical character Recognition for Visually Impaired Persons

International Journal of Computer Trends and Technology ◽

10.14445/22312803/ijctt-v29p118 ◽

2015 ◽

Vol 29 (2) ◽

pp. 97-102

Author(s):

Prince saini ◽

◽

Rajesh Mehra

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Visually Impaired ◽

Text To Speech ◽

Optical Character ◽

Visually Impaired Persons

Download Full-text

Expiry date recognition using deep neural networks

International Joural of User-System Interaction ◽

10.37789/ijusi.2020.13.1.1 ◽

2020 ◽

Vol 13 (1) ◽

pp. 1-17

Author(s):

Traian Rebedea ◽

Vlad Florea

Keyword(s):

Neural Networks ◽

Character Recognition ◽

Optical Character Recognition ◽

Speech Synthesis ◽

Efficient Solution ◽

Deep Neural Networks ◽

Accuracy Improvement ◽

Expiry Date ◽

Food Items ◽

Optical Character

This paper proposes a deep learning solution for optical character recognition, specifically tuned to detect expiration dates that are printed on the packaging of food items. This method can be used to reduce food waste, having a significant impact on the design of smart refrigerators and can prove especially useful for persons with vision difficulties, by combining it with a speech synthesis engine. The main problem in designing an efficient solution for expiry date recognition is the lack of a large enough dataset to train deep neural networks. To tackle this issue, we propose to use an additional dataset composed of synthetically generated images. Both the synthetic and real image datasets are detailed in the paper and we show that the proposed method offers a 9.4% accuracy improvement over using real images alone.

Download Full-text

Bangla Optical Character Recognition and Text-to-Speech Conversion using Raspberry Pi

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2020.0110636 ◽

2020 ◽

Vol 11 (6) ◽

Author(s):

Aditya Rajbongshi ◽

Md. Ibadul ◽

Al Amin ◽

Md. Mahbubur ◽

Anup Majumder ◽

...

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Raspberry Pi ◽

Text To Speech ◽

Optical Character

Download Full-text

Development of Text to Speech Conversion System for Low Vision and Blind People

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.d2110.049620 ◽

2020 ◽

Vol 9 (4) ◽

pp. 2154-2158

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Low Vision ◽

World Health ◽

Thought Process ◽

Speech Output ◽

Optical Character ◽

Content Recognition ◽

Visually Challenged ◽

Health Organization

Around the world 285 million individuals are found to be visually challenged out of 7.4 billion populations found in a survey made by World Health Organization. These people face many problems but the major problem is reading. It is observed that they cannot read the text which is not written in braille. In the thought process of supporting them, here is a framework proposed for the visually challenged people which can perform content recognition and produce voice yield. This can assist the visually challenged people with reading any printed content and convey in speech output. A camera is utilized to capture the content from the printed content and the captured picture experiences progression of picture pre-preprocessing steps to get the content of the picture and expels the background. Characters are identified utilizing Tesseract-Optical Character recognition (OCR). The identified script is then changed into voice, utilizing open source speech synthesizer (TTS). Finally, the speech output is heard by the earphones.

Download Full-text