Mobile app with reading speech-translated OCR images for visually impaired people

Author(s):  
Francisco Vázquez-Guzmán ◽  
Liliana Elena Olguín-Gil ◽  
Eduardo Vázquez-Zayas ◽  
Brawhim Jesseth Nicanor-Pimentel

This research provides an overview of the technologies that can be used to benefit people with visual disabilities. The association "Sentir con los ojos del corazón", located in Tehuacán, Puebla, México, serves people with visual disabilities who lack technological tools for understanding text in their environment, such as restaurant menus, signs on doors, books, and any other setting that contains text. This makes life difficult in a world where most texts are designed for sighted people. Few applications exist that help people with visual disabilities improve their lives in the different areas in which they operate. Therefore, it is proposed to design a mobile application that interacts with a virtual assistant and uses optical character recognition (OCR) to convert images of text into speech, allowing users to function in educational, work, social, and other environments. The project promotes the inclusion of people with visual disabilities, improving their quality of life through applications for mobile devices and helping them be self-sufficient in daily life. Later stages aim to support translation into different languages, with different voice intensities and tones, on different platforms.

2021 ◽  
Author(s):  
Jay Bagrecha ◽  
Tanay Shah ◽  
Karan Shah ◽  
Tanvi Gandhi ◽  
Sushila Palwe

In India, almost 18 million visually impaired people have difficulty managing their day-to-day activities. Hence, there is a need for an application that can assist them at all times and give vocal instructions in both English and Hindi. In this paper, we introduce a robust, lightweight Android application that assists visually impaired individuals by providing a variety of essential features, such as object and distance detection, Indian currency note detection, and optical character recognition, that can enhance their quality of life. The application aims to offer a user-friendly GUI well suited to the needs of blind users, along with modules such as Object Recognition with Image Captioning, so that visually challenged users can gain a better understanding of their surroundings.


1979 ◽  
Vol 73 (10) ◽  
pp. 389-399
Author(s):  
Gregory L. Goodrich ◽  
Richard R. Bennett ◽  
William R. De L'aune ◽  
Harvey Lauer ◽  
Leonard Mowinski

This study was designed to assess the Kurzweil Reading Machine's ability to read three different type styles produced by five different means. The results indicate that the Kurzweil Reading Machines tested have different error rates depending upon the means of producing the copy and upon the type style used; there was a significant interaction between copy method and type style. The interaction indicates that some type styles are better read when the copy is made by one means rather than another. Error rates varied between less than one percent and more than twenty percent. In general, the user will find that high quality printed materials will be read with a relatively high level of accuracy, but as the quality of the material decreases, the number of errors made by the machine also increases. As this error rate increases, the user will find it increasingly difficult to understand the spoken output.
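
The abstract above reports error rates without stating how they were computed. As a rough illustration only, one common way to score reading-machine or OCR output is the word error rate: the word-level edit distance between a reference text and the recognized text, divided by the number of reference words. The function names and the sample strings below are assumptions made for this sketch, not the study's procedure.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (insert/delete/substitute)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_error_rate(reference, recognized):
    """Word-level edit distance divided by the number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference text must contain at least one word")
    return edit_distance(ref_words, recognized.split()) / len(ref_words)


# Hypothetical sample: one substituted word and one dropped word out of five.
rate = word_error_rate("the quick brown fox jumps", "the quick brawn fox")
```

A rate computed this way can exceed 1.0 when the recognized text contains many spurious words, which is one reason to report the metric together with its definition.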


Author(s):  
Ton Tsang ◽  
Cheung Yip Kan

With the continuous improvement of complex electronic devices, the safety of technicians has also become a matter of great concern, because technicians' lives are in jeopardy while they work with circuit breakers shut down: even once a breaker has been switched off, someone may inadvertently switch it back on while a technician is still working. A system is therefore needed to guarantee technicians' safety. In addition, people do not like having to flip switches all the time to turn appliances such as fans, lights, and air conditioners on and off, and energy is wasted when appliances are left on unnecessarily. To address these issues, we propose a system of mobile app-controlled circuit breakers that also provides wireless control of home appliances through an Android app. It replaces a traditional breaker with a mobile app-controlled on/off system in which no one can activate the breaker without the password. Remote control of home appliances helps the user save electricity and enhances quality of life and comfort. Additionally, the system includes a home security mechanism against drone intrusion, using a mobile app-controlled door lock system, as well as a mechanism for detecting dangerous gas leaks. The system consists of an ESP32 microcontroller, a Bluetooth module, a 4x4 matrix keypad, and a paraffin gas detector, together with an Android mobile application. The entire system is compact.


2015 ◽  
Vol 74 (6) ◽  
Author(s):  
Teng Ren Sin ◽  
Eileen Su Lee Ming ◽  
Yeong Che Fai ◽  
Ong Jian Fu ◽  
Sim Yang Shane

People with low vision have visual acuity less than 6/18 and at least 3/60 in the better eye, with correction. Their limited vision requires them to enhance their reading ability using a magnifying glass or an electronic screen magnifier. However, people with severe low vision have difficulty with such assistive tools and suffer fatigue from using them. This paper presents the development of a mobile text reader designed for people with low vision. The mobile text reader is developed as a mobile application that allows the user to capture an image of text and then converts the text into audio. One main contribution of this work, compared to typical optical character recognition (OCR) engines or text-to-speech engines, is the addition of an image stitching feature. The image stitching feature can produce a single image from multiple poorly aligned images and is integrated into the image acquisition process. Either the single or the composite image is subsequently uploaded to a cloud-based OCR engine for robust character recognition. Finally, a text-to-speech (TTS) synthesizer reproduces the recognized words as natural-sounding speech. The whole series of computations is implemented as a mobile application that runs on a smartphone, allowing visually impaired users to access text information independently.
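
The acuity range quoted in the abstract above (less than 6/18 and at least 3/60 in the better eye, with correction) can be written as a simple check. This is a minimal sketch: the function name and the representation of a Snellen fraction as a numerator/denominator pair are assumptions made for illustration, and it is not a clinical tool.

```python
from fractions import Fraction

# Bounds taken from the abstract: acuity < 6/18 and acuity >= 3/60.
LOW_VISION_UPPER = Fraction(6, 18)
LOW_VISION_LOWER = Fraction(3, 60)


def is_low_vision(numerator, denominator):
    """Return True if a corrected better-eye Snellen acuity is in the quoted range."""
    acuity = Fraction(numerator, denominator)
    return LOW_VISION_LOWER <= acuity < LOW_VISION_UPPER


# Hypothetical readings: 6/24 falls inside the range, 6/6 and 1/60 do not.
examples = {
    "6/24": is_low_vision(6, 24),
    "6/6": is_low_vision(6, 6),
    "1/60": is_low_vision(1, 60),
}
```

Using exact fractions rather than floating-point values keeps the boundary cases (exactly 6/18 and exactly 3/60) unambiguous.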


Visually impaired persons cannot perform all tasks as sighted persons do, especially when purchasing products in a supermarket. To help blind people recognise objects, a text-reading method assisted by a camera is proposed. A motion detection method is used to detect the presence of an object. Audio instructions about the objects and their locations in the supermarket are given to the blind user, helping them move freely inside the supermarket. The proposed system aims to make it more convenient for blind persons to shop in a sophisticated environment. It also makes shopping easier and saves consumers' time. The proposed system is implemented using artificial intelligence and OCR technology. General Terms: Visually impaired people, smart shopping, OCR.


2021 ◽  
Author(s):  
S. Anbarasi ◽  
S. Krishnaveni ◽  
R. Aruna ◽  
K. Karpagasaravanakumar

Visually impaired people are unable to read text with existing technology. The proposed project aims to design spectacles with a camera, with which blind and visually impaired people can read whatever they want using contemporary OCR (optical character recognition) techniques and text-to-speech (TTS) engines. The proposed smart reader will read any kind of document, such as books, magazines and mobiles. People with blindness and limited vision can access this novel technology. The earlier version of the project was developed successfully as a mobile reader, but it had certain drawbacks: high cost due to the need for an Android mobile, poor user-friendliness and improper focusing. To overcome these disadvantages, a spectacle-type reader with a camera is proposed in this project, which will be cost-effective and more efficient.


Author(s):  
Kirad Varad Vinay ◽  
Indla Omkar Balaobaiah ◽  
Mujawar Sohail Mahiboob ◽  
Shinde Dinesh Nagnath ◽  
Prof. Darshana Patil

According to a survey, the total number of vehicles in India was 260 million [1]. Therefore, there is a need to develop Automatic Number Plate Recognition (ANPR) systems [1] in India because of the large number of vehicles travelling on the roads [1]. Such systems would also help in tracking vehicles, monitoring traffic, finding stolen vehicles, supervising parking tolls and taking strict action against red-light violations. Automatic number plate recognition is an image processing technique for finding the number plate in an image and extracting characters from the detected plate. ANPR in India has always been challenging because of varying lighting conditions, fonts, shapes, angles, letter sizes, numbers of lines, padding between lines, and languages. In our project we propose a model that can detect number plates while accounting for all of these irregularities. The system uses computer vision and machine learning to detect the number plate in an image. In our proposed system the number plate can use different fonts and non-Roman scripts. To identify the characters on the number plate we use an OCR (optical character recognition) technique. OCR involves two parts: character segmentation and character recognition. This OCR system can be used to extract characters in different fonts and non-Roman scripts. The quality of OCR depends on the quality of the image, the image contrast, and the text font style and size. To improve OCR quality, image processing techniques can be used to enhance the image.
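
The abstract above names character segmentation as the first of the two OCR stages but does not say how it is done. One classic approach, shown here only as an assumed illustration, is vertical projection: count the ink pixels in each column of a binarized plate image and cut wherever a column is empty. The function name and the tiny sample plate are hypothetical.

```python
def segment_columns(binary):
    """Split a binary plate image (rows of 0/1, 1 = ink) into character spans.

    Returns a list of (start, end) column ranges, end exclusive, one per run
    of consecutive columns that contain at least one ink pixel.
    """
    if not binary:
        return []
    width = len(binary[0])
    # Vertical projection: number of ink pixels in each column.
    profile = [sum(row[x] for row in binary) for x in range(width)]
    spans, start = [], None
    for x, count in enumerate(profile):
        if count and start is None:
            start = x                 # a character begins
        elif not count and start is not None:
            spans.append((start, x))  # an empty column ends the character
            start = None
    if start is not None:
        spans.append((start, width))  # character touching the right edge
    return spans


# Hypothetical 3x9 plate with three ink blobs separated by empty columns.
plate = [
    [0, 1, 1, 0, 0, 1, 0, 1, 1],
    [0, 1, 0, 0, 0, 1, 0, 0, 1],
    [0, 1, 1, 0, 0, 1, 0, 1, 1],
]
spans = segment_columns(plate)
```

Projection-based cuts assume characters do not touch and that the plate is roughly upright; touching characters or multi-line plates, both mentioned as difficulties in the abstract, would need a more robust method such as connected component analysis.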


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Acrapol Nimmolrat ◽  
Pattaraporn Khuwuthyakorn ◽  
Purida Wientong ◽  
Orawit Thinnukool

Abstract. Background: Most mobile pharmaceutical applications produced for people with visual disabilities in Thailand fail to meet the required standard due to poor-quality regulations, defective design, lack of user support and impracticality; as a result, visually-impaired people are unable to use them. This research is motivated by the limited use of this technology in primary medical services, and its aim is to enable people with disabilities to access effective digital health information. The research objective is to analyse, design and develop a mobile pharmaceutical application with functions that are appropriate for visually-impaired users, and to test its usability. Results: The application, as designed and developed, contained five necessary functions. When usability and user satisfaction were tested, entering information into the application was found to have low usability. According to the test results, the medicinal database function was missing 71 times and the voice command function was missing 34 times. In the user satisfaction results, users with the highest level of usage gave higher average scores for user attitude, user confidence, user interface and system performance than those with lower levels of usage. The scores of both groups were the same for the implementation of the development. Conclusions: This mobile application, which was developed using smart technology, will play an important role in supporting visually-impaired people in Thailand by enhancing the efficacy of self-care. The design and development of the application will ensure the suitability of many functions for visually-impaired users. However, despite the high functional capacity of the application, the gap in healthcare services between the general public and disabled groups will still exist if users have inadequate IT skills.


2021 ◽  
Vol 4 (1) ◽  
pp. 57-70
Author(s):  
Marina V. Polyakova ◽  
Alexandr G. Nesteryuk

Optical character recognition systems for images are used to convert books and documents into electronic form, to automate accounting systems in business, to recognize markers in augmented reality applications, etc. When binarization is applied, the quality of optical character recognition is largely determined by how well the foreground pixels are separated from the background. Existing methods of text image binarization are analyzed, and their binarization quality is found to be insufficient. The research uses a minimum-distance classifier to improve an existing method of binarizing color text images. To improve the quality of binarization of color text images, it is advisable to divide image pixels into two classes, “Foreground” and “Background”, using a classification method, namely a minimum-distance classifier, instead of heuristic threshold selection. To reduce the amount of information processed before the classifier is applied, it is advisable to select blocks of pixels for subsequent processing. This was done by analyzing the connected components of the original image. An improved method of color text image binarization based on connected component analysis and a minimum-distance classifier has been developed. The study showed that the method is better than existing binarization methods in terms of robustness, but worse in terms of the error in determining object boundaries. Among the recognition errors, pixels of the “Foreground” class were more often mistaken for the “Background” class. With a single prototype per class, the proposed binarization method is recommended for processing color images of printed text, where the error in determining character boundaries is compensated by the thickness of the letters. With multiple prototypes per class, the proposed binarization method is recommended for processing color images of handwritten text, if high performance is not required. The improved binarization method proved effective when the color and illumination of the text and background change slowly; however, abrupt changes in color and illumination, as well as a textured background, prevent it from achieving the binarization quality required for practical problems.
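
The core idea of a minimum-distance classifier for binarization can be sketched as follows: each pixel is assigned to the class whose prototype color is nearest. This is a simplified illustration, not the authors' method; the prototype values, the choice of Euclidean distance in RGB space, and the function names are all assumptions, and the connected component preselection step described in the abstract is omitted.

```python
from math import dist

# Hypothetical class prototypes (RGB). In practice they would be estimated
# from the image, e.g. as mean colors of sampled text and background pixels.
# A class may have a single prototype or several.
PROTOTYPES = {
    "Foreground": [(20, 20, 20)],                      # dark ink
    "Background": [(240, 240, 235), (200, 190, 160)],  # paper, aged paper
}


def classify_pixel(pixel, prototypes=PROTOTYPES):
    """Return the label of the class whose nearest prototype is closest."""
    return min(
        prototypes,
        key=lambda label: min(dist(pixel, p) for p in prototypes[label]),
    )


def binarize(image, prototypes=PROTOTYPES):
    """Map an image (rows of RGB tuples) to rows of 1 (foreground) or 0."""
    return [
        [1 if classify_pixel(px, prototypes) == "Foreground" else 0 for px in row]
        for row in image
    ]


# Hypothetical 2x3 image: light paper pixels and dark ink pixels.
image = [
    [(250, 250, 250), (30, 25, 20), (245, 240, 238)],
    [(210, 200, 170), (10, 10, 10), (60, 50, 40)],
]
mask = binarize(image)
```

Unlike a global threshold on gray level, this decision rule works directly on color and can represent a varied background by adding prototypes, at the cost of more distance computations per pixel.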

