Braille Recognition for Reducing Asymmetric Communication between the Blind and Non-Blind

Symmetry ◽  
2020 ◽  
Vol 12 (7) ◽  
pp. 1069
Author(s):  
Bi-Min Hsu

Assistive braille technology has existed for many years with the purpose of aiding the blind in performing common tasks such as reading, writing, and communicating with others. Such technologies aim to help those who are visually impaired better adapt to the visual world. However, an obvious gap exists in current technology when it comes to symmetric two-way communication between the blind and non-blind, as little technology allows non-blind individuals to understand the braille system. This research presents a novel approach to converting images of braille into English text by employing a convolutional neural network (CNN) model and a ratio character segmentation algorithm (RCSA). Further, a new dataset was constructed containing a total of 26,724 labeled braille images, covering 37 braille symbols that correspond to 71 different English characters, including the alphabet, punctuation, and numbers. The CNN model achieved a prediction accuracy of 98.73% on the test set. The functionality of this artificial intelligence (AI) based recognition system could be tested through accessible user interfaces in the future.
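One reason 37 braille symbols can correspond to 71 English characters is that the same cell is read differently depending on context. The sketch below is not the paper's CNN or RCSA. It is a minimal illustration, assuming each cell has already been classified into its raised-dot pattern, of how the standard braille number sign (dots 3-4-5-6) turns the cells for a-j into the digits 1-0. The names `LETTERS`, `NUMBER_SIGN`, `DIGITS`, and `decode` are hypothetical, only the first ten letters are covered, and ending numeric mode at a blank cell is a simplification.

```python
# Dot patterns (dots numbered 1-6) for the first ten letters in standard braille.
LETTERS = {
    "1": "a", "12": "b", "14": "c", "145": "d", "15": "e",
    "124": "f", "1245": "g", "125": "h", "24": "i", "245": "j",
}
NUMBER_SIGN = "3456"  # the number indicator cell
DIGITS = dict(zip("abcdefghij", "1234567890"))  # a-j double as 1-9 and 0


def decode(cells):
    """Turn a list of dot-pattern strings into text (illustrative subset only)."""
    out = []
    numeric = False
    for cell in cells:
        if cell == NUMBER_SIGN:
            numeric = True          # following a-j cells are read as digits
            continue
        if cell == "":              # blank cell: a space, which ends numeric mode here
            numeric = False
            out.append(" ")
            continue
        letter = LETTERS[cell]
        out.append(DIGITS[letter] if numeric else letter)
    return "".join(out)


decoded = decode(["3456", "1", "12", "", "1", "12"])
```

Here the same two cells decode to digits after the number sign and to letters after the space, which is the kind of one-to-many mapping the symbol and character counts suggest.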

Author(s):  
Oyeniran Oluwashina Akinloye ◽  
Oyebode Ebenezer Olukunle

Numerous works have been proposed and implemented for the computerization of various human languages; nevertheless, only minuscule effort has been made to put Yorùbá handwritten characters on the map of Optical Character Recognition. This study presents a novel technique for developing a Yorùbá alphabet recognition system through the use of deep learning. The developed model was implemented in the Matlab R2018a environment using the developed framework, where 10,500 samples of the dataset were used for training and 2,100 samples were used for testing. The model was trained for 30 epochs at 164 iterations per epoch, for a total of 4,920 iterations. The training period was estimated at 11,296 minutes 41 seconds. The model yielded a network accuracy of 100%, while the accuracy on the test set is 97.97%, with an F1 score of 0.9800, precision of 0.9803, and recall of 0.9797.
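The reported figures can be cross-checked with two short calculations. The sketch below assumes the F1 score is the usual harmonic mean of precision and recall (the abstract does not say how it was averaged across classes), and the variable names are illustrative.

```python
# Values taken from the abstract.
epochs = 30
iterations_per_epoch = 164
precision = 0.9803
recall = 0.9797

# Total training iterations is epochs times iterations per epoch.
total_iterations = epochs * iterations_per_epoch

# F1 as the harmonic mean of precision and recall (an assumption about
# how the reported score was computed).
f1 = 2 * precision * recall / (precision + recall)

print(total_iterations, round(f1, 4))
```

Both results agree with the stated totals to the precision given in the abstract.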


2021 ◽  
Vol 13 (1) ◽  
pp. 30-38
Author(s):  
Nabila Husna Shabrina ◽  
Julando Omar ◽  
Akmal Nusa Bhakti ◽  
Axel Patria

This study proposes an Emotion Recognition System that uses a Convolutional Neural Network (CNN) in a virtual meeting environment to detect the non-verbal feedback that emerges when communicating. The study starts by training the CNN model with version 2.3.0 of the tensorflow-gpu library on the FER-2013 dataset, where 80% of the data is used as the training set and the other 20% is used as the test set. The model is trained for 430 epochs, resulting in an accuracy of 73.86% with a loss of 1.42. In the classification process, a Haar-Cascade Classifier algorithm is used with OpenCV to detect faces within an input image. Next, the trained model is used to predict the emotion in the pre-processed image. Based on the results shown, the study concludes that the system provides satisfactory results and is expected to help in understanding non-verbal input given when communicating, among other uses.
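The 80/20 partition described above can be sketched as follows. This is a minimal illustration using only the standard library, not the authors' code: the function name `split_80_20`, the fixed seed, and the use of integer sample IDs in place of FER-2013 images are all assumptions.

```python
import random


def split_80_20(samples, seed=0):
    """Shuffle a copy of the samples and return (train, test) in an 80/20 ratio."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)   # seeded so the split is reproducible
    cut = (len(shuffled) * 4) // 5          # integer 80% boundary
    return shuffled[:cut], shuffled[cut:]


# Placeholder sample IDs standing in for images.
train, test = split_80_20(range(100))
```

Shuffling before cutting keeps the class mix of the two sets roughly similar, although a stratified split would be needed to guarantee it.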


2020 ◽  
Author(s):  
Karthika Kuppusamy ◽  
Chandra Eswaran

With the growth of conversational voice recognition systems such as Alexa, SIRI, and OK Google, natural language conversational systems, including chatbots and voice recognition systems, are at a new high, and determining the age of a speaker is critical for setting the pertinent context. Age can be inferred from the speech signal through various factors such as physical attributes of the voice, linguistic attributes, frequency, and speech rate. This article discusses extracting spectral features of speech such as Cepstral Coefficients, Spectral Decrease, Centroid, Flatness, Spectral Entropy, F0DIFF, Jitter, and Shimmer as inputs, which would help in classifying speaker age through deep learning techniques. A novel approach is presented along with a model for implementation using a Deep Neural Network and a Convolutional Neural Network for classifying the features using three different classifiers: Gaussian Mixture Model (GMM), Support Vector Machine (SVM), and GMM-SVM. The results obtained from the proposed system would outline the performance in speaker age recognition.


Pedestrians in the roadway are in danger of being hit, which can cause severe harm to both pedestrians and vehicle occupants. Hence, real-time pedestrian detection was performed on a set of recorded videos, and the system detects the persons/pedestrians in the given input videos. In this work, a real-time scheme was proposed based on Aggregated Channel Features (ACF) running on a CPU. The proposed technique does not need to resize the input image or alter the video quality. We also use SVM with HOG and SVM with HAAR to detect pedestrians. In addition, Convolutional Neural Networks (CNN) were trained on a set of pedestrian image datasets and later tested on a test set of pedestrian images. The experiments demonstrated that the proposed technique could be used to detect pedestrians in video with acceptable error rates and high prediction accuracy. It can therefore be applied to real-time video streams and also to the detection of pedestrians in prerecorded videos.


2016 ◽  
Vol 136 (10) ◽  
pp. 719-726
Author(s):  
Junya Arakaki ◽  
Hitoshi Ishikawa ◽  
Itaru Nagayama

2020 ◽  
Author(s):  
Ganesh Awasthi ◽  
Dr. Hanumant Fadewar ◽  
Almas Siddiqui ◽  
Bharatratna P. Gaikwad

2019 ◽  
Author(s):  
CHIEN WEI ◽  
Chi Chow Julie ◽  
Chou Willy

Background: Dengue fever (DF) is an important public health issue in Asia. However, the disease is extremely hard to detect using traditional dichotomous (i.e., absent vs. present) evaluations of symptoms. The convolutional neural network (CNN), a well-established deep learning method, can improve prediction accuracy on account of its use of a large number of parameters for modeling. Whether the HT person fit statistic can be combined with CNN to increase the prediction accuracy of the model and develop an application (APP) to detect DF in children remains unknown. Objectives: The aim of this study is to build a model for the automatic detection and classification of DF from symptoms to help patients, family members, and clinicians identify the disease at an early stage. Methods: We extracted 19 feature variables of DF-related symptoms from 177 pediatric patients (69 diagnosed with DF) and used CNN to predict DF risk. The accuracy of two sets of characteristics (19 symptoms alone, and 19 symptoms plus four other variables: person mean, standard deviation, and two HT-related statistics matched to DF+ and DF−) for predicting DF was then compared. Data were separated into training and testing sets, and the former was used to predict the latter. We calculated the sensitivity (Sens), specificity (Spec), and area under the receiver operating characteristic curve (AUC) across studies for comparison. Results: We observed that (1) the 23-item model yields a higher accuracy rate (0.95) and AUC (0.94) than the 19-item model (accuracy = 0.92, AUC = 0.90) based on the 177-case training set; (2) the Sens values are higher than the corresponding Spec values in most cases (90% of 10 scenarios) for predicting DF; (3) the Sens and Spec values of the 23-item model are consistently higher than those of the 19-item model. An APP was subsequently designed to detect DF in children.
Conclusion: The 23-item model yielded higher accuracy rates (0.95) and AUC (0.94) than the 19-item model (accuracy = 0.92, AUC = 0.90). An APP could be developed to help patients, family members, and clinicians discriminate DF from other febrile illnesses at an early stage.
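Sensitivity, specificity, and accuracy as used above all derive from a 2x2 confusion matrix. The sketch below shows the standard definitions. The counts are hypothetical and are NOT taken from the study; only the group sizes (69 DF+ and 108 DF− out of 177 patients) match the abstract, and the function name `binary_metrics` is illustrative.

```python
def binary_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)                  # true positive rate among DF+ cases
    specificity = tn / (tn + fp)                  # true negative rate among DF- cases
    accuracy = (tp + tn) / (tp + fn + tn + fp)    # overall fraction classified correctly
    return sensitivity, specificity, accuracy


# Hypothetical counts for illustration only: 69 DF+ (tp + fn), 108 DF- (tn + fp).
sens, spec, acc = binary_metrics(tp=60, fn=9, tn=100, fp=8)
print(f"Sens={sens:.3f} Spec={spec:.3f} Acc={acc:.3f}")
```

AUC is not shown because it requires the model's continuous risk scores rather than a single set of counts.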


2017 ◽  
Vol MCSP2017 (01) ◽  
pp. 30-34
Author(s):  
Somalin Sandha ◽  
Debaraj Rana

In the present day, security and authentication are very much needed to make the world safe. Among security concerns, one vital issue is recognizing a car's number plate for authorization. In a busy world not everything can be monitored by a human, so automatic license plate recognition is one of the best applications for authorization without human involvement. In the proposed method we divide the problem into three parts: first, extraction of the number plate region; second, segmentation of the characters; and finally, authorization through recognition and classification. For number plate extraction and segmentation we use morphology-based approaches, whereas for classification we use a neural network as the classifier. The proposed method works well in a variety of scenarios, and the performance level is quite good.
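The abstract does not say which morphological operations were used. As a generic illustration of the family of techniques, the sketch below implements binary opening (erosion followed by dilation) with a 3x3 square structuring element in plain Python. Opening removes specks smaller than the structuring element while keeping larger solid regions, which is one common way to clean a binarized image before locating a plate-like block. The function names and the toy image are assumptions.

```python
NEIGHBORS = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)]


def erode(img):
    """3x3 binary erosion; pixels outside the image count as 0."""
    h, w = len(img), len(img[0])
    return [[int(all(0 <= r + dr < h and 0 <= c + dc < w and img[r + dr][c + dc]
                     for dr, dc in NEIGHBORS))
             for c in range(w)] for r in range(h)]


def dilate(img):
    """3x3 binary dilation; pixels outside the image count as 0."""
    h, w = len(img), len(img[0])
    return [[int(any(0 <= r + dr < h and 0 <= c + dc < w and img[r + dr][c + dc]
                     for dr, dc in NEIGHBORS))
             for c in range(w)] for r in range(h)]


def opening(img):
    """Erosion followed by dilation: drops small specks, keeps solid blocks."""
    return dilate(erode(img))


# Toy binary image: a solid 3x3 block plus one isolated noise pixel at the right edge.
image = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0, 1],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
opened = opening(image)
```

In practice a library such as OpenCV would be used on real images, with a structuring element sized to the expected plate or character dimensions.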

