Intelligence Eye for Blinds and Visually Impaired by Using Region-Based Convolutional Neural Network (R-CNN)

Intelligence Eye is an Android based mobile application developed to help blind and visually impaired users to detect light and objects. Intelligence Eye used Region-based Convolutional Neural Networks (R-CNN) to recognize objects in the object recognition module and a vibration feedback is provided according to the light value in the light detection module. A voice guidance is provided in the application to guide the users and announce the result of the object recognition. TensorFlow Lite is used to train the neural network model for object recognition in conjunction with extensible markup language (XML) and Java in Android Studio for the programming language. For future works, improvements can be made to enhance the functionality of the Intelligence Eye application by increasing the object detection capacity in the object recognition module, add menu settings for vibration intensity in light detection module and support multiple languages for the voice guidance.

Download Full-text

Augmented Reality Maintenance Assistant Using YOLOv5

Applied Sciences ◽

10.3390/app11114758 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4758

Author(s):

Ana Malta ◽

Mateus Mendes ◽

Torres Farinha

Keyword(s):

Neural Network ◽

Deep Learning ◽

Object Recognition ◽

Augmented Reality ◽

Real Time ◽

Recognition System ◽

High Accuracy ◽

Video Streams ◽

The Neural Network ◽

Deep Learning Neural Network

Maintenance professionals and other technical staff regularly need to learn to identify new parts in car engines and other equipment. The present work proposes a model of a task assistant based on a deep learning neural network. A YOLOv5 network is used for recognizing some of the constituent parts of an automobile. A dataset of car engine images was created and eight car parts were marked in the images. Then, the neural network was trained to detect each part. The results show that YOLOv5s is able to successfully detect the parts in real time video streams, with high accuracy, thus being useful as an aid to train professionals learning to deal with new equipment using augmented reality. The architecture of an object recognition system using augmented reality glasses is also designed.

Download Full-text

APPLICATION OF CONVOLUTIONAL NEURAL NETWORK FOR DETECTION OF MELANOMA USING SKIN LESION IMAGE ON MOBILE DEVICE

Ukrainian Journal of Information Technology ◽

10.23939/ujit2021.03.008 ◽

2021 ◽

Vol 3 (1) ◽

pp. 8-14

Author(s):

D. V. Fedasyuk ◽

◽

T. V. Demianets ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Mobile Device ◽

Mobile Application ◽

High Accuracy ◽

File Size ◽

Melanoma Detection ◽

The Neural Network ◽

Classification Time

A melanoma is the deadliest skin cancer, so early diagnosis can provide a positive prognosis for treatment. Modern methods for early detecting melanoma on the image of the tumor are considered, and their advantages and disadvantages are analyzed. The article demonstrates a prototype of a mobile application for the detection of melanoma on the image of a mole based on a convolutional neural network, which is developed for the Android operating system. The mobile application contains melanoma detection functions, history of the previous examinations and a gallery with images of the previous examinations grouped by the location of the lesion. The HAM10000-based training dataset has been supplemented with the images of melanoma from the archive of The International Skin Imaging Collaboration to eliminate class imbalances and improve network accuracy. The search for existing neural networks that provide high accuracy was conducted, and VGG16, MobileNet, and NASNetMobile neural networks have been selected for research. Transfer learning and fine-tuning has been applied to the given neural networks to adapt the networks for the task of skin lesion classification. It is established that the use of these techniques allows to obtain high accuracy of the neural network for this task. The process of converting a convolutional neural network to an optimized Flatbuffer format using TensorFlow Lite for placement and use on a mobile device is described. The performance characteristics of the selected neural networks on the mobile device are evaluated according to the classification time on the CPU and GPU and the amount of memory occupied by the file of a single network is compared. The neural network file size was compared before and after conversion. It has been shown that the use of the TensorFlow Lite converter significantly reduces the file size of the neural network without affecting its accuracy by using an optimized format. The results of the study indicate a high speed of application and compactness of networks on the device, and the use of graphical acceleration can significantly decrease the image classification time of the tumor. According to the analyzed parameters, NASNetMobile was selected as the optimal neural network to be used in the mobile application of melanoma detection.

Download Full-text

An Accessible Keyboard for Android Devices as a Means for Promoting Braille Literacy

International Journal of Interactive Mobile Technologies (iJIM) ◽

10.3991/ijim.v10i2.5527 ◽

2016 ◽

Vol 10 (2) ◽

pp. 77

Author(s):

Paul D Hatzigiannakoglou ◽

Maria T Kampouraki

Keyword(s):

Young Adults ◽

Mobile Application ◽

Visually Impaired ◽

Daily Practice ◽

Evaluation Study ◽

Text Entry ◽

Text Input ◽

Blind And Visually Impaired ◽

Accessibility Barriers ◽

Braille Literacy

BrailleOne is a mobile application designed to provide blind and visually-impaired Braille readers with an accessible keyboard for manual text input on their Android smartphones or tablets. Its purpose is to make use of adolescents and young adults’ excitement for technology in order to facilitate Braille learning and daily practice. The results of a pilot evaluation study suggest that BrailleOne could contribute to the overcoming of accessibility barriers in the field of text entry.

Download Full-text

A Low-cost Artificial Neural Network Model for Raspberry Pi

Engineering, Technology & Applied Science Research ◽

10.48084/etasr.3357 ◽

2020 ◽

Vol 10 (2) ◽

pp. 5466-5469 ◽

Cited By ~ 1

Author(s):

S. N. Truong

Keyword(s):

Neural Network ◽

Object Recognition ◽

Embedded System ◽

Real Time ◽

Image Recognition ◽

Low Cost ◽

Recognition Rate ◽

Raspberry Pi ◽

The Neural Network ◽

Binary Array

In this paper, a ternary neural network with complementary binary arrays is proposed for representing the signed synaptic weights. The proposed ternary neural network is deployed on a low-cost Raspberry Pi board embedded system for the application of speech and image recognition. In conventional neural networks, the signed synaptic weights of –1, 0, and 1 are represented by 8-bit integers. To reduce the amount of required memory for signed synaptic weights, the signed values were represented by a complementary binary array. For the binary inputs, the multiplication of two binary numbers is replaced by the bit-wise AND operation to speed up the performance of the neural network. Regarding image recognition, the MINST dataset was used for training and testing of the proposed neural network. The recognition rate was as high as 94%. The proposed ternary neural network was applied to real-time object recognition. The recognition rate for recognizing 10 simple objects captured from the camera was 89%. The proposed ternary neural network with the complementary binary array for representing the signed synaptic weights can reduce the required memory for storing the model’s parameters and internal parameters by 75%. The proposed ternary neural network is 4.2, 2.7, and 2.4 times faster than the conventional ternary neural network for MNIST image recognition, speech commands recognition, and real-time object recognition respectively.

Download Full-text

Identification of Baikal phytoplankton inferred from computer vision methods and machine learning

Limnology and Freshwater Biology ◽

10.31951/2658-3518-2021-a-3-1143 ◽

2021 ◽

pp. 1143-1146

Author(s):

A.V. Lysenko ◽

◽

M.S. Oznobikhin ◽

E.A. Kireev ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Computer Vision ◽

Object Recognition ◽

Light Microscope ◽

Optimal Size ◽

Optimal Parameters ◽

The Neural Network

Abstract. This study discusses the problem of phytoplankton classification using computer vision methods and convolutional neural networks. We created a system for automatic object recognition consisting of two parts: analysis and primary processing of phytoplankton images and development of the neural network based on the obtained information about the images. We developed software that can detect particular objects in images from a light microscope. We trained a convolutional neural network in transfer learning and determined optimal parameters of this neural network and the optimal size of using dataset. To increase accuracy for these groups of classes, we created three neural networks with the same structure. The obtained accuracy in the classification of Baikal phytoplankton by these neural networks was up to 80%.

Download Full-text

A Systematic Literature Review of the Mobile Application for Object Recognition for Visually Impaired People

2020 8th International Conference on Information Technology and Multimedia (ICIMU) ◽

10.1109/icimu49871.2020.9243523 ◽

2020 ◽

Author(s):

Zulfadhlina Amira Nor Hisham ◽

Masyura Ahmad Faudzi ◽

Azimah Abdul Ghapar ◽

Fiza Abdul Rahim

Keyword(s):

Object Recognition ◽

Literature Review ◽

Systematic Literature Review ◽

Mobile Application ◽

Visually Impaired ◽

Visually Impaired People ◽

Impaired People

Download Full-text

Object Recognition and Hearing Assistive Technology Mobile Application using Convolutional Neural Network

Proceedings of the International Conference on Wireless Communication and Sensor Networks ◽

10.1145/3411201.3411208 ◽

2020 ◽

Author(s):

Arlene R. Caballero ◽

Kristopher Elijah I. Catli ◽

Aldwin Garry F. Babierra

Keyword(s):

Neural Network ◽

Object Recognition ◽

Assistive Technology ◽

Convolutional Neural Network ◽

Mobile Application

Download Full-text

Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet

PeerJ Computer Science ◽

10.7717/peerj-cs.197 ◽

2019 ◽

Vol 5 ◽

pp. e197 ◽

Cited By ~ 6

Author(s):

Andrey V. Savchenko

Keyword(s):

Neural Network ◽

Mobile Application ◽

Automatic Extraction ◽

Agglomerative Clustering ◽

Identity Recognition ◽

Second Stage ◽

The Neural Network ◽

Hierarchical Agglomerative Clustering ◽

And Gender

This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified and is preliminarily trained to perform face recognition in order to additionally recognize age and gender. The age is estimated as the expected value of top predictions in the neural network. In the second stage of the proposed approach, extracted faces are grouped using hierarchical agglomerative clustering techniques. The birth year and gender of a person in each cluster are estimated using aggregation of predictions for individual photos. The proposed approach is implemented in an Android mobile application. It is experimentally demonstrated that the quality of facial clustering for the developed network is competitive with the state-of-the-art results achieved by deep neural networks, though implementation of the proposed approach is much computationally cheaper. Moreover, this approach is characterized by more accurate age/gender recognition when compared to the publicly available models.

Download Full-text