recognition accuracy
Recently Published Documents





Sangamesh Hosgurmath ◽  
Viswanatha Vanjre Mallappa ◽  
Nagaraj B. Patil ◽  
Vishwanath Petli

Face recognition is one of the important biometric authentication research areas for security purposes in many fields such as pattern recognition and image processing. However, the human face recognitions have the major problem in machine learning and deep learning techniques, since input images vary with poses of people, different lighting conditions, various expressions, ages as well as illumination conditions and it makes the face recognition process poor in accuracy. In the present research, the resolution of the image patches is reduced by the max pooling layer in convolutional neural network (CNN) and also used to make the model robust than other traditional feature extraction technique called local multiple pattern (LMP). The extracted features are fed into the linear collaborative discriminant regression classification (LCDRC) for final face recognition. Due to optimization using CNN in LCDRC, the distance ratio between the classes has maximized and the distance of the features inside the class reduces. The results stated that the CNN-LCDRC achieved 93.10% and 87.60% of mean recognition accuracy, where traditional LCDRC achieved 83.35% and 77.70% of mean recognition accuracy on ORL and YALE databases respectively for the training number 8 (i.e. 80% of training and 20% of testing data).

2022 ◽  
Vol 12 (5) ◽  
pp. 879-887
Jiantao Zhang ◽  
Xiaobo Zhang ◽  
Dong Qu ◽  
Yan Xue ◽  
Xinling Bi ◽  

Basal cell carcinomas and Bowen’s disease (squamous cell carcinoma in situ) are the most common cutaneous tumors. The early diagnoses of these diseases are very important due to their better prognosis. But it is a heavy workload for the pathologists to recognize a large number of pathological images and diagnose these diseases. So, there is an urgent need to develop an automatic method for detecting and classifying the skin cancers. This paper presents a recognition system of dermatopathology images based on the deep convolutional neural networks (CNN). The dermatopathology images are collected from the hospital. The deep learning model is trained using different image datasets. It can be found that the recognition accuracy of the system can be improved by using data augmentation even if the number of the clinical samples are not increased. But the recognition accuracy of the system is the highest when the number of the original histological image is increased. The experimental results that the system can correctly recognize 88.5% of patients with basal cell carcinoma and 86.5% of patients with Bowen’s disease.

2022 ◽  
Vol 18 (1) ◽  
pp. 1-31
Guohao Lan ◽  
Zida Liu ◽  
Yunfan Zhang ◽  
Tim Scargill ◽  
Jovan Stojkovic ◽  

Mobile Augmented Reality (AR), which overlays digital content on the real-world scenes surrounding a user, is bringing immersive interactive experiences where the real and virtual worlds are tightly coupled. To enable seamless and precise AR experiences, an image recognition system that can accurately recognize the object in the camera view with low system latency is required. However, due to the pervasiveness and severity of image distortions, an effective and robust image recognition solution for “in the wild” mobile AR is still elusive. In this article, we present CollabAR, an edge-assisted system that provides distortion-tolerant image recognition for mobile AR with imperceptible system latency . CollabAR incorporates both distortion-tolerant and collaborative image recognition modules in its design. The former enables distortion-adaptive image recognition to improve the robustness against image distortions, while the latter exploits the spatial-temporal correlation among mobile AR users to improve recognition accuracy. Moreover, as it is difficult to collect a large-scale image distortion dataset, we propose a Cycle-Consistent Generative Adversarial Network-based data augmentation method to synthesize realistic image distortion. Our evaluation demonstrates that CollabAR achieves over 85% recognition accuracy for “in the wild” images with severe distortions, while reducing the end-to-end system latency to as low as 18.2 ms.

Santosh Dhaigude

Abstract: In todays world during this pandemic situation Online Learning is the only source where one could learn. Online learning makes students more curious about the knowledge and so they decide their learning path . But considering the academics as they have to pass the course or exam given, they need to take time to study, and have to be disciplined about their dedication. And there are many barriers for Online learning as well. Students are lowering their grasping power the reason for this is that each and every student was used to rely on their teacher and offline classes. Virtual writing and controlling system is challenging research areas in field of image processing and pattern recognition in the recent years. It contributes extremely to the advancement of an automation process and can improve the interface between man and machine in numerous applications. Several research works have been focusing on new techniques and methods that would reduce the processing time while providing higher recognition accuracy. Given the real time webcam data, this jambord like python application uses OpenCV library to track an object-of-interest (a human palm/finger in this case) and allows the user to draw bymoving the finger, which makes it both awesome and interesting to draw simple thing. Keyword: Detection, Handlandmark , Keypoints, Computer vision, OpenCV

2022 ◽  
Vol 12 (2) ◽  
pp. 853
Cheng-Jian Lin ◽  
Yu-Cheng Liu ◽  
Chin-Ling Lee

In this study, an automatic receipt recognition system (ARRS) is developed. First, a receipt is scanned for conversion into a high-resolution image. Receipt characters are automatically placed into two categories according to the receipt characteristics: printed and handwritten characters. Images of receipts with these characters are preprocessed separately. For handwritten characters, template matching and the fixed features of the receipts are used for text positioning, and projection is applied for character segmentation. Finally, a convolutional neural network is used for character recognition. For printed characters, a modified You Only Look Once (version 4) model (YOLOv4-s) executes precise text positioning and character recognition. The proposed YOLOv4-s model reduces downsampling, thereby enhancing small-object recognition. Finally, the system produces recognition results in a tax declaration format, which can upload to a tax declaration system. Experimental results revealed that the recognition accuracy of the proposed system was 80.93% for handwritten characters. Moreover, the YOLOv4-s model had a 99.39% accuracy rate for printed characters; only 33 characters were misjudged. The recognition accuracy of the YOLOv4-s model was higher than that of the traditional YOLOv4 model by 20.57%. Therefore, the proposed ARRS can considerably improve the efficiency of tax declaration, reduce labor costs, and simplify operating procedures.

Song Li ◽  
Mustafa Ozkan Yerebakan ◽  
Yue Luo ◽  
Ben Amaba ◽  
William Swope ◽  

Abstract Voice recognition has become an integral part of our lives, commonly used in call centers and as part of virtual assistants. However, voice recognition is increasingly applied to more industrial uses. Each of these use cases has unique characteristics that may impact the effectiveness of voice recognition, which could impact industrial productivity, performance, or even safety. One of the most prominent among them is the unique background noises that are dominant in each industry. The existence of different machinery and different work layouts are primary contributors to this. Another important characteristic is the type of communication that is present in these settings. Daily communication often involves longer sentences uttered under relatively silent conditions, whereas communication in industrial settings is often short and conducted in loud conditions. In this study, we demonstrated the importance of taking these two elements into account by comparing the performances of two voice recognition algorithms under several background noise conditions: a regular Convolutional Neural Network (CNN) based voice recognition algorithm to an Auto Speech Recognition (ASR) based model with a denoising module. Our results indicate that there is a significant performance drop between the typical background noise use (white noise) and the rest of the background noises. Also, our custom ASR model with the denoising module outperformed the CNN based model with an overall performance increase between 14-35% across all background noises. . Both results give proof that specialized voice recognition algorithms need to be developed for these environments to reliably deploy them as control mechanisms.

2022 ◽  
Vol 12 (2) ◽  
pp. 579
Heonmoo Kim ◽  
Yosoon Choi

In this study, we propose a smart hopper system that automatically unblocks obstructions caused by rocks dropped into hoppers at mining sites. The proposed system captures RGB (red green blue) and D (depth) images of the upper surfaces of hopper models using an RGB-D camera and transmits them to a computer. Then, a virtual hopper system is used to identify rocks via machine vision-based image processing techniques, and an appropriate motion is simulated in a robot arm. Based on the simulation, the robot arm moves to the location of the rock in the real world and removes it from the actual hopper. The recognition accuracy of the proposed model is evaluated in terms of the quantity and location of rocks. The results confirm that rocks are accurately recognized at all positions in the hopper by the proposed system.

2022 ◽  
Vol 2022 ◽  
pp. 1-7
Xiajun Dong ◽  
Bin Huang ◽  
Yuncai Zhou

Aiming at the problem of long retrieval time for massive face image databases under a given threshold, a fast retrieval algorithm for massive face images based on fuzzy clustering is proposed. The algorithm builds a deep convolutional neural network model. The model can be used to extract features from face photos to obtain a high-dimensional vector to represent the high-level semantic features of face photos. On this basis, the fuzzy clustering algorithm is used to perform fuzzy clustering on the feature vectors of the face database to construct a retrieval pedigree map. When the threshold is passed in for database retrieval of the target face photos, the pedigree map can be quickly retrieved. Experiments on the LFW face dataset and self-collected face dataset show that the model is better than the commonly used K-means model in face recognition accuracy, clustering effect, and retrieval speed and has certain commercial value.

2022 ◽  
Vol 15 ◽  
Li Zeng ◽  
Mengsi Lin ◽  
Keyang Xiao ◽  
Jigan Wang ◽  
Hui Zhou

Neuromarketing is an emerging research field for prospective businesses on consumer’s preference. Consumer’s preference prediction based on electroencephalography (EEG) can reliably predict likes or dislikes of a product. However, the current EEG prediction and classification accuracy have yet to reach ideal level. In addition, it is still unclear how different brain region information and different features such as power spectral density, brain asymmetry, differential entropy, and Hjorth parameters affect the prediction accuracy. Our study shows that by taking footwear products as an example, the recognition accuracy of product likes or dislikes reaches 94.22%. Compared with other brain regions, the features of the frontal and occipital brain region obtained a higher prediction accuracy, but the fusion of the features of the whole brain region could improve the prediction accuracy of likes or dislikes even further. Future work would be done to correlate the EEG-based like or dislike prediction results with product sales and self-reports.

Sensors ◽  
2022 ◽  
Vol 22 (2) ◽  
pp. 419
Youchen Fan ◽  
Shuya Zhang ◽  
Kai Feng ◽  
Kechang Qian ◽  
Yitong Wang ◽  

Aiming at the problems of low accuracy of strawberry fruit picking and large rate of mispicking or missed picking, YOLOv5 combined with dark channel enhancement is proposed. In “Fengxiang” strawberry, the criterion of “bad fruit” is added to the conventional three criteria of ripeness, near-ripeness, and immaturity, because some of the bad fruits are close to the color of ripe fruits, but the fruits are small and dry. The training accuracy of the four kinds of strawberries with different ripeness is above 85%, and the testing accuracy is above 90%. Then, to meet the demand of all-day picking and address the problem of low illumination of images collected at night, an enhancement algorithm is proposed to enhance the images, which are recognized. We compare the actual detection results of the five enhancement algorithms, i.e., histogram equalization, Laplace transform, gamma transform, logarithmic variation, and dark channel enhancement processing under the different numbers of fruits, periods, and video tests. The results show that combined with dark channel enhancement, YOLOv5 has the highest recognition rate. Finally, the experimental results demonstrate that YOLOv5 is better than SSD, DSSD, and EfficientDet in terms of recognition accuracy, and the correct rate can reach more than 90%. Meanwhile, the method has good robustness in complex environments such as partial occlusion and multiple fruits.

Sign in / Sign up

Export Citation Format

Share Document