Fast Recognition System for Tree Images based on Caffe Platform and Deep Learning

Abstract Aiming at the difficult problem of complex extraction for tree image in the existing complex background, we took tree species as the research object and proposed a fast recognition system solution for tree image based on Caffe platform and deep learning. In the research of deep learning algorithm based on Caffe framework, the improved Dual-Task CNN model (DCNN) is applied to train the image extractor and classifier to accomplish the dual tasks of image cleaning and tree classification. In addition, when compared with the traditional classification methods represented by Support Vector Machine (SVM) and Single-Task CNN model, Dual-Task CNN model demonstrates its superiority in classification performance. Then, in order for further improvement to the recognition accuracy for similar species, Gabor kernel was introduced to extract the features of frequency domain for images in different scales and directions, so as to enhance the texture features of leaf images and improve the recognition effect. The improved model was tested on the data sets of similar species. As demonstrated by the results, the improved deep Gabor convolutional neural network (GCNN) is advantageous in tree recognition and similar tree classification when compared with the Dual-Task CNN classification method. Finally, the recognition results of trees can be displayed on the application graphical interface as well. In the application graphical interface designed based on Ubantu system, it is capable to perform such functions as quick reading of and search for picture files, snapshot, one-key recognition, one-key e

Download Full-text

Deep learning-based framework for the distinction of membranous nephropathy: a new approach through hyperspectral imagery

BMC Nephrology ◽

10.1186/s12882-021-02421-y ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Tianqi Tu ◽

Xueling Wei ◽

Yue Yang ◽

Nianrong Zhang ◽

Wei Li ◽

...

Keyword(s):

Deep Learning ◽

Renal Biopsy ◽

Membranous Nephropathy ◽

Learning Algorithm ◽

Hyperspectral Imagery ◽

Chinese Patients ◽

Support Vector ◽

Deep Learning Algorithm ◽

The Difference ◽

Complex Deposition

Abstract Background Common subtypes seen in Chinese patients with membranous nephropathy (MN) include idiopathic membranous nephropathy (IMN) and hepatitis B virus-related membranous nephropathy (HBV-MN). However, the morphologic differences are not visible under the light microscope in certain renal biopsy tissues. Methods We propose here a deep learning-based framework for processing hyperspectral images of renal biopsy tissue to define the difference between IMN and HBV-MN based on the component of their immune complex deposition. Results The proposed framework can achieve an overall accuracy of 95.04% in classification, which also leads to better performance than support vector machine (SVM)-based algorithms. Conclusion IMN and HBV-MN can be correctly separated via the deep learning framework using hyperspectral imagery. Our results suggest the potential of the deep learning algorithm as a new method to aid in the diagnosis of MN.

Download Full-text

Emotion Recognition System for Specially Needed People with Optimized Deep Learning Algorithm

2020 Fourth International Conference on Inventive Systems and Control (ICISC) ◽

10.1109/icisc47916.2020.9171190 ◽

2020 ◽

Author(s):

Arul Valiyavalappil Haridas ◽

Ramalatha Marimuthu ◽

Basabi Chakraborty

Keyword(s):

Deep Learning ◽

Emotion Recognition ◽

Learning Algorithm ◽

Recognition System ◽

Deep Learning Algorithm

Download Full-text

Deep learning para la clasificación de usos de suelo agrícola con Sentinel-2

Revista de Teledetección ◽

10.4995/raet.2020.13337 ◽

2020 ◽

pp. 35

Author(s):

M. Campos-Taberner ◽

F.J. García-Haro ◽

B. Martínez ◽

M.A. Gilabert

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Near Infrared ◽

Spatial Information ◽

Learning Algorithm ◽

Support Vector ◽

Sensing Applications ◽

Learning Techniques ◽

Remote Sensing Applications ◽

Sentinel 2

<p class="p1">The use of deep learning techniques for remote sensing applications has recently increased. These algorithms have proven to be successful in estimation of parameters and classification of images. However, little effort has been made to make them understandable, leading to their implementation as “black boxes”. This work aims to evaluate the performance and clarify the operation of a deep learning algorithm, based on a bi-directional recurrent network of long short-term memory (2-BiLSTM). The land use classification in the Valencian Community based on Sentinel-2 image time series in the framework of the common agricultural policy (CAP) is used as an example. It is verified that the accuracy of the deep learning techniques is superior (98.6 % overall success) to that other algorithms such as decision trees (DT), k-nearest neighbors (k-NN), neural networks (NN), support vector machines (SVM) and random forests (RF). The performance of the classifier has been studied as a function of time and of the predictors used. It is concluded that, in the study area, the most relevant information used by the network in the classification are the images corresponding to summer and the spectral and spatial information derived from the red and near infrared bands. These results open the door to new studies in the field of the explainable deep learning in remote sensing applications.</p>

Download Full-text

Deep learning based facial expressions recognition system for assisting visually impaired persons

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v9i3.2030 ◽

2020 ◽

Vol 9 (3) ◽

pp. 1208-1219

Author(s):

Hendra Kusuma ◽

Muhammad Attamimi ◽

Hasby Fahrudin

Keyword(s):

Deep Learning ◽

Facial Expressions ◽

Visually Impaired ◽

Learning Algorithm ◽

Recognition System ◽

Verbal Information ◽

Deep Learning Algorithm ◽

Facial Recognition System ◽

Visually Impaired Persons ◽

Facial Expressions Recognition

In general, a good interaction including communication can be achieved when verbal and non-verbal information such as body movements, gestures, facial expressions, can be processed in two directions between the speaker and listener. Especially the facial expression is one of the indicators of the inner state of the speaker and/or the listener during the communication. Therefore, recognizing the facial expressions is necessary and becomes the important ability in communication. Such ability will be a challenge for the visually impaired persons. This fact motivated us to develop a facial recognition system. Our system is based on deep learning algorithm. We implemented the proposed system on a wearable device which enables the visually impaired persons to recognize facial expressions during the communication. We have conducted several experiments involving the visually impaired persons to validate our proposed system and the promising results were achieved.

Download Full-text

Using geometric constraints to improve performance of image classifiers for automatic segmentation of traffic signs

GEOMATICA ◽

10.1139/geomat-2020-0010 ◽

2021 ◽

pp. 1-23

Author(s):

Roholah Yazdan ◽

Masood Varshosaz ◽

Saied Pirasteh ◽

Fabio Remondino

Keyword(s):

Deep Learning ◽

Learning Algorithm ◽

Automatic Segmentation ◽

Geometric Constraints ◽

Support Vector ◽

Svm Classifier ◽

Data Set ◽

Traffic Signs ◽

Deep Learning Algorithm ◽

Comparison Results

Automatic detection and recognition of traffic signs from images is an important topic in many applications. At first, we segmented the images using a classification algorithm to delineate the areas where the signs are more likely to be found. In this regard, shadows, objects having similar colours, and extreme illumination changes can significantly affect the segmentation results. We propose a new shape-based algorithm to improve the accuracy of the segmentation. The algorithm works by incorporating the sign geometry to filter out the wrong pixels from the classification results. We performed several tests to compare the performance of our algorithm against those obtained by popular techniques such as Support Vector Machine (SVM), K-Means, and K-Nearest Neighbours. In these tests, to overcome the unwanted illumination effects, the images are transformed into colour spaces Hue, Saturation, and Intensity, YUV, normalized red green blue, and Gaussian. Among the traditional techniques used in this study, the best results were obtained with SVM applied to the images transformed into the Gaussian colour space. The comparison results also suggested that by adding the geometric constraints proposed in this study, the quality of sign image segmentation is improved by 10%–25%. We also comparted the SVM classifier enhanced by incorporating the geometry of signs with a U-Shaped deep learning algorithm. Results suggested the performance of both techniques is very close. Perhaps the deep learning results could be improved if a more comprehensive data set is provided.

Download Full-text

Multipose Face Recognition-Based Combined Adaptive Deep Learning Vector Quantization

Computational Intelligence and Neuroscience ◽

10.1155/2020/8821868 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Shahenda Sarhan ◽

Aida A. Nasr ◽

Mahmoud Y. Shams

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Face Recognition ◽

Vector Quantization ◽

Confusion Matrix ◽

Recognition System ◽

Learning Vector Quantization ◽

Majority Voting ◽

Support Vector ◽

Reliability And Robustness

Multipose face recognition system is one of the recent challenges faced by the researchers interested in security applications. Different researches have been introduced discussing the accuracy improvement of multipose face recognition through enhancing the face detector as Viola-Jones, Real Adaboost, and Cascade Object Detector while others concentrated on the recognition systems as support vector machine and deep convolution neural networks. In this paper, a combined adaptive deep learning vector quantization (CADLVQ) classifier is proposed. The proposed classifier has boosted the weakness of the adaptive deep learning vector quantization classifiers through using the majority voting algorithm with the speeded up robust feature extractor. Experimental results indicate that, the proposed classifier provided promising results in terms of sensitivity, specificity, precision, and accuracy compared to recent approaches in deep learning, statistical, and classical neural networks. Finally, the comparison is empirically performed using confusion matrix to ensure the reliability and robustness of the proposed system compared to the state-of art.

Download Full-text

Hand Gesture Recognition Based on Single-Shot Multibox Detector Deep Learning

Mobile Information Systems ◽

10.1155/2019/3410348 ◽

2019 ◽

Vol 2019 ◽

pp. 1-7 ◽

Cited By ~ 2

Author(s):

Peng Liu ◽

Xiangxiang Li ◽

Haiting Cui ◽

Shanshan Li ◽

Yafei Yuan

Keyword(s):

Deep Learning ◽

Gesture Recognition ◽

Learning Algorithm ◽

Recognition System ◽

Hand Gesture Recognition ◽

Hand Gesture ◽

Single Shot ◽

Complex Scene ◽

Hand Gestures ◽

Benchmark Database

Hand gesture recognition is an intuitive and effective way for humans to interact with a computer due to its high processing speed and recognition accuracy. This paper proposes a novel approach to identify hand gestures in complex scenes by the Single-Shot Multibox Detector (SSD) deep learning algorithm with 19 layers of a neural network. A benchmark database with gestures is used, and general hand gestures in the complex scene are chosen as the processing objects. A real-time hand gesture recognition system based on the SSD algorithm is constructed and tested. The experimental results show that the algorithm quickly identifies humans’ hands and accurately distinguishes different types of gestures. Furthermore, the maximum accuracy is 99.2%, which is significantly important for human-computer interaction application.

Download Full-text

Deep learning-based framework for the distinction of membranous nephropathy: A new approach through hyperspectral imagery

10.21203/rs.3.rs-30329/v3 ◽

2021 ◽

Author(s):

Tianqi Tu ◽

Xueling Wei ◽

Yue Yang ◽

Nianrong Zhang ◽

Wei Li ◽

...

Keyword(s):

Deep Learning ◽

Renal Biopsy ◽

Membranous Nephropathy ◽

Learning Algorithm ◽

Hyperspectral Imagery ◽

Chinese Patients ◽

Support Vector ◽

Deep Learning Algorithm ◽

The Difference ◽

Complex Deposition

Abstract Background: Common subtypes seen in Chinese patients with membranous nephropathy (MN) include idiopathic membranous nephropathy (IMN) and hepatitis B virus-related membranous nephropathy (HBV-MN). However, the morphologic differences are not visible under the light microscope in certain renal biopsy tissues. Methods: We propose here a deep learning-based framework for processing hyperspectral images of renal biopsy tissue to define the difference between IMN and HBV-MN based on the component of their immune complex deposition. Results: The proposed framework can achieve an overall accuracy of 95.04% in classification, which also leads to better performance than support vector machine (SVM)-based algorithms. Conclusion: IMN and HBV-MN can be correctly separated via the deep learning framework using hyperspectral imagery. Our results suggest the potential of the deep learning algorithm as a new method to aid in the diagnosis of MN.

Download Full-text

A Feature-Extraction-Based Lightweight Convolutional and Recurrent Neural Networks Adaptive Computing Model for Container Terminal Liner Handling Volume Forecasting

Discrete Dynamics in Nature and Society ◽

10.1155/2021/6721564 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Bin Li ◽

Yuqing He

Keyword(s):

Feature Extraction ◽

Deep Learning ◽

Large Scale ◽

Learning Algorithm ◽

Container Terminal ◽

Support Vector ◽

Adaptive Computing ◽

Support Design ◽

Computing Model ◽

Design Paradigm

The synergy of computational logistics and deep learning provides a new methodology and solution to the operational decisions of container terminal handling systems (CTHS) at the strategic, tactical, and executive levels. Above all, the container terminal logistics tactical operational complexity is discussed by computational logistics, and the liner handling volume (LHV) has important influences on a series of terminal scheduling decision problems. Subsequently, a feature-extraction-based lightweight convolutional and recurrent neural network adaptive computing model (FEB-LCR-ACM) is presented initially to predict the LHV by the fusion of multiple deep learning algorithms and mechanisms, especially for the specific feature extraction package of tsfresh. Consequently, the container-terminal-oriented logistics service scheduling decision support design paradigm is put forward tentatively by FEB-LCR-ACM. Finally, a typical large-scale container terminal of China is chosen to implement, execute, and evaluate the FEB-LCR-ACM based on the terminal running log around the indicator of LHV. In the case of severe vibration of LHV between 2 twenty-foot equivalent units (TEUs) and 4215 TEUs, while forecasting the LHV of 300 liners by the log of five years, the forecasting error within 100 TEUs almost accounts for 80%. When predicting the operation of 350 ships by the log of six years, the forecasting deviation within 100 TEUs reaches up to nearly 90%. The abovementioned two deep learning experimental performances with FEB-LCR-ACM are so far ahead of the forecasting results by the classical machine learning algorithm that is similar to Gaussian support vector machine. Consequently, the FEB-LCR-ACM achieves sufficiently good performance for the LHV prediction with a lightweight deep learning architecture based on the typical small datasets, and then it is supposed to overcome the operational nonlinearity, dynamics, coupling, and complexity of CTHS partially.

Download Full-text

An Improved Arabic Handwritten Recognition System using Deep Support Vector Machines

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2016040101 ◽

2016 ◽

Vol 7 (2) ◽

pp. 1-20 ◽

Cited By ~ 9

Author(s):

Mohamed Elleuch ◽

Monji Kherallah

Keyword(s):

Deep Learning ◽

Learning Algorithms ◽

Recognition System ◽

Machine Learning Algorithms ◽

Support Vector ◽

Learning Approaches ◽

Arabic Characters ◽

Rbf Kernel ◽

Data Points ◽

Handwritten Recognition

Deep learning algorithms, as a machine learning algorithms developed in recent years, have been successfully applied in various domains of computer vision, such as face recognition, object detection and image classification. These Deep algorithms aim at extracting a high representation of the data via multi-layers in a deep hierarchical structure. However, to the authors' knowledge, these deep learning approaches have not been extensively studied to recognize Arabic Handwritten Script (AHS). In this paper, they present a deep learning model based on Support Vector Machine (SVM) named Deep SVM. This model has an inherent ability to select data points crucial to classify good generalization capabilities. The deep SVM is constructed by a stack of SVMs allowing to extracting/learning automatically features from the raw images and to perform classification as well. The Multi-class SVM with an RBF kernel, as non-linear discriminative features for classification, was chosen and tested on Handwritten Arabic Characters Database (HACDB). Simulation results show the effectiveness of the proposed model.

Download Full-text