scholarly journals Automated stomata detection in oil palm with convolutional neural network

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Qi Bin Kwong ◽  
Yick Ching Wong ◽  
Phei Ling Lee ◽  
Muhammad Syafiq Sahaini ◽  
Yee Thung Kon ◽  
...  

AbstractStomatal density is an important trait for breeding selection of drought tolerant oil palms; however, its measurement is extremely tedious. To accelerate this process, we developed an automated system. Leaf samples from 128 palms ranging from nursery (1 years old), juvenile (2–3 years old) and mature (> 10 years old) were collected to build an oil palm specific stomata detection model. Micrographs were split into tiles, then used to train a stomata object detection convolutional neural network model through transfer learning. The detection model was then tested on leaf samples acquired from three independent oil palm populations of young seedlings (A), juveniles (B) and productive adults (C). The detection accuracy, measured in precision and recall, was 98.00% and 99.50% for set A, 99.70% and 97.65% for set B, and 99.55% and 99.62% for set C, respectively. The detection model was cross-applied to another set of adult palms using stomata images taken with a different microscope and under different conditions (D), resulting in precision and recall accuracy of 99.72% and 96.88%, respectively. This indicates that the model built generalized well, in addition has high transferability. With the completion of this detection model, stomatal density measurement can be accelerated. This in turn will accelerate the breeding selection for drought tolerance.

Author(s):  
Dima M. Alalharith ◽  
Hajar M. Alharthi ◽  
Wejdan M. Alghamdi ◽  
Yasmine M. Alsenbel ◽  
Nida Aslam ◽  
...  

Computer-based technologies play a central role in the dentistry field, as they present many methods for diagnosing and detecting various diseases, such as periodontitis. The current study aimed to develop and evaluate the state-of-the-art object detection and recognition techniques and deep learning algorithms for the automatic detection of periodontal disease in orthodontic patients using intraoral images. In this study, a total of 134 intraoral images were divided into a training dataset (n = 107 [80%]) and a test dataset (n = 27 [20%]). Two Faster Region-based Convolutional Neural Network (R-CNN) models using ResNet-50 Convolutional Neural Network (CNN) were developed. The first model detects the teeth to locate the region of interest (ROI), while the second model detects gingival inflammation. The detection accuracy, precision, recall, and mean average precision (mAP) were calculated to verify the significance of the proposed model. The teeth detection model achieved an accuracy, precision, recall, and mAP of 100 %, 100%, 51.85%, and 100%, respectively. The inflammation detection model achieved an accuracy, precision, recall, and mAP of 77.12%, 88.02%, 41.75%, and 68.19%, respectively. This study proved the viability of deep learning models for the detection and diagnosis of gingivitis in intraoral images. Hence, this highlights its potential usability in the field of dentistry and aiding in reducing the severity of periodontal disease globally through preemptive non-invasive diagnosis.


Informatics ◽  
2020 ◽  
Vol 17 (2) ◽  
pp. 36-43
Author(s):  
R. S. Vashkevich ◽  
E. S. Azarov

The paper investigates the problem of voice activity detection from a noisy sound signal. An extremely compact convolutional neural network is proposed. The model has only 385 trainable parameters. Proposed model doesn’t require a lot of computational resources that allows to use it as part of the “internet of things” concept for compact low power devices. At the same time the model provides state of the art results in voice activity detection in terms of detection accuracy. The properties of the model are achieved by using a special convolutional layer that considers the harmonic structure of vocal speech. This layer also eliminates redundancy of the model because it has invariance to changes of fundamental frequency. The model performance is evaluated in various noise conditions with different signal-to-noise ratios. The results show that the proposed model provides higher accuracy compared to voice activity detection model from the WebRTC framework by Google.


Author(s):  
Ioannis Papakis ◽  
Abhijit Sarkar ◽  
Andrei Svetovidov ◽  
Jeffrey S. Hickman ◽  
A. Lynn Abbott

This paper describes an approach for automatic detection and localization of drivers and passengers in automobiles using in-cabin images. We used a convolutional neural network (CNN) framework and conducted experiments based on the Faster R-CNN and Cascade R-CNN detectors. Training and evaluation were performed using the Second Strategic Highway Research Program (SHRP 2) naturalistic dataset. In SHRP 2, the cabin images have been blurred to maintain privacy. After detecting occupants inside the vehicle, the system classifies each occupant as driver, front-seat passenger, or back-seat passenger. For one SHRP 2 test set, the system detected occupants with an accuracy of 94.5%. Those occupants were correctly classified as front-seat passenger with an accuracy of 97.3%, as driver with 99.5% accuracy, and as back-seat passenger with 94.3% accuracy. The system performed slightly better for daytime images than for nighttime images. Unlike previous work, this method is capable of presence classification and location prediction of occupants. By fine-tuning the object detection model, there is also significant improvement in detection accuracy as compared with pretrained models. The study also provides a fully annotated dataset of in-cabin images. This work is expected to facilitate research involving interactions between drivers and passengers, particularly related to driver attention and safety.


2019 ◽  
Vol 9 (14) ◽  
pp. 2867 ◽  
Author(s):  
Hongyan Xu ◽  
Xiu Su ◽  
Yi Wang ◽  
Huaiyu Cai ◽  
Kerang Cui ◽  
...  

Concrete bridge crack detection is critical to guaranteeing transportation safety. The introduction of deep learning technology makes it possible to automatically and accurately detect cracks in bridges. We proposed an end-to-end crack detection model based on the convolutional neural network (CNN), taking the advantage of atrous convolution, Atrous Spatial Pyramid Pooling (ASPP) module and depthwise separable convolution. The atrous convolution obtains a larger receptive field without reducing the resolution. The ASPP module enables the network to extract multi-scale context information, while the depthwise separable convolution reduces computational complexity. The proposed model achieved a detection accuracy of 96.37% without pre-training. Experiments showed that, compared with traditional classification models, the proposed model has a better performance. Besides, the proposed model can be embedded in any convolutional network as an effective feature extraction structure.


2020 ◽  
Vol 14 ◽  
Author(s):  
Guoqiang Chen ◽  
Bingxin Bai ◽  
Huailong Yi

Background: Background: The development of deep learning technology has promoted the industrial intelligence, and automatic driving vehicles have become a hot research direction. As to the problem that pavement potholes threaten the safety of automatic driving vehicles, the pothole detection under complex environment conditions is studied. Objective: The goal of the work is to propose a new model of pavement pothole detection based on convolutional neural network. The main contribution is that the Multi-level Feature Fusion Block and the Detector Cascading Block are designed and a series of detectors are cascaded together to improve the detection accuracy of the proposed model. Methods: A pothole detection model is designed based on the original object detection model. In the study, the Transfer Connection Block in the Object Detection Module is removed and the Multi-level Feature Fusion Block is redesigned. At the same time, a Detector Cascading Block with multi-step detection is designed. Detectors are connected directly to the feature map and cascaded. In addition, the structure skips the transformation step. Results: The proposed method can be used to detect potholes efficiently. The real-time and accuracy of the model are improved after adjusting the network parameters and redesigning the model structure. The maximum detection accuracy of the proposed model is 75.24%. Conclusion: The Multi-level Feature Fusion Block designed enhances the fusion of high and low layer feature information and is conducive to extracting a large amount of target information. The Detector Cascade Block is a detector with cascade structure, which can realize more accurate prediction of the object. In a word, the model designed has greatly improved the detection accuracy and speed, which lays a solid foundation for pavement pothole detection under complex environmental conditions.


2019 ◽  
Vol 8 (3) ◽  
pp. 4494-4499 ◽  

Deep learning had provided good outcome in analyzing images of tumours, however, the deficiency of large annotated datasets reduces its importance. The proposed medical image processing system is based on image segmentation and image classification. It is to be used by medical field experts. In order to classify the Brain tumour images the semantic level classification and segmentation network techniques are applied. This includes prior knowledge of testing samples and training samples, using Convolutional Neural Networks (CNN). The CNN based classifier improves the detection accuracy compared to the existing segmentation based classifier. In this project, the automated system would help the medical image analyst to identify the Brain Tumour in patient by making use of deep convolutional neural network (CNN).The image is obtained from MRI scan of a brain. The tumourless patient’s image dataset is used as the training and testing data for the classification network. Patient image is compared with dataset of a tumour affected images for differentiating an image as Non-timorous sample, low grade glioma and glioblastoma after segmenting and classifying the image. The Watershed segmentation algorithm is used for segmenting images and CNN is used for classifying the images. Finally the system will detect the tumour is affected or not in the given image of a patient’s brain, then the system will identify the tumour affected region and differentiate the low grade glioma and glioblastoma in the image.


2020 ◽  
Vol 37 (5) ◽  
pp. 807-813
Author(s):  
Dabing Jin ◽  
Shiqing Xu ◽  
Lianjie Tong ◽  
Linyu Wu ◽  
Shimin Liu

The float glass contains various defects for reasons of raw materials and production process. These defects can be observed on the end images of the glass. Since the defects are correlated with specific links of the production process, it is possible to discover the process problems by identifying the location and type of defects in end images. Based on faster region-based convolutional neural network (Faster RCNN), this paper proposes a deep learning method that improves the feature extraction network, and adds a Laplacian convolutional layer to preprocess the end images. Considering the defect features in end images, the anchor box size was adjusted to speed up the training. Besides, the lack of generalizability induced by small dataset was solved through data enhancement. With improved VGG16 as the feature extraction layer, a glass defect detection model was established, whose generalizability was improved through transfer learning. The experimental results show that the proposed model achieved a mean detection accuracy of 94% on actual test set, meeting the requirements for actual use in factories.


Author(s):  
Fei Rong ◽  
Li Shasha ◽  
Xu Qingzheng ◽  
Liu Kun

The Station logo is a way for a TV station to claim copyright, which can realize the analysis and understanding of the video by the identification of the station logo, so as to ensure that the broadcasted TV signal will not be illegally interfered. In this paper, we design a station logo detection method based on Convolutional Neural Network by the characteristics of the station, such as small scale-to-height ratio change and relatively fixed position. Firstly, in order to realize the preprocessing and feature extraction of the station data, the video samples are collected, filtered, framed, labeled and processed. Then, the training sample data and the test sample data are divided proportionally to train the station detection model. Finally, the sample is tested to evaluate the effect of the training model in practice. The simulation experiments prove its validity.


Sign in / Sign up

Export Citation Format

Share Document