Deep Learning for Mango (Mangifera indica) Panicle Stage Classification

Agronomy ◽  
2020 ◽  
Vol 10 (1) ◽  
pp. 143 ◽  
Author(s):  
Anand Koirala ◽  
Kerry B. Walsh ◽  
Zhenglin Wang ◽  
Nicholas Anderson

A pixel-based segmentation method was demonstrated to be confounded by developmental stage when estimating flowering level of mango. Categorization of panicles into three developmental stages was therefore undertaken with single-stage and two-stage deep learning frameworks (YOLO and R2CNN), using either upright or rotated bounding boxes. For a validation image set and for total panicle count, the models MangoYOLO(-upright), MangoYOLO-rotated, YOLOv3-rotated, R2CNN(-rotated) and R2CNN-upright achieved: (i) RMSEs of 25.6, 16.0, 15.4, 25.8 and 32.3 panicles per tree image, (ii) mean average precision (mAP) scores of 72.2, 69.1, 65.0, 62.5 and 70.9%, and (iii) weighted F1-scores of 76.5, 76.1, 74.9, 74.0 and 82.0, respectively. For a test set of images involving a different orchard and cultivar and the use of a different camera, the R2 for machine vision to human count of panicles per tree was 0.86, 0.80, 0.83, 0.81 and 0.76 for the same models, respectively. Thus, the models generalised well, but with no consistent benefit from the use of rotated over upright bounding boxes. While the YOLOv3-rotated model was superior in terms of total panicle count, the R2CNN-upright model was more accurate for panicle stage classification. To demonstrate practical application, panicle counts were made weekly for an orchard of 994 trees, with a peak detection routine applied to document multiple flowering events.
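The practical application in the last sentence rests on a peak detection routine over weekly panicle counts. The following is a minimal sketch of that general idea, not the authors' routine: the weekly totals, smoothing window and prominence threshold are all assumed for illustration, with SciPy's find_peaks locating distinct flowering events.

```python
import numpy as np
from scipy.signal import find_peaks
from scipy.ndimage import uniform_filter1d

# Hypothetical weekly totals of detected panicles for the whole orchard
# (one value per weekly imaging run); real values would come from summing
# per-tree detection counts over the 994 trees.
weekly_counts = np.array([120, 340, 910, 1500, 1230, 800, 650, 900, 1400, 1100, 500, 200])

# Light smoothing to suppress week-to-week detection noise (window size is an assumption).
smoothed = uniform_filter1d(weekly_counts.astype(float), size=3)

# Peaks with a minimum prominence correspond to distinct flowering events.
peaks, _ = find_peaks(smoothed, prominence=200)

for week, value in zip(peaks, smoothed[peaks]):
    print(f"Flowering event around week {week}: ~{value:.0f} panicles detected")
```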


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Jiajia Chen ◽  
Baocan Zhang

The task of segmenting cytoplasm in cytology images is one of the most challenging tasks in cervical cytological analysis due to the presence of fuzzy and highly overlapping cells. Deep learning-based diagnostic technology has proven effective in segmenting complex medical images. We present a two-stage framework based on Mask RCNN to automatically segment overlapping cells. In stage one, candidate cytoplasm bounding boxes are proposed. In stage two, pixel-to-pixel alignment is used to refine the boundary, and category classification is performed. The performance of the proposed method is evaluated on publicly available datasets from ISBI 2014 and 2015. The experimental results demonstrate that our method outperforms other state-of-the-art approaches, with a DSC of 0.92 and an FPRp of 0.0008 at a DSC threshold of 0.8. These results indicate that our Mask RCNN-based segmentation method could be effective in cytological analysis.
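The reported DSC of 0.92 at a threshold of 0.8 refers to the Dice similarity coefficient between predicted and ground-truth cytoplasm masks. Below is a minimal sketch of how that overlap metric is computed; the toy masks and threshold check are illustrative, not the ISBI evaluation code.

```python
import numpy as np

def dice_coefficient(pred_mask: np.ndarray, gt_mask: np.ndarray) -> float:
    """DSC = 2|A ∩ B| / (|A| + |B|) for binary masks."""
    pred = pred_mask.astype(bool)
    gt = gt_mask.astype(bool)
    intersection = np.logical_and(pred, gt).sum()
    total = pred.sum() + gt.sum()
    return 2.0 * intersection / total if total > 0 else 1.0

# Toy example: a predicted cytoplasm mask shifted by one pixel from the ground truth.
gt = np.zeros((64, 64), dtype=np.uint8)
gt[10:40, 10:40] = 1
pred = np.zeros_like(gt)
pred[11:41, 11:41] = 1

dsc = dice_coefficient(pred, gt)
print(f"DSC = {dsc:.3f}, counted as good: {dsc >= 0.8}")  # 0.8 is the threshold used in the evaluation
```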


2021 ◽  
Vol 33 (1) ◽  
Author(s):  
Majedaldein Almahasneh ◽  
Adeline Paiement ◽  
Xianghua Xie ◽  
Jean Aboudarham

Abstract Precisely localising solar Active Regions (AR) from multi-spectral images is a challenging but important task in understanding solar activity and its influence on space weather. A main challenge comes from each modality capturing a different location of the 3D objects, as opposed to typical multi-spectral imaging scenarios where all image bands observe the same scene. Thus, we refer to this special multi-spectral scenario as multi-layer. We present a multi-task deep learning framework that exploits the dependencies between image bands to produce 3D AR localisation (segmentation and detection), where different image bands (and physical locations) have their own set of results. Furthermore, to address the difficulty of producing dense AR annotations for training supervised machine learning (ML) algorithms, we adapt a training strategy based on weak labels (i.e. bounding boxes) applied in a recursive manner. We compare our detection and segmentation stages against baseline approaches for solar image analysis (multi-channel coronal hole detection, SPOCA for ARs) and state-of-the-art deep learning methods (Faster RCNN, U-Net). Additionally, both detection and segmentation stages are quantitatively validated on artificially created data of similar spatial configurations, made from annotated multi-modal magnetic resonance images. On the artificial dataset, our framework achieves an average IoU of 0.72 (segmentation) and F1 score of 0.90 (detection) across all modalities, compared to scores of 0.53 and 0.58 for the best-performing baseline methods; in the AR detection task it achieves an F1 score of 0.84, compared to a baseline of 0.82. Our segmentation results are qualitatively validated by an expert on real ARs.
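The weak-label strategy mentioned above (training from bounding boxes in a recursive manner) can be pictured as an iterative bootstrap: boxes seed coarse masks, a model is trained on them, and its predictions become the labels for the next round. The sketch below is one plausible reading of that idea under stated assumptions; the train/predict callbacks, the 0.5 threshold, the clipping of predictions to the boxes, and the number of rounds are not taken from the paper.

```python
import numpy as np

def boxes_to_mask(boxes, shape):
    """Turn weak labels (bounding boxes) into a coarse initial segmentation mask."""
    mask = np.zeros(shape, dtype=np.uint8)
    for x0, y0, x1, y1 in boxes:
        mask[y0:y1, x0:x1] = 1
    return mask

def refine_labels(images, boxes_per_image, train_fn, predict_fn, n_rounds=3):
    """Recursive weak-label training: each round trains on the current masks,
    then replaces them with the model's predictions restricted to the boxes."""
    masks = [boxes_to_mask(b, img.shape[:2]) for img, b in zip(images, boxes_per_image)]
    for _ in range(n_rounds):
        train_fn(images, masks)                    # supervised step on the current (noisy) masks
        preds = [predict_fn(img) for img in images]  # per-pixel probability maps
        masks = [np.logical_and(p > 0.5, boxes_to_mask(b, p.shape)).astype(np.uint8)
                 for p, b in zip(preds, boxes_per_image)]
    return masks
```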


2019 ◽  
Vol 20 (S16) ◽  
Author(s):  
Ye Yuan ◽  
Kebin Jia ◽  
Fenglong Ma ◽  
Guangxu Xun ◽  
Yaqing Wang ◽  
...  

Abstract Background Sleep is a complex and dynamic biological process characterized by different sleep patterns. Comprehensive sleep monitoring and analysis using multivariate polysomnography (PSG) records has driven significant efforts to prevent sleep-related disorders. To reduce the time consumed by manual visual inspection of PSG, automatic multivariate sleep stage classification has become an important research topic in medicine and bioinformatics. Results We present a unified hybrid self-attention deep learning framework, namely HybridAtt, to automatically classify sleep stages by capturing channel and temporal correlations from multivariate PSG records. We construct a new multi-view convolutional representation module to learn channel-specific and global-view features from the heterogeneous PSG inputs. The hybrid attention mechanism is designed to further fuse the multi-view features by inferring their dependencies without any additional supervision. The learned attentional representation is subsequently fed through a softmax layer to train an end-to-end deep learning model. Conclusions We empirically evaluate our proposed HybridAtt model on a benchmark PSG dataset in two feature domains, referred to as the time and frequency domains. Experimental results show that HybridAtt consistently outperforms ten baseline methods in both feature spaces, demonstrating the effectiveness of HybridAtt in the task of sleep stage classification.
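As a rough picture of the fusion step described above, the sketch below applies a single learned attention weighting over per-view feature vectors and feeds the fused representation to a softmax classifier. The dimensions, the one-layer attention scoring and the five sleep stages are assumptions; this is not the HybridAtt architecture itself.

```python
import torch
import torch.nn as nn

class AttentionFusionClassifier(nn.Module):
    """Toy sketch: fuse per-view feature vectors with learned attention weights,
    then classify sleep stages with a softmax layer."""
    def __init__(self, feat_dim: int, n_stages: int = 5):
        super().__init__()
        self.score = nn.Linear(feat_dim, 1)          # one relevance score per view
        self.classifier = nn.Linear(feat_dim, n_stages)

    def forward(self, view_feats: torch.Tensor) -> torch.Tensor:
        # view_feats: (batch, n_views, feat_dim), e.g. channel-specific + global views
        weights = torch.softmax(self.score(view_feats), dim=1)    # (batch, n_views, 1)
        fused = (weights * view_feats).sum(dim=1)                 # attention-weighted sum
        return torch.log_softmax(self.classifier(fused), dim=-1)  # per-stage log-probabilities

# Usage on dummy data: 8 PSG epochs, 4 views, 128-d features each (all assumed sizes)
model = AttentionFusionClassifier(feat_dim=128)
logits = model(torch.randn(8, 4, 128))
print(logits.shape)  # torch.Size([8, 5])
```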


2020 ◽  
Vol 71 (7) ◽  
pp. 868-880
Author(s):  
Nguyen Hong-Quan ◽  
Nguyen Thuy-Binh ◽  
Tran Duc-Long ◽  
Le Thi-Lan

Along with the strong development of camera networks, video analysis systems have become more and more popular and have been applied in various practical applications. In this paper, we focus on the person re-identification (person ReID) task, which is a crucial step in video analysis systems. The purpose of person ReID is to associate multiple images of a given person moving through a non-overlapping camera network. Many efforts have been devoted to person ReID. However, most studies on person ReID only deal with well-aligned bounding boxes that are detected manually and considered perfect inputs for person ReID. In fact, when building a fully automated person ReID system, the quality of the two preceding steps, person detection and tracking, may have a strong effect on person ReID performance. The contributions of this paper are two-fold. First, a unified framework for person ReID based on deep learning models is proposed. In this framework, a deep neural network for person detection is coupled with a deep-learning-based tracking method. Besides, features extracted from an improved ResNet architecture are proposed for person representation to achieve higher ReID accuracy. Second, our self-built dataset is introduced and employed for evaluation of all three steps in the fully automated person ReID framework.
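To make the ReID matching step concrete, the sketch below embeds person crops with a stock ResNet-50 and ranks gallery images by cosine similarity to a query. It only illustrates the matching idea: the paper's improved ResNet, its detection and tracking stages, and the file names below are assumptions, not the authors' pipeline.

```python
import torch
import torch.nn.functional as F
from torchvision import models, transforms
from PIL import Image

# Standard ImageNet-pretrained ResNet-50, with the classification head removed
# so the 2048-d pooled features serve as person embeddings.
backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
backbone.fc = torch.nn.Identity()
backbone.eval()

preprocess = transforms.Compose([
    transforms.Resize((256, 128)),  # typical ReID crop size (an assumption)
    transforms.ToTensor(),
    transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
])

def embed(path: str) -> torch.Tensor:
    img = preprocess(Image.open(path).convert("RGB")).unsqueeze(0)
    with torch.no_grad():
        return F.normalize(backbone(img), dim=1)   # L2-normalised embedding

query = embed("query_person.jpg")                               # hypothetical file names
gallery = {p: embed(p) for p in ["cam2_001.jpg", "cam3_017.jpg"]}
ranked = sorted(gallery, key=lambda p: -(query @ gallery[p].T).item())  # cosine similarity
print("Best match:", ranked[0])
```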


2020 ◽  
Author(s):  
Raniyaharini R ◽  
Madhumitha K ◽  
Mishaa S ◽  
Virajaravi R

Author(s):  
D. Franklin Vinod ◽  
V. Vasudevan

Background: With the explosive growth of global data, the term Big Data describes datasets whose enormous size requires detailed analysis. Big data analytics reveals hidden patterns and correlations among the values. The major challenges in Big Data analysis arise from increases in volume, variety, and velocity. Capturing images from multi-directional views motivates image set classification, which is an attractive research area in volumetric-based medical image processing. Methods: This paper proposes Local N-ary Ternary Patterns (LNTP) and a Modified Deep Belief Network (MDBN) to alleviate the dimensionality and robustness issues. Initially, the proposed LNTP-MDBN uses a filtering technique to identify and remove dependent and independent noise from the images. Then, smoothing and normalization techniques applied to the filtered images improve their intensity. Results: The LNTP-based feature extraction groups the heterogeneous images into different categories and extracts the features from each category. Based on the extracted features, the modified DBN finally classifies the image set into normal and abnormal categories. Conclusion: The comparative analysis of the proposed LNTP-MDBN with existing pattern extraction and DBN learning models regarding classification accuracy and runtime confirms its effectiveness in mining applications.
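The LNTP descriptor builds on ternary coding of local neighbourhoods. As background only, the sketch below shows the classic local ternary pattern on a single 3x3 patch, with neighbours coded +1/0/-1 around a tolerance band; the paper's N-ary generalisation and its exact parameters are not shown, and the tolerance value is an assumption.

```python
import numpy as np

def local_ternary_pattern(patch: np.ndarray, t: float = 5.0) -> np.ndarray:
    """Classic local ternary pattern for a 3x3 patch: each neighbour is coded
    +1 / 0 / -1 depending on whether it lies above, within, or below a
    tolerance band of width t around the centre pixel."""
    center = patch[1, 1]
    neighbours = np.delete(patch.flatten(), 4)   # the 8 surrounding pixels
    codes = np.zeros(8, dtype=np.int8)
    codes[neighbours >= center + t] = 1
    codes[neighbours <= center - t] = -1
    return codes

patch = np.array([[52, 60, 55],
                  [48, 54, 70],
                  [40, 53, 58]], dtype=float)
print(local_ternary_pattern(patch))  # [ 0  1  0 -1  1 -1  0  0] — a rough texture signature
```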


2020 ◽  
Author(s):  
Jinseok Lee

BACKGROUND The coronavirus disease (COVID-19) has spread explosively worldwide since the beginning of 2020. According to a multinational consensus statement from the Fleischner Society, computed tomography (CT) can be used as a relevant screening tool owing to its higher sensitivity for detecting early pneumonic changes. However, physicians are extremely busy fighting COVID-19 in this era of worldwide crisis. Thus, it is crucial to accelerate the development of an artificial intelligence (AI) diagnostic tool to support physicians. OBJECTIVE We aimed to rapidly develop an AI technique to diagnose COVID-19 pneumonia and differentiate it from non-COVID pneumonia and non-pneumonia diseases on CT. METHODS A simple 2D deep learning framework, named the fast-track COVID-19 classification network (FCONet), was developed to diagnose COVID-19 pneumonia based on a single chest CT image. FCONet was developed by transfer learning, using one of four state-of-the-art pre-trained deep learning models (VGG16, ResNet50, InceptionV3, or Xception) as a backbone. For training and testing of FCONet, we collected 3,993 chest CT images of patients with COVID-19 pneumonia, other pneumonia, and non-pneumonia diseases from Wonkwang University Hospital, Chonnam National University Hospital, and the Italian Society of Medical and Interventional Radiology public database. These CT images were split into training and testing sets at a ratio of 8:2. On the test dataset, the diagnostic performance for COVID-19 pneumonia was compared among the four pre-trained FCONet models. In addition, we tested the FCONet models on an external testing dataset extracted from the low-quality chest CT images of COVID-19 pneumonia embedded in recently published papers. RESULTS Of the four pre-trained FCONet models, the ResNet50-based model showed excellent diagnostic performance (sensitivity 99.58%, specificity 100%, and accuracy 99.87%) and outperformed the other three pre-trained models on the testing dataset. On the additional external test dataset of low-quality CT images, the detection accuracy of the ResNet50 model was the highest (96.97%), followed by Xception, InceptionV3, and VGG16 (90.71%, 89.38%, and 87.12%, respectively). CONCLUSIONS FCONet, a simple 2D deep learning framework based on a single chest CT image, provides excellent diagnostic performance in detecting COVID-19 pneumonia. Based on our testing dataset, the ResNet50-based FCONet might be the best model, as it outperformed the other FCONet models based on VGG16, Xception, and InceptionV3.
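The transfer-learning recipe described in the Methods (a pre-trained backbone with a new classification head for the three diagnostic classes) follows a standard pattern. Below is a minimal Keras sketch of that pattern, assuming an ImageNet-pretrained ResNet50 backbone, a 224x224 input and a frozen first training phase; this is not the published FCONet configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import ResNet50

# Pre-trained backbone without its classification head (input size is an assumption).
backbone = ResNet50(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
backbone.trainable = False  # freeze ImageNet features for the initial training phase

# New head for three classes: COVID-19 pneumonia, other pneumonia, non-pneumonia.
model = models.Sequential([
    backbone,
    layers.GlobalAveragePooling2D(),
    layers.Dense(256, activation="relu"),
    layers.Dropout(0.5),
    layers.Dense(3, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
model.summary()
# model.fit(train_images, train_labels, epochs=20)  # train_images/train_labels are hypothetical arrays
```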

