Accelerating the Automated Detection, Counting and Measurements of Reproductive Organs in Herbarium Collections in the Era of Deep Learning

Author(s):  
Adán Mora-Fallas ◽  
Hervé Goëau ◽  
Susan Mazer ◽  
Natalie Love ◽  
Erick Mata-Montero ◽  
...  

Millions of herbarium records provide an invaluable legacy and knowledge of the spatial and temporal distributions of plants over centuries across all continents (Soltis et al. 2018). Thanks to recent efforts to digitize most major natural collections and make them publicly accessible, investigations of ecological and evolutionary patterns at unprecedented geographic scales are now possible (Carranza-Rojas et al. 2017, Lorieul et al. 2019). Nevertheless, biologists now face the problem of extracting basic information, such as textual descriptions, the numbers of organs, and measurements of various morphological traits, from a huge number of herbarium sheets. Deep learning technologies can dramatically accelerate the extraction of such information by automating the routines of organ identification, counting, and measurement, thereby allowing biologists to spend more time on investigations such as phenological or geographic distribution studies. Recent progress on instance segmentation demonstrated by the Mask-RCNN method is very promising in the context of herbarium sheets, in particular for detecting with high precision the different organs of interest on each specimen, including leaves, flowers, and fruits. However, like any deep learning approach, this method requires a significant number of labeled examples with fairly detailed outlines of individual organs. Creating such a training dataset can be very time-consuming and may discourage researchers. In this work, we propose to integrate the Mask-RCNN approach within a global system enabling an active learning mechanism (Sener and Savarese 2018) in order to minimize the number of organ outlines that researchers must annotate manually. The principle is to alternate cycles of manual annotation, training updates of the deep learning model, and predictions on the entire collection to be processed. The challenge for the active learning mechanism is then to estimate automatically, at each cycle, which objects would be most useful to annotate manually in the next cycle, so that an accurate model is learned in as few cycles as possible. We discuss experiments addressing the effectiveness, the limits, and the annotation time required by our approach, in the context of a phenological study of more than 10,000 reproductive organs (buds, flowers, fruits and immature fruits) of Streptanthus tortuosus, a species known to be highly variable in appearance and therefore very difficult for an instance segmentation deep learning model to process.
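
The cycle described above alternates manual annotation, model updates, and whole-collection prediction. The Python sketch below illustrates one plausible form of that loop; train_mask_rcnn, predict_instances, and request_annotations are hypothetical helpers standing in for the Mask-RCNN training/inference code and the annotation interface, and least-confidence scoring is one common active-learning criterion, not necessarily the one used by the authors.

```python
# Minimal sketch of the annotate/train/predict active-learning cycle.
# train_mask_rcnn, predict_instances, and request_annotations are
# hypothetical helpers for the detector and the annotation interface.

def active_learning_loop(unlabeled_sheets, budget_per_cycle=50, n_cycles=5):
    labeled = []                   # (sheet, organ outlines) pairs so far
    pool = list(unlabeled_sheets)  # herbarium sheets still without outlines
    model = None

    for cycle in range(n_cycles):
        if labeled:
            model = train_mask_rcnn(labeled)       # update the detector

        # Score every remaining sheet; sheets whose predicted organs have the
        # lowest average confidence are assumed to be the most informative.
        scored = []
        for sheet in pool:
            detections = predict_instances(model, sheet) if model else []
            confidence = (sum(d.score for d in detections) / len(detections)
                          if detections else 0.0)
            scored.append((confidence, sheet))
        scored.sort(key=lambda item: item[0])      # least confident first

        # Ask the expert to outline organs on the selected sheets only.
        selected = [sheet for _, sheet in scored[:budget_per_cycle]]
        labeled += request_annotations(selected)
        pool = [s for s in pool if s not in selected]

    return model
```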

2021 ◽  
Vol 10 (3) ◽  
pp. 137
Author(s):  
Youngok Kang ◽  
Nahye Cho ◽  
Jiyoung Yoon ◽  
Soyeon Park ◽  
Jiyeon Kim

Recently, as computer vision and image processing technologies have rapidly advanced in the artificial intelligence (AI) field, deep learning technologies have been applied to urban and regional studies through transfer learning. In the tourism field, studies are emerging that analyze tourists' image of a city by identifying the visual content of their photos. However, previous studies have limitations in properly reflecting the unique landscapes, cultural characteristics, and traditional elements of a region that are prominent in tourism. To go beyond these limitations, we crawled 168,216 Flickr photos, created a tourist photo classification of 75 scenes grouped into 13 categories by analyzing the characteristics of photos posted by tourists, and developed a deep learning model by continuously re-training the Inception-v3 model. The final model shows a high accuracy of 85.77% for Top-1 and 95.69% for Top-5. The final model was applied to the entire dataset to analyze the regions of attraction and tourists' image of Seoul. We found that tourists are attracted to areas of Seoul where modern features, such as skyscrapers and uniquely designed architecture, are mixed with traditional features, such as palaces and other cultural elements. This work demonstrates a tourist photo classification suited to local characteristics and a process for re-training a deep learning model to effectively classify a large volume of tourists' photos.
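
As a concrete illustration of the re-training step, the sketch below fine-tunes an ImageNet-pretrained Inception-v3 with a new 75-way softmax head using TensorFlow/Keras. The directory layout, image size, and hyperparameters are assumptions for illustration, not the authors' exact settings.

```python
import tensorflow as tf

NUM_SCENES = 75  # 75 scene classes, grouped into 13 higher-level categories

base = tf.keras.applications.InceptionV3(
    weights="imagenet", include_top=False, input_shape=(299, 299, 3))
base.trainable = False  # freeze the pretrained convolutional features first

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(299, 299, 3)),
    tf.keras.layers.Rescaling(1.0 / 127.5, offset=-1.0),  # InceptionV3 expects [-1, 1]
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dropout(0.3),
    tf.keras.layers.Dense(NUM_SCENES, activation="softmax"),
])
model.compile(
    optimizer="adam",
    loss="sparse_categorical_crossentropy",
    metrics=["accuracy", tf.keras.metrics.SparseTopKCategoricalAccuracy(k=5)],
)

# Hypothetical folder of labeled tourist photos, one subdirectory per scene.
train_ds = tf.keras.utils.image_dataset_from_directory(
    "flickr_scenes/train", image_size=(299, 299), batch_size=32)
model.fit(train_ds, epochs=10)
```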


2020 ◽  
Vol 22 (Supplement_2) ◽  
pp. ii148-ii148
Author(s):  
Yoshihiro Muragaki ◽  
Yutaka Matsui ◽  
Takashi Maruyama ◽  
Masayuki Nitta ◽  
Taiichi Saito ◽  
...  

Abstract INTRODUCTION: It is useful to know the molecular subtype of lower-grade gliomas (LGG) when deciding on a treatment strategy. This study aims to diagnose this subtype preoperatively. METHODS: A deep learning model was developed to predict the 3-group molecular subtype using multimodal data including magnetic resonance imaging (MRI), positron emission tomography (PET), and computed tomography (CT). Performance was evaluated using leave-one-out cross-validation with a dataset containing information from 217 LGG patients. RESULTS: The model performed best when the dataset contained MRI, PET, and CT data, predicting the molecular subtype with an accuracy of 96.6% on the training dataset and 68.7% on the test dataset. The model achieved test accuracies of 58.5%, 60.4%, and 59.4% when the dataset contained only MRI, MRI and PET, and MRI and CT data, respectively. The conventional method, which predicts mutations in the isocitrate dehydrogenase (IDH) gene and the codeletion of chromosome arms 1p and 19q (1p/19q) sequentially, had an overall accuracy of 65.9%. This is 2.8 percentage points lower than the proposed method, which predicts the 3-group molecular subtype directly. CONCLUSIONS AND FUTURE PERSPECTIVE: A deep learning model was developed to diagnose the molecular subtype preoperatively from multimodal data by predicting the 3-group classification directly. Cross-validation showed that the proposed model had an overall accuracy of 68.7% on the test dataset. This is the first model to double the accuracy expected by chance for a 3-group classification problem when predicting the LGG molecular subtype. We plan to apply heat-map and/or segmentation techniques to further increase prediction accuracy.
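
The leave-one-out protocol mentioned above can be illustrated with the short sketch below, which evaluates a simple placeholder classifier on synthetic per-patient feature vectors; the random features and labels are stand-ins for the fused MRI/PET/CT inputs, and the classifier is not the authors' deep model.

```python
import numpy as np
from sklearn.model_selection import LeaveOneOut
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(217, 64))    # 217 LGG patients, 64 fused features (synthetic)
y = rng.integers(0, 3, size=217)  # 3-group molecular subtype labels (synthetic)

correct = 0
for train_idx, test_idx in LeaveOneOut().split(X):
    # Re-train from scratch on all patients except the held-out one.
    clf = MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=0)
    clf.fit(X[train_idx], y[train_idx])
    correct += int(clf.predict(X[test_idx])[0] == y[test_idx][0])

print(f"Leave-one-out accuracy: {correct / len(y):.3f}")
```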


2020 ◽  
Vol 2020 ◽  
pp. 1-10
Author(s):  
Zhijian Huang ◽  
Fangmin Li ◽  
Xidao Luan ◽  
Zuowei Cai

Automatically detecting mud in bauxite ores is important and valuable, as it can improve productivity and reduce pollution. However, distinguishing mud from ore in a real scene is challenging because of their similarity in shape, color, and texture. Moreover, training a deep learning model requires a large number of precisely labeled samples, which is expensive and time-consuming. To address this challenge, this paper proposes a novel weakly supervised method based on deep active learning (AL), named YOLO-AL. The method uses the YOLO-v3 model as the basic detector, initialized with weights pretrained on the MS COCO dataset. Then, the YOLO-v3 model is embedded in an AL framework. In the AL process, the last few layers of the YOLO-v3 model are iteratively fine-tuned with the most valuable samples, which are selected by a Less Confident (LC) strategy. Experimental results show that the proposed method can effectively detect mud in ores. More importantly, it considerably reduces the number of labeled samples required without decreasing detection accuracy.
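
A minimal sketch of the Less Confident selection idea is given below: images whose most confident detection is weakest are queried for labeling first. The detect callable is a hypothetical wrapper around the YOLO-v3 detector returning (class_id, confidence, box) tuples; the exact scoring rule used by the authors may differ.

```python
def least_confident_selection(detect, unlabeled_images, query_size=100):
    """Return the query_size images the detector is least sure about."""
    scored = []
    for image in unlabeled_images:
        detections = detect(image)  # hypothetical YOLO-v3 wrapper
        # Confidence of the most certain detection; images with no detections
        # are treated as maximally uncertain.
        top_conf = max((conf for _, conf, _ in detections), default=0.0)
        scored.append((top_conf, image))
    scored.sort(key=lambda item: item[0])  # least confident first
    return [image for _, image in scored[:query_size]]
```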


2021 ◽  
Vol 191 ◽  
pp. 106488
Author(s):  
Weikuan Jia ◽  
Zhonghua Zhang ◽  
Wenjing Shao ◽  
Sujuan Hou ◽  
Ze Ji ◽  
...  

2020 ◽  
Vol 10 (1) ◽  
pp. 421 ◽  
Author(s):  
Kwang Sun Ryu ◽  
Sang Won Lee ◽  
Erdenebileg Batbaatar ◽  
Jae Wook Lee ◽  
Kui Son Choi ◽  
...  

A screening model for undiagnosed diabetes mellitus (DM) is important for early medical care, yet little research has been carried out on developing such a model using machine learning techniques. Thus, the primary objective of this study was to develop a screening model for patients with undiagnosed DM using a deep neural network. We conducted a cross-sectional study using data from the Korean National Health and Nutrition Examination Survey (KNHANES) 2013–2016. A total of 11,456 participants were selected, excluding those with diagnosed DM, an age < 20 years, or missing data. KNHANES 2013–2015 was used as a training dataset and analyzed to develop a deep learning model (DLM) for undiagnosed DM. The DLM was evaluated with 4444 participants who were surveyed in the 2016 KNHANES. The DLM was constructed using seven non-invasive variables (NIV): age, waist circumference, body mass index, gender, smoking status, hypertension, and family history of diabetes. The model showed appropriate performance (area under the curve (AUC): 80.11%) compared with existing screening models. The DLM developed in this study for patients with undiagnosed diabetes could contribute to early medical care.
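
For illustration, the sketch below builds a small feed-forward network over the seven non-invasive variables and evaluates it with an AUC metric; the architecture and training settings are assumptions, not the authors' exact configuration.

```python
import tensorflow as tf

# Seven NIV features: age, waist circumference, BMI, gender, smoking status,
# hypertension, family history of diabetes (encoded numerically).
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(7,)),
    tf.keras.layers.Dense(32, activation="relu"),
    tf.keras.layers.Dense(16, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),  # P(undiagnosed DM)
])
model.compile(optimizer="adam",
              loss="binary_crossentropy",
              metrics=[tf.keras.metrics.AUC(name="auc")])

# x_train/y_train: KNHANES 2013-2015 features and 0/1 labels (hypothetical
# arrays of shape (n, 7) and (n,)); x_test/y_test: the 2016 survey.
# model.fit(x_train, y_train, epochs=50, batch_size=64, validation_split=0.1)
# model.evaluate(x_test, y_test)
```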


2022 ◽  
Vol 12 (1) ◽  
Author(s):  
Hiroto Ozaki ◽  
Takeshi Aoyagi

Abstract: Considerable attention has been given to deep-learning and machine-learning techniques in an effort to reduce the computational cost of computational fluid dynamics simulation. The present paper addresses the prediction of steady flows passing many fixed cylinders using a deep-learning model and investigates the accuracy of the predicted velocity field. The deep-learning model outputs the x- and y-components of the flow velocity field when the cylinder arrangement is given as input. The accuracy of the predicted velocity field is investigated with a focus on the velocity profile of the fluid flow and the fluid force acting on the cylinders. The present model accurately predicts the flow when the number of cylinders is equal to or close to that set in the training dataset. Extrapolating the prediction to a smaller number of cylinders results in error, which can be interpreted as internal friction of the fluid. The results for the fluid force acting on the cylinders suggest that the present deep-learning model has good generalization performance for systems with a larger number of cylinders.
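
The kind of image-to-image mapping implied above can be sketched as a small convolutional encoder-decoder that takes a binary cylinder mask and regresses a two-channel (u, v) velocity field; the grid size, depth, and filter counts below are illustrative assumptions, not the network reported in the paper.

```python
import tensorflow as tf

def build_velocity_model(grid=128):
    # Input: binary mask marking the fixed cylinders on a grid x grid domain.
    inp = tf.keras.layers.Input(shape=(grid, grid, 1))
    x = tf.keras.layers.Conv2D(32, 3, padding="same", activation="relu")(inp)
    x = tf.keras.layers.MaxPooling2D()(x)
    x = tf.keras.layers.Conv2D(64, 3, padding="same", activation="relu")(x)
    x = tf.keras.layers.MaxPooling2D()(x)
    x = tf.keras.layers.Conv2D(64, 3, padding="same", activation="relu")(x)
    x = tf.keras.layers.UpSampling2D()(x)
    x = tf.keras.layers.Conv2D(32, 3, padding="same", activation="relu")(x)
    x = tf.keras.layers.UpSampling2D()(x)
    # Output: two channels for the x- and y-velocity components.
    out = tf.keras.layers.Conv2D(2, 3, padding="same")(x)
    model = tf.keras.Model(inp, out)
    model.compile(optimizer="adam", loss="mse")  # regress the velocity field
    return model

model = build_velocity_model()
```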


2021 ◽  
Vol 310 ◽  
pp. 04002
Author(s):  
Nguyen Thanh Doan

Nowadays, expanding the application of deep learning technology is attracting the attention of many researchers in the field of remote sensing. This paper presents a methodology for using a deep convolutional neural network model to determine the position of the shoreline on Sentinel-2 satellite images. The methodology also provides techniques to reduce model retraining while ensuring the accuracy of the results. Methodological evaluation and analysis were conducted in the Mekong Delta region. The results of the study showed that interpolating the input images and calibrating the result thresholds improve accuracy and allow the trained deep learning model to be tested on other, unseen images. The paper also evaluates the impact of the training dataset on the quality of the results obtained. Suggestions are given on the number of files in the training dataset, as well as on the information used for model training, for solving the shoreline detection problem.
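
The two processing ideas highlighted above, interpolating the input imagery and calibrating the output threshold, can be sketched as follows; the probability map stands in for the network output, and the simple pixel-accuracy criterion is an assumption for illustration, not necessarily the calibration rule used in the paper.

```python
import numpy as np
from scipy.ndimage import zoom

def resample_band(band, factor):
    """Bilinear-style interpolation of an input band (e.g. 20 m -> 10 m)."""
    return zoom(band, factor, order=1)

def calibrate_threshold(prob_map, reference_mask, candidates=None):
    """Pick the water/land threshold on the predicted probability map that
    best matches a small reference mask (pixel-accuracy criterion)."""
    if candidates is None:
        candidates = np.linspace(0.1, 0.9, 17)
    best_t, best_acc = 0.5, -1.0
    for t in candidates:
        acc = np.mean((prob_map > t) == reference_mask)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t
```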


Author(s):  
Rajeshvaree Ravindra Karmarkar ◽  
Prof. V. N. Honmane

As object recognition technology has developed recently, various technologies have been applied to autonomous vehicles, robots, and industrial facilities. However, the benefits of these technologies are not reaching the visually impaired, who need them the most. This paper proposes an object detection system for the blind using deep learning technologies. Furthermore, a voice guidance technique is used to inform sight-impaired persons of the location of objects. The object recognition deep learning model utilizes the You Only Look Once (YOLO) algorithm, and a voice announcement is synthesized using text-to-speech (TTS) to make it easier for the blind to get information about objects. As a result, it implements an efficient object-detection system that helps the blind find objects in a specific space without help from others, and the system is analyzed through experiments to verify performance.
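
A minimal sketch of the detect-then-announce pipeline is shown below; run_yolo is a hypothetical wrapper around a YOLO detector returning (label, confidence, box) tuples, and pyttsx3 is one common offline TTS option, not necessarily the engine used by the authors.

```python
import pyttsx3

def describe_position(box, frame_width):
    """Map a bounding box (x, y, w, h) to a coarse spoken location."""
    x, _, w, _ = box
    center = x + w / 2
    if center < frame_width / 3:
        return "on your left"
    if center > 2 * frame_width / 3:
        return "on your right"
    return "ahead of you"

def announce_objects(frame, frame_width, run_yolo, min_conf=0.5):
    """Detect objects in a frame and speak their labels and rough positions."""
    engine = pyttsx3.init()
    for label, conf, box in run_yolo(frame):  # hypothetical YOLO wrapper
        if conf >= min_conf:
            engine.say(f"{label} {describe_position(box, frame_width)}")
    engine.runAndWait()
```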

