Deep learning architectures for semantic segmentation and automatic estimation of severity of foliar symptoms caused by diseases or pests

Analysis of Encoder-Decoder Based Deep Learning Architectures for Semantic Segmentation in Remote Sensing Images

Advances in Intelligent Systems and Computing - Intelligent Systems Design and Applications ◽

10.1007/978-3-030-16660-1_33 ◽

2019 ◽

pp. 332-341

Author(s):

R. Sivagami ◽

J. Srihari ◽

K. S. Ravichandran

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Semantic Segmentation ◽

Remote Sensing Images ◽

Learning Architectures

Deep learning architectures for semantic segmentation of 2D image: A review

10.1063/5.0074633 ◽

2021 ◽

Author(s):

Ravindra Gadhiya ◽

Nilesh Kalani

Keyword(s):

Deep Learning ◽

Semantic Segmentation ◽

Learning Architectures

Semantic segmentation of HeLa cells: An objective comparison between one traditional algorithm and four deep-learning architectures

PLoS ONE ◽

10.1371/journal.pone.0230605 ◽

2020 ◽

Vol 15 (10) ◽

pp. e0230605

Author(s):

Cefa Karabağ ◽

Martin L. Jones ◽

Christopher J. Peddie ◽

Anne E. Weston ◽

Lucy M. Collinson ◽

...

Keyword(s):

Deep Learning ◽

Hela Cells ◽

Semantic Segmentation ◽

Traditional Algorithm ◽

Learning Architectures

Fusion-Based Semantic Segmentation Using Deep Learning Architecture in Case of Very Small Training Dataset

International Journal of Image and Graphics ◽

10.1142/s0219467822500437 ◽

2021 ◽

Author(s):

Ganesh R. Padalkar ◽

Madhuri B. Khambete

Keyword(s):

Deep Learning ◽

Real Life ◽

Class Imbalance ◽

Semantic Segmentation ◽

Training Dataset ◽

Processing Step ◽

Effective Training ◽

Unsupervised Algorithms ◽

Learning Architectures ◽

Small Dataset

Semantic segmentation is a pre-processing step in computer vision-based applications. It is the task of assigning a predefined class label to every pixel of an image. Several supervised and unsupervised algorithms, such as random forest and SVM, can classify the pixels of an image into predefined object classes. Recently, convolutional neural network (CNN)-based architectures have become popular for object detection, object recognition, and segmentation. These deep architectures perform semantic segmentation with far better accuracy than the earlier algorithms, but they require a large dataset for training. In real life, some applications, e.g. medical applications, may not have enough good-quality samples to train a deep learning architecture. This creates a need for a technique that trains a deep learning architecture effectively on a very small dataset. Class imbalance is another challenge in training: because of it, the classifier overclassifies the classes with many samples. In this paper, the challenge of training a deep learning architecture with a small dataset and class imbalance is addressed by a novel fusion-based semantic segmentation technique, which improves segmentation of both minor and major classes.
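To make the class-imbalance problem concrete, the sketch below computes inverse-frequency class weights from a pixel label map. This is a common generic mitigation and is not the fusion-based technique the paper proposes; the function name `inverse_frequency_weights` and the toy label map are assumptions made for illustration only.

```python
from collections import Counter


def inverse_frequency_weights(label_map):
    """Return one weight per class, larger for classes with fewer pixels.

    label_map is a 2D list of integer class labels (hypothetical toy input).
    Weight = total_pixels / (number_of_classes * pixels_in_class), so a class
    that covers exactly its "fair share" of the image gets weight 1.0.
    """
    counts = Counter(label for row in label_map for label in row)
    total = sum(counts.values())
    n_classes = len(counts)
    return {label: total / (n_classes * count) for label, count in counts.items()}


# Toy example: class 0 covers 7 of 8 pixels, class 1 covers only 1.
label_map = [
    [0, 0, 0, 0],
    [0, 0, 0, 1],
]
weights = inverse_frequency_weights(label_map)
print(weights)  # the rare class 1 receives the larger weight
```

Weights of this kind are typically used to scale each pixel's contribution to the training loss, so that errors on the minor class are not swamped by the major class.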

Semantic Segmentation of HeLa Cells: An Objective Comparison between one Traditional Algorithm and Three Deep-Learning Architectures

10.1101/2020.03.05.978478 ◽

2020 ◽

Author(s):

Cefa Karabağ ◽

Martin L. Jones ◽

Christopher J. Peddie ◽

Anne E. Weston ◽

Lucy M. Collinson ◽

...

Keyword(s):

Image Processing ◽

Deep Learning ◽

Ground Truth ◽

Semantic Segmentation ◽

Superior Performance ◽

Processing Algorithm ◽

Image Processing Algorithm ◽

Segmentation Accuracy ◽

Traditional Algorithm ◽

Learning Architectures

In this work, images of a HeLa cancer cell were semantically segmented with one traditional image-processing algorithm and three deep learning architectures: VGG16, ResNet18 and Inception-ResNet-v2. Three hundred slices, each 2000 × 2000 pixels, of a HeLa cell were acquired with Serial Block Face Scanning Electron Microscopy. The deep learning architectures were pre-trained with ImageNet and then fine-tuned with transfer learning. The image-processing algorithm followed a pipeline of several traditional steps such as edge detection, dilation and morphological operators. The algorithms were compared by measuring pixel-based segmentation accuracy and Jaccard index against a labelled ground truth. The results indicated a superior performance of the traditional algorithm (Accuracy = 99%, Jaccard = 93%) over the deep learning architectures: VGG16 (93%, 90%), ResNet18 (94%, 88%), Inception-ResNet-v2 (94%, 89%).
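The two metrics named in this abstract can be sketched as follows for a single foreground class. This is a minimal illustration on a toy mask, not the authors' evaluation code; the function names and the sample arrays are assumptions.

```python
def pixel_accuracy(pred, truth):
    """Fraction of pixels whose predicted label equals the ground-truth label."""
    flat_pred = [p for row in pred for p in row]
    flat_truth = [t for row in truth for t in row]
    if len(flat_pred) != len(flat_truth):
        raise ValueError("prediction and ground truth must have the same size")
    matches = sum(p == t for p, t in zip(flat_pred, flat_truth))
    return matches / len(flat_truth)


def jaccard_index(pred, truth, label=1):
    """Intersection over union of the pixels assigned to `label`."""
    flat_pred = [p for row in pred for p in row]
    flat_truth = [t for row in truth for t in row]
    intersection = sum(p == label and t == label for p, t in zip(flat_pred, flat_truth))
    union = sum(p == label or t == label for p, t in zip(flat_pred, flat_truth))
    # If the label is absent from both masks, treat the match as perfect.
    return intersection / union if union else 1.0


# Toy 2 x 4 masks: the prediction misses one foreground pixel.
truth = [[0, 0, 1, 1],
         [0, 1, 1, 1]]
pred = [[0, 0, 1, 1],
        [0, 0, 1, 1]]

print(pixel_accuracy(pred, truth))  # 0.875 (7 of 8 pixels agree)
print(jaccard_index(pred, truth))   # 0.8   (intersection 4, union 5)
```

The example shows why the two numbers can diverge: accuracy counts correctly labelled background pixels, whereas the Jaccard index only considers pixels that either mask assigns to the foreground class.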

Deep Learning Methods Applied to 3D Point Clouds Based Instance Segmentation: A Review

10.20944/preprints202111.0228.v1 ◽

2021 ◽

Author(s):

Desire Mulindwa Burume ◽

Shengzhi Du

Keyword(s):

Deep Learning ◽

Semantic Segmentation ◽

Point Clouds ◽

Autonomous Driving ◽

Indoor Navigation ◽

Future Directions ◽

3D Point Clouds ◽

Speed Accuracy ◽

Learning Architectures ◽

Instance Segmentation

Beyond semantic segmentation, 3D instance segmentation (a process that delineates objects of interest and also classifies them into a set of categories) is attracting growing interest among researchers, since numerous computer vision applications need accurate segmentation (autonomous driving, indoor navigation, and even virtual or augmented reality systems). This paper gives an overview and a technical comparison of the existing deep learning architectures for handling unstructured Euclidean data in the rapidly developing field of 3D instance segmentation. First, the authors divide the 3D point cloud-based instance segmentation techniques into two major categories: proposal-based methods and proposal-free methods. Then, they introduce and compare the most used datasets for 3D instance segmentation. Furthermore, they compare and analyze the performance of these techniques (speed, accuracy, response to noise, etc.). Finally, this paper reviews possible future directions of deep learning for 3D sensor-based information and provides insight into the most promising areas for prospective research.

Development of environment design support mixed reality system capable of environment estimation using deep learning

Impact ◽

10.21820/23987073.2020.2.9 ◽

2020 ◽

Vol 2020 (2) ◽

pp. 9-11

Author(s):

Tomohiro Fukuda

Keyword(s):

Deep Learning ◽

Real Time ◽

Computer Games ◽

Construction Projects ◽

Mixed Reality ◽

Semantic Segmentation ◽

Environment Design ◽

Aviation Training ◽

Architecture And Design ◽

World Environment

Mixed reality (MR) is rapidly becoming a vital tool, not just in gaming, but also in education, medicine, construction and environmental management. The term refers to systems in which computer-generated content is superimposed over objects in a real-world environment across one or more sensory modalities. Although most of us have heard of the use of MR in computer games, it also has applications in military and aviation training, as well as tourism, healthcare and more. In addition, it has potential for use in architecture and design, where buildings can be superimposed on existing locations to render plans in 3D. However, one major challenge that remains in MR development is real-time occlusion, that is, hiding 3D virtual objects behind real objects. Dr Tomohiro Fukuda, who is based at the Division of Sustainable Energy and Environmental Engineering, Graduate School of Engineering at Osaka University in Japan, is an expert in this field. Researchers led by Dr Fukuda are tackling the issue of occlusion in MR. They are currently developing an MR system that realises real-time occlusion by harnessing deep learning to achieve an outdoor landscape design simulation using a semantic segmentation technique. This methodology can be used to automatically estimate the visual environment before and after construction projects.
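The occlusion idea described above can be illustrated with a small sketch: once every pixel of the camera frame has a semantic label, a virtual pixel is drawn only where the real pixel does not belong to a class assumed to stand in front of the virtual object. This is a simplified illustration under stated assumptions, not the system developed by the researchers; the function name, the label names and the choice of "tree" as the occluding class are all hypothetical.

```python
def composite_with_occlusion(frame, virtual, labels, occluding_labels):
    """Overlay virtual pixels on a camera frame, respecting real occluders.

    frame            -- 2D list of real camera pixels
    virtual          -- 2D list of rendered virtual pixels, None where empty
    labels           -- 2D list of per-pixel semantic labels for the frame
    occluding_labels -- set of labels assumed to lie in front of the virtual object
    """
    output = []
    for frame_row, virtual_row, label_row in zip(frame, virtual, labels):
        out_row = []
        for real_px, virtual_px, label in zip(frame_row, virtual_row, label_row):
            if virtual_px is None or label in occluding_labels:
                out_row.append(real_px)      # keep the real pixel visible
            else:
                out_row.append(virtual_px)   # draw the virtual object
        output.append(out_row)
    return output


# Toy 2 x 2 frame: a virtual building "V" partly hidden behind a real tree.
frame = [["sky", "tree"], ["road", "tree"]]
labels = [["sky", "tree"], ["road", "tree"]]
virtual = [[None, "V"], ["V", "V"]]

print(composite_with_occlusion(frame, virtual, labels, {"tree"}))
# [['sky', 'tree'], ['V', 'tree']]
```

A label-based rule like this treats a whole class as foreground; a real system would also need depth ordering to decide which real objects are actually in front of the virtual one.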

Automatic Deep Learning Semantic Segmentation of Ultrasound Thyroid Cineclips using Recurrent Fully Convolutional Networks

IEEE Access ◽

10.1109/access.2020.3045906 ◽

2020 ◽

pp. 1-1

Author(s):

Jeremy M. Webb ◽

Duane D. Meixner ◽

Shaheeda A. Adusei ◽

Eric C. Polley ◽

Mostafa Fatemi ◽

...

Keyword(s):

Deep Learning ◽

Semantic Segmentation ◽

Convolutional Networks ◽

Fully Convolutional Networks

A Generalization Performance Study Using Deep Learning Networks in Embedded Systems

Sensors ◽

10.3390/s21041031 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1031

Author(s):

Joseba Gorospe ◽

Rubén Mulero ◽

Olatz Arbelaitz ◽

Javier Muguerza ◽

Miguel Ángel Antón

Keyword(s):

Deep Learning ◽

Embedded Systems ◽

Embedded System ◽

General Purpose ◽

Learning Networks ◽

Performance Study ◽

Learning Techniques ◽

Wide Range ◽

Learning Architectures

Deep learning techniques are increasingly used in the scientific community, driven by the high computational capacity of current systems and by the growing amount of data available as society in general, and the industrial world in particular, becomes digitalised. In addition, the emergence of edge computing, which focuses on integrating artificial intelligence as close as possible to the client, makes it possible to implement systems that act in real time without transferring all of the data to centralised servers. Combining these two concepts can lead to systems able to make correct decisions and act on them immediately and in situ. However, the low capacity of embedded systems greatly hinders this integration, so being able to deploy deep learning on a wide range of micro-controllers can be a great advantage. This paper contributes an environment based on Mbed OS and TensorFlow Lite that can be embedded in any general purpose embedded system, allowing the introduction of deep learning architectures. The experiments herein indicate that the proposed system is competitive with other commercial systems.

Identifying the Branch of Kiwifruit Based on Unmanned Aerial Vehicle (UAV) Images Using Deep Learning Method

Sensors ◽

10.3390/s21134442 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4442

Author(s):

Zijie Niu ◽

Juntao Deng ◽

Xu Zhang ◽

Jun Zhang ◽

Shijia Pan ◽

...

Keyword(s):

Deep Learning ◽

Unmanned Aerial Vehicle ◽

Semantic Segmentation ◽

Dynamic Monitoring ◽

Support Vector ◽

Distribution Maps ◽

Time Operation ◽

Aerial Vehicle ◽

Uav Images ◽

Segmentation Image

It is important to obtain accurate information about kiwifruit vines in order to monitor their physiological states and undertake precise orchard operations. However, because vines are small, cling to trellises, and have branches lying on the ground, acquiring accurate data for kiwifruit vines poses numerous challenges. In this paper, a kiwifruit canopy distribution prediction model is proposed on the basis of low-altitude unmanned aerial vehicle (UAV) images and deep learning techniques. First, the location of the kiwifruit plants and the vine distribution are extracted from high-precision images collected by UAV. Canopy gradient distribution maps with different noise reduction and distribution effects are then generated by modifying the threshold and sampling size using the resampling normalization method. The results showed that the accuracies of the vine segmentation using PSPnet, support vector machine, and random forest classification were 71.2%, 85.8%, and 75.26%, respectively. However, the segmentation image obtained using deep semantic segmentation had a higher signal-to-noise ratio and was closer to the real situation. The average intersection over union of the deep semantic segmentation was at least 80% in the distribution maps, whereas for traditional machine learning it was between 20% and 60%. This indicates that the proposed model can quickly extract the vine distribution and plant position, and is thus able to perform dynamic monitoring of orchards to provide real-time operation guidance.
