An efficient semantic segmentation method based on transfer learning from object detection

2020 ◽  
Author(s):  
Wei Yang ◽  
Jianlin Zhang ◽  
Zhongbi Chen ◽  
Zhiyong Xu


2021 ◽
Vol 25 (8) ◽  
pp. 4435-4453
Author(s):  
Remy Vandaele ◽  
Sarah L. Dance ◽  
Varun Ojha

Abstract. River-level estimation is a critical task required for the understanding of flood events and is often complicated by the scarcity of available data. Recent studies have proposed to take advantage of large networks of river-camera images to estimate river levels but, currently, the utility of this approach remains limited as it requires a large amount of manual intervention (ground topographic surveys and water image annotation). We have developed an approach using an automated water semantic segmentation method to ease the process of river-level estimation from river-camera images. Our method is based on the application of a transfer learning methodology to deep semantic neural networks designed for water segmentation. Using datasets of image series extracted from four river cameras and manually annotated for the observation of a flood event on the rivers Severn and Avon, UK (21 November–5 December 2012), we show that this algorithm is able to automate the annotation process with an accuracy greater than 91 %. Then, we apply our approach to year-long image series from the same cameras observing the rivers Severn and Avon (from 1 June 2019 to 31 May 2020) and compare the results with nearby river-gauge measurements. Given the high correlation (Pearson's correlation coefficient >0.94) between these results and the river-gauge measurements, it is clear that our approach to automation of the water segmentation on river-camera images could allow for straightforward, inexpensive observation of flood events, especially at ungauged locations.
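
A minimal Python sketch (not the authors' code) of the comparison step described above: a per-image water index is derived from the predicted segmentation masks and correlated with co-located river-gauge readings. The mask format and function names are assumptions for illustration.

import numpy as np
from scipy.stats import pearsonr

def water_index(mask: np.ndarray) -> float:
    """Fraction of pixels classified as water (mask: binary HxW array)."""
    return float(mask.mean())

def correlate_with_gauge(masks, gauge_readings):
    """masks: list of binary segmentation outputs, one per camera image;
    gauge_readings: river-gauge values at the matching timestamps."""
    index = np.array([water_index(m) for m in masks])
    r, p = pearsonr(index, gauge_readings)  # the paper reports r > 0.94
    return r, p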


Sensors ◽  
2021 ◽  
Vol 21 (12) ◽  
pp. 3964
Author(s):  
Muhammad Imad ◽  
Oualid Doukhi ◽  
Deok-Jin Lee

Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection offers higher accuracy than cameras, particularly at night. However, most LiDAR-based 3D object detection methods work in a supervised manner, which means their state-of-the-art performance relies heavily on large-scale, well-labeled datasets, and such annotated datasets are expensive to obtain and available only for limited scenarios. Transfer learning is a promising approach to reducing the need for large-scale training datasets, but existing transfer learning object detectors are designed primarily for 2D rather than 3D object detection. In this work, we use 3D point cloud data more effectively by representing the scene in a bird's-eye view (BEV) and propose a transfer-learning-based point cloud semantic segmentation method for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces training time. First, a preprocessing stage filters the raw point cloud data into a BEV map within a specific field of view. Second, the transfer learning stage reuses knowledge from a previously learned classification task (for which more training data are available) and generalizes it to the semantic segmentation-based 2D object detection task. Finally, a postprocessing stage back-projects the 2D detection results from the BEV image into 3D. We verify the results on two datasets, the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to 70%) while running at more than 30 frames per second (FPS).
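
A minimal sketch, under assumed ranges and resolution, of the preprocessing stage described above: raw LiDAR points are cropped to a field of view and rasterized into a BEV map. The two-channel encoding (max height and intensity) is an illustrative choice, not necessarily the paper's exact configuration.

import numpy as np

def pointcloud_to_bev(points, x_range=(0, 70), y_range=(-35, 35),
                      z_range=(-2.5, 1.0), res=0.1):
    """points: (N, 4) array of x, y, z, intensity."""
    x, y, z, r = points.T
    keep = ((x >= x_range[0]) & (x < x_range[1]) &
            (y >= y_range[0]) & (y < y_range[1]) &
            (z >= z_range[0]) & (z < z_range[1]))
    x, y, z, r = x[keep], y[keep], z[keep], r[keep]
    h = int((x_range[1] - x_range[0]) / res)
    w = int((y_range[1] - y_range[0]) / res)
    xi = ((x - x_range[0]) / res).astype(int).clip(0, h - 1)
    yi = ((y - y_range[0]) / res).astype(int).clip(0, w - 1)
    bev = np.zeros((h, w, 2), dtype=np.float32)  # channels: max height, intensity
    np.maximum.at(bev[:, :, 0], (xi, yi), z - z_range[0])
    np.maximum.at(bev[:, :, 1], (xi, yi), r)
    return bev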


2019 ◽  
Vol 18 (1) ◽  
pp. e1747-e1748
Author(s):  
Y.S. Kyung ◽  
B. Lim ◽  
S.Y. Choi ◽  
J.H. Han ◽  
W. Lee ◽  
...  

Author(s):  
Mhafuzul Islam ◽  
Mashrur Chowdhury ◽  
Hongda Li ◽  
Hongxin Hu

Vision-based navigation of autonomous vehicles primarily depends on deep neural network (DNN) based systems, in which the controller takes input from sensors such as cameras and produces a vehicle control output, such as a steering wheel angle, to navigate the vehicle safely in roadway traffic. Typically, these DNN-based systems are trained through supervised learning; however, recent studies show that a trained DNN-based system can be compromised by perturbed or adversarial inputs. Such perturbations can also be introduced into the DNN-based systems of autonomous vehicles by unexpected roadway hazards, such as debris or roadblocks. In this study, we first introduce a hazardous roadway environment that can compromise the DNN-based navigational system of an autonomous vehicle and produce an incorrect steering wheel angle, which could cause crashes resulting in fatality or injury. Then, we develop a DNN-based autonomous vehicle driving system that uses object detection and semantic segmentation to mitigate the adverse effect of this type of hazard and help the autonomous vehicle navigate safely around it. We find that our developed system, which includes hazardous object detection and semantic segmentation, improves the navigational ability of an autonomous vehicle to avoid a potential hazard by 21% compared with a traditional DNN-based autonomous vehicle driving system.
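
An illustrative sketch (not the authors' system) of how hazard detections and a drivable-area segmentation map might gate a base DNN's steering output: if a detected hazard rests on the drivable surface ahead, steering is biased toward the freer side of the road. The bias value and image-geometry heuristics are assumptions.

import numpy as np

def safe_steering(dnn_angle, hazard_boxes, drivable_mask, bias=0.1):
    """dnn_angle: steering from the base DNN; hazard_boxes: xyxy pixel boxes;
    drivable_mask: HxW boolean map from the segmentation network."""
    h, w = drivable_mask.shape
    ahead = drivable_mask[h // 2:, :]          # lower image half ~ road ahead
    for x0, y0, x1, y1 in hazard_boxes:
        cx = int(np.clip((x0 + x1) / 2, 0, w - 1))
        cy = int(np.clip(y1, 0, h - 1))        # bottom edge of the box
        if cy >= h // 2 and drivable_mask[cy, cx]:
            # Hazard sits on the drivable surface: steer toward whichever
            # side of the road has more free (drivable) pixels.
            left_free = ahead[:, : w // 2].sum()
            right_free = ahead[:, w // 2:].sum()
            return dnn_angle + (bias if right_free > left_free else -bias)
    return dnn_angle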


2021 ◽  
Vol 11 (9) ◽  
pp. 3782
Author(s):  
Chu-Hui Lee ◽  
Chen-Wei Lin

Object detection is one of the important technologies in the field of computer vision. In the area of fashion apparel, object detection has various applications, such as apparel recognition, apparel detection, fashion recommendation, and online search. The recognition task is difficult for a computer because fashion apparel images vary widely in clothing appearance and material. Currently, fast and accurate object detection is the most important goal in this field. In this study, we propose a two-phase fashion apparel detection method named YOLOv4-TPD (YOLOv4 Two-Phase Detection), based on the YOLOv4 algorithm, to address this challenge. The target categories for detection are the jacket, top, pants, skirt, and bag. Following the definition of inductive transfer learning, the aim is to transfer knowledge from the source domain to the target domain so as to improve task performance in the target domain; we therefore use a two-phase training method to implement the transfer learning. The experimental results show that, through the two-phase transfer learning, the mAP of our model exceeds that of the original YOLOv4 model. The proposed model has multiple potential applications, such as automatic labeling, style retrieval, and similarity detection.
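
A minimal PyTorch sketch of two-phase transfer learning of the kind described above (not the authors' code): phase one freezes the pretrained backbone and trains only the detection head; phase two fine-tunes the whole network at a lower learning rate. The model.backbone and model.head attribute names are assumptions.

import torch

def run_epochs(model, loader, loss_fn, opt, n):
    model.train()
    for _ in range(n):
        for images, targets in loader:
            opt.zero_grad()
            loss = loss_fn(model(images), targets)
            loss.backward()
            opt.step()

def two_phase_train(model, loader, loss_fn, epochs=(10, 30)):
    # Phase 1: freeze the pretrained backbone, train the detection head only
    for p in model.backbone.parameters():
        p.requires_grad = False
    opt = torch.optim.Adam(model.head.parameters(), lr=1e-3)
    run_epochs(model, loader, loss_fn, opt, epochs[0])

    # Phase 2: unfreeze everything, fine-tune at a smaller learning rate
    for p in model.parameters():
        p.requires_grad = True
    opt = torch.optim.Adam(model.parameters(), lr=1e-4)
    run_epochs(model, loader, loss_fn, opt, epochs[1])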


Drones ◽  
2021 ◽  
Vol 5 (3) ◽  
pp. 66
Author(s):  
Rahee Walambe ◽  
Aboli Marathe ◽  
Ketan Kotecha

Object detection in uncrewed aerial vehicle (UAV) images has been a longstanding challenge in the field of computer vision. Object detection in drone images is particularly complex because objects appear at widely varying scales, such as humans, buildings, water bodies, and hills. In this paper, we present an implementation of ensemble transfer learning to enhance the performance of base models for multiscale object detection in drone imagery. Combined with a test-time augmentation pipeline, the algorithm merges the outputs of different models and applies voting strategies to detect objects of various scales in UAV images. The data augmentation also mitigates the scarcity of drone image datasets. We experimented with two open datasets: the VisDrone dataset and the AU-AIR dataset. Our approach is more practical and efficient than training custom models on entire datasets, owing to the use of transfer learning and a two-level voting-strategy ensemble. The experiments show a significant improvement in mAP on both the VisDrone and AU-AIR datasets with the ensemble transfer learning method. Furthermore, the voting strategies increase the reliability of the ensemble, as the end user can select and trace the effect of each mechanism on the bounding-box predictions.
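
A minimal sketch of one possible detection-voting strategy of the kind described above (an illustration, not the paper's exact mechanism): boxes from several models are clustered by IoU, and a cluster is kept only when enough models agree. The thresholds are assumptions.

import numpy as np

def iou(a, b):
    """Intersection over union of two xyxy boxes."""
    x0, y0 = max(a[0], b[0]), max(a[1], b[1])
    x1, y1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x1 - x0) * max(0, y1 - y0)
    union = ((a[2] - a[0]) * (a[3] - a[1]) +
             (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0

def consensus_vote(per_model_boxes, iou_thr=0.5, min_votes=2):
    """per_model_boxes: list (one entry per model) of lists of xyxy boxes.
    Returns the boxes supported by at least min_votes models."""
    clusters = []  # each cluster: (representative box, set of voting models)
    for m, boxes in enumerate(per_model_boxes):
        for box in boxes:
            for rep, voters in clusters:
                if iou(rep, box) >= iou_thr:
                    voters.add(m)
                    break
            else:
                clusters.append((box, {m}))
    return [rep for rep, voters in clusters if len(voters) >= min_votes]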


2021 ◽  
Vol 93 ◽  
pp. 107182
Author(s):  
Haiyang Peng ◽  
Dingding Yang ◽  
Tianzhen Wang ◽  
Shreya Pandey ◽  
Lisu Chen ◽  
...  
