Deep Learning Based Active Monitoring for Anti-collision between Vessels and Bridges

<p>Deep-learning based navigational object detection is discussed with respect to active monitoring system for anti-collision between vessel and bridge. Motion based object detection method widely used in existing anti-collision monitoring systems is incompetent in dealing with complicated and changeable waterway for its limitations in accuracy, robustness and efficiency. The video surveillance system proposed contains six modules, including image acquisition, detection, tracking, prediction, risk evaluation and decision-making, and the detection module is discussed in detail. A vessel-exclusive dataset with tons of image samples is established for neural network training and a SSD (Single Shot MultiBox Detector) based object detection model with both universality and pertinence is generated attributing to tactics of sample filtering, data augmentation and large-scale optimization, which make it capable of stable and intelligent vessel detection. Comparison results with conventional methods indicate that the proposed deep-learning method shows remarkable advantages in robustness, accuracy, efficiency and intelligence. In-situ test is carried out at Songpu Bridge in Shanghai, and the results illustrate that the method is qualified for long-term monitoring and providing information support for further analysis and decision making.</p>

Download Full-text

Road Object Detection: A Comparative Study of Deep Learning-Based Algorithms

Electronics ◽

10.3390/electronics10161932 ◽

2021 ◽

Vol 10 (16) ◽

pp. 1932

Author(s):

Malik Haris ◽

Adam Glowacz

Keyword(s):

Image Processing ◽

Deep Learning ◽

Object Detection ◽

Real Time ◽

Large Scale ◽

Single Shot ◽

Automated Driving ◽

Convolutional Network ◽

Image Processing Algorithms ◽

Processing Algorithms

Automated driving and vehicle safety systems need object detection. It is important that object detection be accurate overall and robust to weather and environmental conditions and run in real-time. As a consequence of this approach, they require image processing algorithms to inspect the contents of images. This article compares the accuracy of five major image processing algorithms: Region-based Fully Convolutional Network (R-FCN), Mask Region-based Convolutional Neural Networks (Mask R-CNN), Single Shot Multi-Box Detector (SSD), RetinaNet, and You Only Look Once v4 (YOLOv4). In this comparative analysis, we used a large-scale Berkeley Deep Drive (BDD100K) dataset. Their strengths and limitations are analyzed based on parameters such as accuracy (with/without occlusion and truncation), computation time, precision-recall curve. The comparison is given in this article helpful in understanding the pros and cons of standard deep learning-based algorithms while operating under real-time deployment restrictions. We conclude that the YOLOv4 outperforms accurately in detecting difficult road target objects under complex road scenarios and weather conditions in an identical testing environment.

Download Full-text

Deep Metallic Surface Defect Detection: The New Benchmark and Detection Network

Sensors ◽

10.3390/s20061562 ◽

2020 ◽

Vol 20 (6) ◽

pp. 1562 ◽

Cited By ~ 1

Author(s):

Xiaoming Lv ◽

Fajie Duan ◽

Jia-jia Jiang ◽

Xiao Fu ◽

Lin Gan

Keyword(s):

Defect Detection ◽

Large Scale ◽

Surface Defect ◽

Data Augmentation ◽

Training Data ◽

Metallic Surface ◽

Single Shot ◽

Limited Data ◽

Detection Model ◽

Surface Defect Detection

Metallic surface defect detection is an essential and necessary process to control the qualities of industrial products. However, due to the limited data scale and defect categories, existing defect datasets are generally unavailable for the deployment of the detection model. To address this problem, we contribute a new dataset called GC10-DET for large-scale metallic surface defect detection. The GC10-DET dataset has great challenges on defect categories, image number, and data scale. Besides, traditional detection approaches are poor in both efficiency and accuracy for the complex real-world environment. Thus, we also propose a novel end-to-end defect detection network (EDDN) based on the Single Shot MultiBox Detector. The EDDN model can deal with defects with different scales. Furthermore, a hard negative mining method is designed to alleviate the problem of data imbalance, while some data augmentation methods are adopted to enrich the training data for the expensive data collection problem. Finally, the extensive experiments on two datasets demonstrate that the proposed method is robust and can meet accuracy requirements for metallic defect detection.

Download Full-text

Hybrid Techniques for Object Detection using Deep Learning

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f8165.038620 ◽

2020 ◽

Vol 8 (6) ◽

pp. 1959-1963

Keyword(s):

Deep Learning ◽

Object Detection ◽

Lane Detection ◽

Single Shot ◽

Efficient Utilization ◽

The Road ◽

Detection Model ◽

Hybrid Techniques ◽

Geographical Regions ◽

Improved Performance

Deep learning is a one of the major concept of Artificial Intelligence and Machine learning, which deals with the object detection task. On the other hand, a new targeted dataset is built according to commonly used existing datasets, and two networks called Single Shot Multi box Detector (SSD) and You Only Look Once (YOLO) are chosen to work on this new dataset. Through experimentation strengthen the understanding of these networks, and through the analysis of the results, learn the importance of targeted and inclusive datasets for deep learning. In addition, to this optimize the networks for efficient utilization when integrated with the necessary system or application. Further, explore the applications corresponding to these networks. The implementation includes two major concepts. The first concept is Object detection. Object detection is the process of object recognition and classification. There are several Training sets available online for training an object detection model. But the models are not trained to detect the same object from different geographical regions. The second concept is lane detection and steering suggestion. The model detects using the concept of radius or curvature of the road and also distance of the car from both the lane lines. Using these parameters it also gives steering suggestions such as move right or left by a certain distance. In addition to this it gives the distance and speed attributes of the surrounding objects such as cars, motorcycles, etc. Finally, the model developed is capable of detecting all the parameters required in order to be integrated and to create a self-driving car and it can be used efficiently in India. Using the above parameters that are obtained from the model the car can navigate through lanes in real-time. Its improved performance is due to the fact that it can detect road specific objects and because it is specifically trained for Indian roads.

Download Full-text

Design of Desktop Audiovisual Entertainment System with Deep Learning and Haptic Sensations

Symmetry ◽

10.3390/sym12101718 ◽

2020 ◽

Vol 12 (10) ◽

pp. 1718

Author(s):

Chien-Hsing Chou ◽

Yu-Sheng Su ◽

Che-Ju Hsu ◽

Kong-Chang Lee ◽

Ping-Hsuan Han

Keyword(s):

Deep Learning ◽

Object Detection ◽

User Experience ◽

Recognition System ◽

Scene Recognition ◽

Single Shot ◽

Auditory Signals ◽

Hot Weather ◽

Viewing Experience ◽

At Home

In this study, we designed a four-dimensional (4D) audiovisual entertainment system called Sense. This system comprises a scene recognition system and hardware modules that provide haptic sensations for users when they watch movies and animations at home. In the scene recognition system, we used Google Cloud Vision to detect common scene elements in a video, such as fire, explosions, wind, and rain, and further determine whether the scene depicts hot weather, rain, or snow. Additionally, for animated videos, we applied deep learning with a single shot multibox detector to detect whether the animated video contained scenes of fire-related objects. The hardware module was designed to provide six types of haptic sensations set as line-symmetry to provide a better user experience. After the system considers the results of object detection via the scene recognition system, the system generates corresponding haptic sensations. The system integrates deep learning, auditory signals, and haptic sensations to provide an enhanced viewing experience.

Download Full-text

Deep Learning-Based Differentiation between Mucinous Cystic Neoplasm and Serous Cystic Neoplasm in the Pancreas Using Endoscopic Ultrasonography

Diagnostics ◽

10.3390/diagnostics11061052 ◽

2021 ◽

Vol 11 (6) ◽

pp. 1052

Author(s):

Leang Sim Nguon ◽

Kangwon Seo ◽

Jung-Hyun Lim ◽

Tae-Jun Song ◽

Sung-Hyun Cho ◽

...

Keyword(s):

Decision Making ◽

Deep Learning ◽

Network Model ◽

Endoscopic Ultrasonography ◽

Data Augmentation ◽

Clinical Information ◽

Training Data ◽

Fine Tuning ◽

Cystic Neoplasm ◽

Cystic Neoplasms

Mucinous cystic neoplasms (MCN) and serous cystic neoplasms (SCN) account for a large portion of solitary pancreatic cystic neoplasms (PCN). In this study we implemented a convolutional neural network (CNN) model using ResNet50 to differentiate between MCN and SCN. The training data were collected retrospectively from 59 MCN and 49 SCN patients from two different hospitals. Data augmentation was used to enhance the size and quality of training datasets. Fine-tuning training approaches were utilized by adopting the pre-trained model from transfer learning while training selected layers. Testing of the network was conducted by varying the endoscopic ultrasonography (EUS) image sizes and positions to evaluate the network performance for differentiation. The proposed network model achieved up to 82.75% accuracy and a 0.88 (95% CI: 0.817–0.930) area under curve (AUC) score. The performance of the implemented deep learning networks in decision-making using only EUS images is comparable to that of traditional manual decision-making using EUS images along with supporting clinical information. Gradient-weighted class activation mapping (Grad-CAM) confirmed that the network model learned the features from the cyst region accurately. This study proves the feasibility of diagnosing MCN and SCN using a deep learning network model. Further improvement using more datasets is needed.

Download Full-text

Prediction of Crime Scene Objects Using Deep Learning

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.37512 ◽

2021 ◽

Vol 9 (VIII) ◽

pp. 857-864

Author(s):

Vibhavari B Rao

Keyword(s):

Deep Learning ◽

Object Detection ◽

Computer Architecture ◽

Real Time ◽

Surveillance System ◽

Crime Scene ◽

Detection Model ◽

Detection Algorithms ◽

Performance Metric ◽

Smart Surveillance

The crime rates today can inevitably put a civilian's life in danger. While consistent efforts are being made to alleviate crime, there is also a dire need to create a smart and proactive surveillance system. Our project implements a smart surveillance system that would alert the authorities in real-time when a crime is being committed. During armed robberies and hostage situations, most often, the police cannot reach the place on time to prevent it from happening, owing to the lag in communication between the informants of the crime scene and the police. We propose an object detection model that implements deep learning algorithms to detect objects of violence such as pistols, knives, rifles from video surveillance footage, and in turn send real-time alerts to the authorities. There are a number of object detection algorithms being developed, each being evaluated under the performance metric mAP. On implementing Faster R-CNN with ResNet 101 architecture we found the mAP score to be about 91%. However, the downside to this is the excessive training and inferencing time it incurs. On the other hand, YOLOv5 architecture resulted in a model that performed very well in terms of speed. Its training speed was found to be 0.012 s / image during training but naturally, the accuracy was not as high as Faster R-CNN. With good computer architecture, it can run at about 40 fps. Thus, there is a tradeoff between speed and accuracy and it's important to strike a balance. We use transfer learning to improve accuracy by training the model on our custom dataset. This project can be deployed on any generic CCTV camera by setting up a live RTSP (real-time streaming protocol) and streaming the footage on a laptop or desktop where the deep learning model is being run.

Download Full-text

Comparison of Object Detection and Patch-Based Classification Deep Learning Models on Mid- to Late-Season Weed Detection in UAV Imagery

Remote Sensing ◽

10.3390/rs12132136 ◽

2020 ◽

Vol 12 (13) ◽

pp. 2136 ◽

Cited By ~ 1

Author(s):

Arun Narenthiran Veeranampalayam Sivakumar ◽

Jiating Li ◽

Stephen Scott ◽

Eric Psota ◽

Amit J. Jhala ◽

...

Keyword(s):

Object Detection ◽

Weed Management ◽

Network Models ◽

Detection Performance ◽

Single Shot ◽

Neural Network Models ◽

Weed Detection ◽

Late Season ◽

Detection Model ◽

Growing Seasons

Mid- to late-season weeds that escape from the routine early-season weed management threaten agricultural production by creating a large number of seeds for several future growing seasons. Rapid and accurate detection of weed patches in field is the first step of site-specific weed management. In this study, object detection-based convolutional neural network models were trained and evaluated over low-altitude unmanned aerial vehicle (UAV) imagery for mid- to late-season weed detection in soybean fields. The performance of two object detection models, Faster RCNN and the Single Shot Detector (SSD), were evaluated and compared in terms of weed detection performance using mean Intersection over Union (IoU) and inference speed. It was found that the Faster RCNN model with 200 box proposals had similar good weed detection performance to the SSD model in terms of precision, recall, f1 score, and IoU, as well as a similar inference time. The precision, recall, f1 score and IoU were 0.65, 0.68, 0.66 and 0.85 for Faster RCNN with 200 proposals, and 0.66, 0.68, 0.67 and 0.84 for SSD, respectively. However, the optimal confidence threshold of the SSD model was found to be much lower than that of the Faster RCNN model, which indicated that SSD might have lower generalization performance than Faster RCNN for mid- to late-season weed detection in soybean fields using UAV imagery. The performance of the object detection model was also compared with patch-based CNN model. The Faster RCNN model yielded a better weed detection performance than the patch-based CNN with and without overlap. The inference time of Faster RCNN was similar to patch-based CNN without overlap, but significantly less than patch-based CNN with overlap. Hence, Faster RCNN was found to be the best model in terms of weed detection performance and inference time among the different models compared in this study. This work is important in understanding the potential and identifying the algorithms for an on-farm, near real-time weed detection and management.

Download Full-text

The Assisted Positioning Technology for High Speed Train Based on Deep Learning

Applied Sciences ◽

10.3390/app10238625 ◽

2020 ◽

Vol 10 (23) ◽

pp. 8625

Author(s):

Yali Song ◽

Yinghong Wen

Keyword(s):

Deep Learning ◽

High Speed ◽

Detection Method ◽

Generative Adversarial Networks ◽

Single Shot ◽

High Speed Train ◽

Data Set ◽

Detection Model ◽

Model Based ◽

Cumulative Error

In the positioning process of a high-speed train, cumulative error may result in a reduction in the positioning accuracy. The assisted positioning technology based on kilometer posts can be used as an effective method to correct the cumulative error. However, the traditional detection method of kilometer posts is time-consuming and complex, which greatly affects the correction efficiency. Therefore, in this paper, a kilometer post detection model based on deep learning is proposed. Firstly, the Deep Convolutional Generative Adversarial Networks (DCGAN) algorithm is introduced to construct an effective kilometer post data set. This greatly reduces the cost of real data acquisition and provides a prerequisite for the construction of the detection model. Then, by using the existing optimization as a reference and further simplifying the design of the Single Shot multibox Detector (SSD) model according to the specific application scenario of this paper, the kilometer post detection model based on an improved SSD algorithm is established. Finally, from the analysis of the experimental results, we know that the detection model established in this paper ensures both detection accuracy and efficiency. The accuracy of our model reached 98.92%, while the detection time was only 35.43 ms. Thus, our model realizes the rapid and accurate detection of kilometer posts and improves the assisted positioning technology based on kilometer posts by optimizing the detection method.

Download Full-text

An Embedded Deep Learning Object Detection Model For Traffic In Asian Countries

2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) ◽

10.1109/icmew46912.2020.9105994 ◽

2020 ◽

Author(s):

Weiju Chen ◽

WanChen Wu ◽

Hao-Wei Chang ◽

Wei-Liang Lin ◽

Changhua Yang ◽

...

Keyword(s):

Deep Learning ◽

Object Detection ◽

Learning Object ◽

Asian Countries ◽

Detection Model

Download Full-text

Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review—Part II: Applications

Remote Sensing ◽

10.3390/rs12183053 ◽

2020 ◽

Vol 12 (18) ◽

pp. 3053 ◽

Cited By ~ 1

Author(s):

Thorsten Hoeser ◽

Felix Bachofer ◽

Claudia Kuenzer

Keyword(s):

Image Segmentation ◽

Deep Learning ◽

Object Detection ◽

Land Surface ◽

Large Scale ◽

Earth Observation ◽

Observation Data ◽

Fine Grained ◽

The Impact ◽

Object Dynamics

In Earth observation (EO), large-scale land-surface dynamics are traditionally analyzed by investigating aggregated classes. The increase in data with a very high spatial resolution enables investigations on a fine-grained feature level which can help us to better understand the dynamics of land surfaces by taking object dynamics into account. To extract fine-grained features and objects, the most popular deep-learning model for image analysis is commonly used: the convolutional neural network (CNN). In this review, we provide a comprehensive overview of the impact of deep learning on EO applications by reviewing 429 studies on image segmentation and object detection with CNNs. We extensively examine the spatial distribution of study sites, employed sensors, used datasets and CNN architectures, and give a thorough overview of applications in EO which used CNNs. Our main finding is that CNNs are in an advanced transition phase from computer vision to EO. Upon this, we argue that in the near future, investigations which analyze object dynamics with CNNs will have a significant impact on EO research. With a focus on EO applications in this Part II, we complete the methodological review provided in Part I.

Download Full-text