scholarly journals A Deep Learning-Based Embedding Framework for Object Detection and Recognition in Underwater Marine Organisms

Author(s):  
Jinde Zhua

Abstract The detection of marine organisms is an important part of the intelligent strategy in marine ranch, which requires an underwater robot to detect the marine organism quickly and accurately in the complex ocean environment. Based on the latest deep learning arithmetic, this paper put forward to find the marine organism in a picture or video to construct a real-time objective invention system for marine organisms. The neural network arithmetic: YOLOv4 was employed to extract the deep features of marine organisms, implementing the accurate detection and size detection of different fish can use arithmetic for evaluation in fisheries. Furthermore, improving the architecture of the backbone and the neck connection is called YOLOv4-embedding. As a result, compared with other object detection arithmetic, YOLOv4-embedding object detection arithmetic was better at detection accuracy--higher detection confidence and higher detection ratio than other one-stage object detection arithmetic, EfficientDet-D3 example. The consequence demonstrates that the suggested instrument could implement the rapid invention of different varieties in marine organisms. Compared to the YOLOv4, the mAP 75 of the YOLOv4-embedding achieves an improvement of 2.92% for the marine organism dataset at a rapid rate of ~51 FPS on RTX 3090, 60.8% AP 50 for the MS COCO dataset.

2021 ◽  
Author(s):  
Sung Hyun Noh ◽  
Chansik An ◽  
Dain Kim ◽  
Seung Hyun Lee ◽  
Min-Yung Chang ◽  
...  

Abstract Background A computer algorithm that automatically detects sacroiliac joint abnormalities on plain radiograph would help radiologists avoid missing sacroiliitis. This study aimed to develop and validate a deep learning model to detect and diagnose sacroiliitis on plain radiograph in young patients with low back pain. Methods This Institutional Review Board-approved retrospective study included 478 and 468 plain radiographs from 241 and 433 young (< 40 years) patients who complained of low back pain with and without ankylosing spondylitis, respectively. They were randomly split into training and test datasets with a ratio of 8:2. Radiologists reviewed the images and labeled the coordinates of a bounding box and determined the presence or absence of sacroiliitis for each sacroiliac joint. We fine-tined and optimized the EfficientDet-D4 object detection model pre-trained on the COCO 2107 dataset on the training dataset and validated the final model on the test dataset. Results The mean average precision, an evaluation metric for object detection accuracy, was 0.918 at 0.5 intersection over union. In the diagnosis of sacroiliitis, the area under the curve, sensitivity, specificity, accuracy, and F1-score were 0.932 (95% confidence interval, 0.903–0.961), 96.9% (92.9–99.0), 86.8% (81.5–90.9), 91.1% (87.7–93.7), and 90.2% (85.0–93.9), respectively. Conclusions The EfficientDet, a deep learning-based object detection algorithm, could be used to automatically diagnose sacroiliitis on plain radiograph.


2020 ◽  
Vol 17 (8) ◽  
pp. 3478-3483
Author(s):  
V. Sravan Chowdary ◽  
G. Penchala Sai Teja ◽  
D. Mounesh ◽  
G. Manideep ◽  
C. T. Manimegalai

Road injuries are a big drawback in society for a few time currently. Ignoring sign boards while moving on roads has significantly become a major cause for road accidents. Thus we came up with an approach to face this issue by detecting the sign board and recognition of sign board. At this moment there are several deep learning models for object detection using totally different algorithms like RCNN, faster RCNN, SPP-net, etc. We prefer to use Yolo-3, which improves the speed and precision of object detection. This algorithm will increase the accuracy by utilizing residual units, skip connections and up-sampling. This algorithm uses a framework named Dark-net. This framework is intended specifically to create the neural network for training the Yolo algorithm. To thoroughly detect the sign board, we used this algorithm.


Mekatronika ◽  
2020 ◽  
Vol 2 (2) ◽  
pp. 49-54
Author(s):  
Arzielah Ashiqin Alwi ◽  
Ahmad Najmuddin Ibrahim ◽  
Muhammad Nur Aiman Shapiee ◽  
Muhammad Ar Rahim Ibrahim ◽  
Mohd Azraai Mohd Razman ◽  
...  

Dynamic gameplay, fast-paced and fast-changing gameplay, where angle shooting (top and bottom corner) has the best chance of a good goal, are the main aspects of handball. When it comes to the narrow-angle area, the goalkeeper has trouble blocked the goal. Therefore, this research discusses image processing to investigate the shooting precision performance analysis to detect the ball's accuracy at high speed. In the handball goal, the participants had to complete 50 successful shots at each of the four target locations. Computer vision will then be implemented through a camera to identify the ball, followed by determining the accuracy of the ball position of floating, net tangle and farthest or smallest using object detection as the accuracy marker. The model will be trained using Deep Learning (DL)  models of YOLOv2, YOLOv3, and Faster R-CNN and the best precision models of ball detection accuracy were compared. It was found that the best performance of the accuracy of the classifier Faster R-CNN produces 99% for all ball positions.


2021 ◽  
Vol 2021 ◽  
pp. 1-16
Author(s):  
Di Tian ◽  
Yi Han ◽  
Biyao Wang ◽  
Tian Guan ◽  
Wei Wei

Pedestrian detection is a specific application of object detection. Compared with general object detection, it shows similarities and unique characteristics. In addition, it has important application value in the fields of intelligent driving and security monitoring. In recent years, with the rapid development of deep learning, pedestrian detection technology has also made great progress. However, there still exists a huge gap between it and human perception. Meanwhile, there are still a lot of problems, and there remains a lot of room for research. Regarding the application of pedestrian detection in intelligent driving technology, it is of necessity to ensure its real-time performance. Additionally, it is necessary to lighten the model while ensuring detection accuracy. This paper first briefly describes the development process of pedestrian detection and then concentrates on summarizing the research results of pedestrian detection technology in the deep learning stage. Subsequently, by summarizing the pedestrian detection dataset and evaluation criteria, the core issues of the current development of pedestrian detection are analyzed. Finally, the next possible development direction of pedestrian detection technology is explained at the end of the paper.


2019 ◽  
Vol 11 (7) ◽  
pp. 786 ◽  
Author(s):  
Yang-Lang Chang ◽  
Amare Anagaw ◽  
Lena Chang ◽  
Yi Wang ◽  
Chih-Yu Hsiao ◽  
...  

Synthetic aperture radar (SAR) imagery has been used as a promising data source for monitoring maritime activities, and its application for oil and ship detection has been the focus of many previous research studies. Many object detection methods ranging from traditional to deep learning approaches have been proposed. However, majority of them are computationally intensive and have accuracy problems. The huge volume of the remote sensing data also brings a challenge for real time object detection. To mitigate this problem a high performance computing (HPC) method has been proposed to accelerate SAR imagery analysis, utilizing the GPU based computing methods. In this paper, we propose an enhanced GPU based deep learning method to detect ship from the SAR images. The You Only Look Once version 2 (YOLOv2) deep learning framework is proposed to model the architecture and training the model. YOLOv2 is a state-of-the-art real-time object detection system, which outperforms Faster Region-Based Convolutional Network (Faster R-CNN) and Single Shot Multibox Detector (SSD) methods. Additionally, in order to reduce computational time with relatively competitive detection accuracy, we develop a new architecture with less number of layers called YOLOv2-reduced. In the experiment, we use two types of datasets: A SAR ship detection dataset (SSDD) dataset and a Diversified SAR Ship Detection Dataset (DSSDD). These two datasets were used for training and testing purposes. YOLOv2 test results showed an increase in accuracy of ship detection as well as a noticeable reduction in computational time compared to Faster R-CNN. From the experimental results, the proposed YOLOv2 architecture achieves an accuracy of 90.05% and 89.13% on the SSDD and DSSDD datasets respectively. The proposed YOLOv2-reduced architecture has a similarly competent detection performance as YOLOv2, but with less computational time on a NVIDIA TITAN X GPU. The experimental results shows that the deep learning can make a big leap forward in improving the performance of SAR image ship detection.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 194228-194239 ◽  
Author(s):  
Yanfen Li ◽  
Hanxiang Wang ◽  
L. Minh Dang ◽  
Tan N. Nguyen ◽  
Dongil Han ◽  
...  

2020 ◽  
Vol 12 (3) ◽  
pp. 458 ◽  
Author(s):  
Ugur Alganci ◽  
Mehmet Soydas ◽  
Elif Sertel

Object detection from satellite images has been a challenging problem for many years. With the development of effective deep learning algorithms and advancement in hardware systems, higher accuracies have been achieved in the detection of various objects from very high-resolution (VHR) satellite images. This article provides a comparative evaluation of the state-of-the-art convolutional neural network (CNN)-based object detection models, which are Faster R-CNN, Single Shot Multi-box Detector (SSD), and You Look Only Once-v3 (YOLO-v3), to cope with the limited number of labeled data and to automatically detect airplanes in VHR satellite images. Data augmentation with rotation, rescaling, and cropping was applied on the test images to artificially increase the number of training data from satellite images. Moreover, a non-maximum suppression algorithm (NMS) was introduced at the end of the SSD and YOLO-v3 flows to get rid of the multiple detection occurrences near each detected object in the overlapping areas. The trained networks were applied to five independent VHR test images that cover airports and their surroundings to evaluate their performance objectively. Accuracy assessment results of the test regions proved that Faster R-CNN architecture provided the highest accuracy according to the F1 scores, average precision (AP) metrics, and visual inspection of the results. The YOLO-v3 ranked as second, with a slightly lower performance but providing a balanced trade-off between accuracy and speed. The SSD provided the lowest detection performance, but it was better in object localization. The results were also evaluated in terms of the object size and detection accuracy manner, which proved that large- and medium-sized airplanes were detected with higher accuracy.


Sign in / Sign up

Export Citation Format

Share Document