Underwater Object Detection and Pose Estimation using Deep Learning

Developing innovative and pervasive smart technologies that provide medical support and improve the welfare of the elderly has become increasingly important as populations age. Elderly people frequently experience discomfort in their daily lives, including the deterioration of cognitive and memory abilities. To provide auxiliary functions and ensure the safety of the elderly in daily living situations, we propose a projection-based augmented reality (PAR) system equipped with a deep-learning module. Specifically, we propose a three-dimensional space reconstruction of a pervasive PAR space for the elderly and apply a deep-learning module to lay the foundation for contextual awareness. Performance experiments were conducted in which the deep-learning framework (pose estimation, face recognition, and object detection) was integrated with the PAR technology on the proposed hardware, to verify feasibility, real-time execution, and applicability. Pose estimation yielded particularly high precision for the face pose, which is used to determine an abnormal user state. For face recognition across all classes, the average detection rate (DR) was 74.84% and the precision was 78.72%. With face occlusions, however, the average DR was 46.83%, confirming that face recognition performs properly as long as face occlusion is infrequent. In the object detection experiments, the DR for a small object increased as its distance from the system decreased, whereas the miss rate for a large object increased as its distance from the system decreased. Scenarios for supporting the elderly, who experience degradation in movement and cognitive functions, were designed and implemented on the proposed platform. In addition, several user interfaces (UI) were implemented according to the scenarios, regardless of the distance between users and the proposed system.
In this study, we developed a bidirectional PAR system that provides relevant information by understanding the user's environment and intended actions, rather than a unidirectional PAR system that simply presents information. We discuss the feasibility of care systems for the elderly that fuse PAR with deep-learning frameworks.
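
The abstract above reports a detection rate (DR), a precision, and a miss rate without stating how they were computed. The following is a minimal sketch under the common assumption that DR is the fraction of ground-truth instances that were detected (recall), precision is the fraction of detections that are correct, and miss rate is the complement of DR. The function names and the example counts are hypothetical and are not taken from the paper.

```python
def detection_rate(true_positives: int, false_negatives: int) -> float:
    """Assumed definition: detected ground-truth instances / all ground-truth instances."""
    total = true_positives + false_negatives
    return true_positives / total if total else 0.0


def precision(true_positives: int, false_positives: int) -> float:
    """Correct detections / all detections returned by the detector."""
    total = true_positives + false_positives
    return true_positives / total if total else 0.0


def miss_rate(true_positives: int, false_negatives: int) -> float:
    """Assumed definition: complement of the detection rate."""
    return 1.0 - detection_rate(true_positives, false_negatives)


if __name__ == "__main__":
    # Hypothetical counts, chosen only to illustrate the arithmetic.
    tp, fp, fn = 3, 1, 1
    print(f"DR        = {detection_rate(tp, fn):.2%}")
    print(f"precision = {precision(tp, fp):.2%}")
    print(f"miss rate = {miss_rate(tp, fn):.2%}")
```

Under these definitions a detector can have a precision higher than its DR, as in the reported figures, when it misses many instances but rarely raises false alarms.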

Underwater Object Detection using Transfer Learning with Deep Learning

Proceedings of the 2020 International Conference on Computers, Information Processing and Advanced Education ◽

10.1145/3419635.3419678 ◽

2020 ◽

Author(s):

Zhu Kaiyan ◽

Li Xiang ◽

Song Weibo

Keyword(s):

Deep Learning ◽

Object Detection ◽

Transfer Learning ◽

Underwater Object

Deep Learning for Underwater Object Detection

24th Pan-Hellenic Conference on Informatics ◽

10.1145/3437120.3437301 ◽

2020 ◽

Author(s):

Panagiotis Rizos ◽

Vana Kalogeraki

Keyword(s):

Deep Learning ◽

Object Detection ◽

Underwater Object

Underwater object detection using Invert Multi-Class Adaboost with deep learning

2020 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn48605.2020.9207506 ◽

2020 ◽

Author(s):

Long Chen ◽

Zhihua Liu ◽

Lei Tong ◽

Zheheng Jiang ◽

Shengke Wang ◽

...

Keyword(s):

Deep Learning ◽

Object Detection ◽

Underwater Object

Realistic Sonar Image Simulation Using Deep Learning for Underwater Object Detection

International Journal of Control Automation and Systems ◽

10.1007/s12555-019-0691-3 ◽

2020 ◽

Vol 18 (3) ◽

pp. 523-534 ◽

Cited By ~ 1

Author(s):

Minsung Sung ◽

Jason Kim ◽

Meungsuk Lee ◽

Byeongjin Kim ◽

Taesik Kim ◽

...

Keyword(s):

Deep Learning ◽

Object Detection ◽

Image Simulation ◽

Sonar Image ◽

Underwater Object

A survey on joint object detection and pose estimation using monocular vision

MATEC Web of Conferences ◽

10.1051/matecconf/201927702029 ◽

2019 ◽

Vol 277 ◽

pp. 02029

Author(s):

Aniruddha V Patil ◽

Pankaj Rabha

Keyword(s):

Deep Learning ◽

Object Detection ◽

Pose Estimation ◽

Monocular Vision ◽

Estimation Methods ◽

Probabilistic Networks ◽

Hybrid Approaches ◽

Genetic Matching ◽

Multi Stage ◽

Traditional Approaches

In this survey we present a complete landscape of joint object detection and pose estimation methods that use monocular vision. Descriptions of traditional approaches that involve descriptors or models and various estimation methods have been provided. These descriptors or models include chordiograms, shape-aware deformable parts model, bag of boundaries, distance transform templates, natural 3D markers and facet features whereas the estimation methods include iterative clustering estimation, probabilistic networks and iterative genetic matching. Hybrid approaches that use handcrafted feature extraction followed by estimation by deep learning methods have been outlined. We have investigated and compared, wherever possible, pure deep learning based approaches (single stage and multi stage) for this problem. Comprehensive details of the various accuracy measures and metrics have been illustrated. For the purpose of giving a clear overview, the characteristics of relevant datasets are discussed. The trends that prevailed from the infancy of this problem until now have also been highlighted.
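
The abstract mentions accuracy measures for pose estimation without naming them. As an illustration only, the sketch below computes one widely used measure, the geodesic angle between an estimated and a ground-truth rotation matrix, theta = arccos((trace(R_est^T R_gt) - 1) / 2). Whether the survey covers this particular measure is an assumption, and the helper names `rotation_error_deg` and `rot_z` are hypothetical.

```python
import math


def rotation_error_deg(r_est, r_gt):
    """Geodesic angle in degrees between two 3x3 rotation matrices (nested lists)."""
    # trace(R_est^T R_gt) equals the sum of the element-wise products.
    trace = sum(r_est[i][j] * r_gt[i][j] for i in range(3) for j in range(3))
    # Clamp to [-1, 1] to guard against floating-point round-off.
    cos_theta = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_theta))


def rot_z(angle_deg):
    """Rotation about the z-axis by angle_deg degrees, used here to build test poses."""
    a = math.radians(angle_deg)
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


if __name__ == "__main__":
    # An estimate that is off by 30 degrees about the z-axis.
    print(rotation_error_deg(rot_z(30.0), rot_z(0.0)))
```

The measure is symmetric in its two arguments and depends only on the relative rotation, so it does not change if both poses are expressed in a different reference frame.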

A Comprehensive Review on 3D Object Detection and 6D Pose Estimation with Deep Learning

IEEE Access ◽

10.1109/access.2021.3114399 ◽

2021 ◽

pp. 1-1

Author(s):

Sabera Hoque ◽

MD. Yasir Arafat ◽

Shuxiang Xu ◽

Ananda Maiti ◽

Yuchen Wei

Keyword(s):

Deep Learning ◽

Object Detection ◽

Pose Estimation ◽

Comprehensive Review ◽

3D Object ◽

3D Object Detection

Pose Estimation for Non-cooperative Spacecraft based on Deep Learning

2020 39th Chinese Control Conference (CCC) ◽

10.23919/ccc50068.2020.9189253 ◽

2020 ◽

Author(s):

Wenxiu Huan ◽

Mingmin Liu ◽

Qinglei Hu

Keyword(s):

Deep Learning ◽

Pose Estimation

Saliency detection in deep learning era: trends of development

Information and Control Systems ◽

10.31799/1684-8853-2019-3-10-36 ◽

2019 ◽

pp. 10-36 ◽

Cited By ~ 2

Author(s):

M. N. Favorskaya ◽

L. C. Jain

Keyword(s):

Deep Learning ◽

Object Detection ◽

Event Detection ◽

Visual Analysis ◽

Saliency Detection ◽

Salient Object Detection ◽

Public Image ◽

Detection Methods ◽

Salient Object ◽

Salient Event

Introduction: Saliency detection is a fundamental task of computer vision. Its ultimate aim is to localize the objects of interest that grab human visual attention with respect to the rest of the image. A great variety of saliency models based on different approaches has been developed since the 1990s. In recent years, saliency detection has become one of the actively studied topics in the theory of Convolutional Neural Networks (CNNs). Many original solutions using CNNs have been proposed for salient object detection and even event detection. Purpose: A detailed survey of saliency detection methods in the deep learning era makes it possible to understand the current capabilities of the CNN approach for visual analysis conducted through human eye tracking and digital image processing. Results: The survey reflects recent advances in saliency detection using CNNs. Different models available in the literature, such as static and dynamic 2D CNNs for salient object detection and 3D CNNs for salient event detection, are discussed in chronological order. It is worth noting that automatic salient event detection in long videos became possible by combining the recently introduced 3D CNNs with 2D CNNs for salient audio detection. The article also gives a short description of public image and video datasets with annotated salient objects or events, as well as the metrics often used to evaluate results. Practical relevance: This survey is intended as a contribution to the study of rapidly developing deep learning methods for saliency detection in images and videos.
