Automatic fish detection in underwater videos by a deep neural network-based hybrid motion learning system

Abstract It is interesting to develop effective fish sampling techniques using underwater videos and image processing to automatically estimate and consequently monitor the fish biomass and assemblage in water bodies. Such approaches should be robust against substantial variations in scenes due to poor luminosity, orientation of fish, seabed structures, movement of aquatic plants in the background and image diversity in the shape and texture among fish of different species. Keeping this challenge in mind, we propose a unified approach to detect freely moving fish in unconstrained underwater environments using a Region-Based Convolutional Neural Network, a state-of-the-art machine learning technique used to solve generic object detection and localization problems. To train the neural network, we employ a novel approach to utilize motion information of fish in videos via background subtraction and optical flow, and subsequently combine the outcomes with the raw image to generate fish-dependent candidate regions. We use two benchmark datasets extracted from a large Fish4Knowledge underwater video repository, Complex Scenes dataset and the LifeCLEF 2015 fish dataset to validate the effectiveness of our hybrid approach. We achieve a detection accuracy (F-Score) of 87.44% and 80.02% respectively on these datasets, which advocate the utilization of our approach for fish detection task.

Download Full-text

Copy-Move Forgery Detection and Localization Using a Generative Adversarial Network and Convolutional Neural-Network

Information ◽

10.3390/info10090286 ◽

2019 ◽

Vol 10 (9) ◽

pp. 286

Author(s):

Younis Abdalla ◽

M. Tariq Iqbal ◽

Mohamed Shehata

Keyword(s):

Neural Network ◽

New Technologies ◽

Generative Adversarial Networks ◽

Detection Accuracy ◽

Forgery Detection ◽

Generative Adversarial Network ◽

Neural Network Approach ◽

Adversarial Network ◽

Copy Move Forgery Detection ◽

Detection And Localization

The problem of forged images has become a global phenomenon that is spreading mainly through social media. New technologies have provided both the means and the support for this phenomenon, but they are also enabling a targeted response to overcome it. Deep convolution learning algorithms are one such solution. These have been shown to be highly effective in dealing with image forgery derived from generative adversarial networks (GANs). In this type of algorithm, the image is altered such that it appears identical to the original image and is nearly undetectable to the unaided human eye as a forgery. The present paper investigates copy-move forgery detection using a fusion processing model comprising a deep convolutional model and an adversarial model. Four datasets are used. Our results indicate a significantly high detection accuracy performance (~95%) exhibited by the deep learning CNN and discriminator forgery detectors. Consequently, an end-to-end trainable deep neural network approach to forgery detection appears to be the optimal strategy. The network is developed based on two-branch architecture and a fusion module. The two branches are used to localize and identify copy-move forgery regions through CNN and GAN.

Download Full-text

Detecting and Localizing Dents on Vehicle Bodies Using Region-Based Convolutional Neural Network

Applied Sciences ◽

10.3390/app10041250 ◽

2020 ◽

Vol 10 (4) ◽

pp. 1250

Author(s):

Sung Hyun Park ◽

Amir Tjolleng ◽

Joonho Chang ◽

Myeongsup Cha ◽

Jongcheol Park ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Automobile Industry ◽

Absolute Error ◽

Vehicle Body ◽

Detection Accuracy ◽

Data Set ◽

Mach Bands ◽

Detection And Localization ◽

Vehicle Bodies

Detection and localization of the dents on a vehicle body that occurs during manufacturing is critical to achieve the appearance quality of a new vehicle. This study proposes a region-based convolutional neural network (R-CNN) to detect and localize dents for a vehicle body inspection. For a better feature extraction, this study employed a lighting system, which can highlight dents on an image by projecting the Mach bands (bright-dark stripes). The R-CNN was trained using the highlighted images by the Mach bands, and heat-maps were prepared with the classification scores estimated from the R-CNN to localize dents. This study applied the proposed R-CNN to the inspection of dents on the surface of a car body and quantitatively analyzed its performances. The detection accuracy of the dents was 98.5% for the testing data set, and mean absolute error between the actual dents and estimated dents were 13.7 pixels, which were close to one another. The proposed R-CNN could be applied to detect and localize surface dents during the manufacture of vehicle bodies in the automobile industry.

Download Full-text

Leveraging the Bhattacharyya coefficient for uncertainty quantification in deep neural networks

Neural Computing and Applications ◽

10.1007/s00521-021-05789-y ◽

2021 ◽

Author(s):

Pieter Van Molle ◽

Tim Verbelen ◽

Bert Vankeirsbilck ◽

Jonas De Vylder ◽

Bart Diricx ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Neural Networks ◽

State Of The Art ◽

Use Case ◽

Bhattacharyya Coefficient ◽

Output Uncertainty ◽

Novel Approach ◽

Benchmark Datasets ◽

Network Approaches

AbstractModern deep learning models achieve state-of-the-art results for many tasks in computer vision, such as image classification and segmentation. However, its adoption into high-risk applications, e.g. automated medical diagnosis systems, happens at a slow pace. One of the main reasons for this is that regular neural networks do not capture uncertainty. To assess uncertainty in classification, several techniques have been proposed casting neural network approaches in a Bayesian setting. Amongst these techniques, Monte Carlo dropout is by far the most popular. This particular technique estimates the moments of the output distribution through sampling with different dropout masks. The output uncertainty of a neural network is then approximated as the sample variance. In this paper, we highlight the limitations of such a variance-based uncertainty metric and propose an novel approach. Our approach is based on the overlap between output distributions of different classes. We show that our technique leads to a better approximation of the inter-class output confusion. We illustrate the advantages of our method using benchmark datasets. In addition, we apply our metric to skin lesion classification—a real-world use case—and show that this yields promising results.

Download Full-text

A Novel Approach to Coral Fish Detection And Classification in Underwater Footage Based on Convolutional Neural Network

Journal of Physics Conference Series ◽

10.1088/1742-6596/1650/3/032012 ◽

2020 ◽

Vol 1650 ◽

pp. 032012

Author(s):

Zhijian Sun

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Novel Approach ◽

Coral Fish ◽

Fish Detection

Download Full-text

Application of a Convolutional Neural Network for the Detection of Sea Ice Leads

Remote Sensing ◽

10.3390/rs13224571 ◽

2021 ◽

Vol 13 (22) ◽

pp. 4571

Author(s):

Jay P. Hoffman ◽

Steven A. Ackerman ◽

Yinghui Liu ◽

Jeffrey R. Key ◽

Iain L. McConnell

Keyword(s):

Neural Network ◽

Sea Ice ◽

Convolutional Neural Network ◽

Infrared Imaging ◽

Critical Role ◽

The Arctic ◽

Detection Accuracy ◽

New Approach ◽

Novel Approach ◽

Ice Leads

Despite accounting for a small fraction of the surface area in the Arctic, long and narrow sea ice fractures, known as “leads”, play a critical role in the energy flux between the ocean and atmosphere. As the volume of sea ice in the Arctic has declined over the past few decades, it is increasingly important to monitor the corresponding changes in sea ice leads. A novel approach has been developed using artificial intelligence (AI) to detect sea ice leads using satellite thermal infrared window data from the Moderate Resolution Imaging Spectroradiometer (MODIS) and the Visible Infrared Imaging Radiometer Suite (VIIRS). In this new approach, a particular type of convolutional neural network, a U-Net, replaces a series of conventional image processing tests from our legacy algorithm. Results show the new approach has a high detection accuracy with F1 Scores on the order of 0.7. Compared to the legacy algorithm, the new algorithm shows improvement, with more true positives, fewer false positives, fewer false negatives, and better agreement between satellite instruments.

Download Full-text

Salient Object Detection Based on Multiscale Segmentation and Fuzzy Broad Learning

The Computer Journal ◽

10.1093/comjnl/bxaa158 ◽

2020 ◽

Author(s):

Xiao Lin ◽

Zhi-Jie Wang ◽

Lizhuang Ma ◽

Renjie Li ◽

Mei-E Fang

Keyword(s):

Clustering Algorithm ◽

Saliency Detection ◽

Texture Features ◽

Saliency Map ◽

Label Propagation ◽

Learning System ◽

Saliency Maps ◽

Novel Approach ◽

Multiscale Segmentation ◽

Benchmark Datasets

Abstract Saliency detection has been a hot topic in the field of computer vision. In this paper, we propose a novel approach that is based on multiscale segmentation and fuzzy broad learning. The core idea of our method is to segment the image into different scales, and then the extracted features are fed to the fuzzy broad learning system (FBLS) for training. More specifically, it first segments the image into superpixel blocks at different scales based on the simple linear iterative clustering algorithm. Then, it uses the local binary pattern algorithm to extract texture features and computes the average color information for each superpixel of these segmentation images. These extracted features are then fed to the FBLS to obtain multiscale saliency maps. After that, it fuses these saliency maps into an initial saliency map and uses the label propagation algorithm to further optimize it, obtaining the final saliency map. We have conducted experiments based on several benchmark datasets. The results show that our solution can outperform several existing algorithms. Particularly, our method is significantly faster than most of deep learning-based saliency detection algorithms, in terms of training and inferring time.

Download Full-text

Detection and localization of helipad in autonomous UAV landing: a coupled visual-inertial approach with artificial intelligence

Transport and Communication Science Journal ◽

10.25073/tcsj.71.7.8 ◽

2020 ◽

Vol 71 (7) ◽

pp. 828-839

Author(s):

Thinh Hoang Dinh ◽

Hieu Le Thi Hong

Keyword(s):

Neural Network ◽

Unmanned Aerial Vehicles ◽

Fiducial Marker ◽

Localization Algorithm ◽

Challenging Problem ◽

Autonomous Landing ◽

Aerial Vehicles ◽

Set Up ◽

Detection And Localization ◽

Rotary Wing

Autonomous landing of rotary wing type unmanned aerial vehicles is a challenging problem and key to autonomous aerial fleet operation. We propose a method for localizing the UAV around the helipad, that is to estimate the relative position of the helipad with respect to the UAV. This data is highly desirable to design controllers that have robust and consistent control characteristics and can find applications in search – rescue operations. AI-based neural network is set up for helipad detection, followed by optimization by the localization algorithm. The performance of this approach is compared against fiducial marker approach, demonstrating good consensus between two estimations

Download Full-text

Accurate and Transferable Multitask Prediction of Chemical Properties with an Atoms-in-Molecule Neural Network

10.26434/chemrxiv.7151435.v2 ◽

2018 ◽

Author(s):

Roman Zubatyuk ◽

Justin S. Smith ◽

Jerzy Leszczynski ◽

Olexandr Isayev

Keyword(s):

Neural Network ◽

Molecular System ◽

Computational Cost ◽

Chemical Properties ◽

The State ◽

Molecular Properties ◽

Training Data ◽

Dft Methods ◽

Benchmark Datasets ◽

Quantum Phenomena

<p>Atomic and molecular properties could be evaluated from the fundamental Schrodinger’s equation and therefore represent different modalities of the same quantum phenomena. Here we present AIMNet, a modular and chemically inspired deep neural network potential. We used AIMNet with multitarget training to learn multiple modalities of the state of the atom in a molecular system. The resulting model shows on several benchmark datasets the state-of-the-art accuracy, comparable to the results of orders of magnitude more expensive DFT methods. It can simultaneously predict several atomic and molecular properties without an increase in computational cost. With AIMNet we show a new dimension of transferability: the ability to learn new targets utilizing multimodal information from previous training. The model can learn implicit solvation energy (like SMD) utilizing only a fraction of original training data, and archive MAD error of 1.1 kcal/mol compared to experimental solvation free energies in MNSol database.</p>

Download Full-text

A Hybrid Approach for Intrusion Detection using OPSO and Hybridization of Feed Forward Neural Network (FFNN) with Probabilistic Neural Network (PNN)- HFFPNN Classifier

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2020/31912020 ◽

2020 ◽

Vol 9 (1) ◽

pp. 206-210

Author(s):

Sangita Babu

Keyword(s):

Neural Network ◽

Intrusion Detection ◽

Probabilistic Neural Network ◽

Hybrid Approach ◽

Feed Forward Neural Network ◽

Feed Forward

Download Full-text

An Anatomy of a Hybrid Color Descriptor with a Neural Network Model to Enhance the Retrieval Accuracy of an Image Retrieval System

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666191122113801 ◽

2019 ◽

Vol 13 ◽

Author(s):

Shikha Bhardwaj ◽

Gitanjali Pandove ◽

Pawan Kumar Dahiya

Keyword(s):

Neural Network ◽

Deep Learning ◽

Image Retrieval ◽

Hybrid System ◽

Back Propagation ◽

Back Propagation Neural Network ◽

Retrieval Accuracy ◽

Color Descriptor ◽

Benchmark Datasets ◽

Color Moment

Background: In order to retrieve a particular image from vast repository of images, an efficient system is required and such an eminent system is well-known by the name Content-based image retrieval (CBIR) system. Color is indeed an important attribute of an image and the proposed system consist of a hybrid color descriptor which is used for color feature extraction. Deep learning, has gained a prominent importance in the current era. So, the performance of this fusion based color descriptor is also analyzed in the presence of Deep learning classifiers. Method: This paper describes a comparative experimental analysis on various color descriptors and the best two are chosen to form an efficient color based hybrid system denoted as combined color moment-color autocorrelogram (Co-CMCAC). Then, to increase the retrieval accuracy of the hybrid system, a Cascade forward back propagation neural network (CFBPNN) is used. The classification accuracy obtained by using CFBPNN is also compared to Patternnet neural network. Results: The results of the hybrid color descriptor depict that the proposed system has superior results of the order of 95.4%, 88.2%, 84.4% and 96.05% on Corel-1K, Corel-5K, Corel-10K and Oxford flower benchmark datasets respectively as compared to many state-of-the-art related techniques. Conclusion: This paper depict an experimental and analytical analysis on different color feature descriptors namely, Color moment (CM), Color auto-correlogram (CAC), Color histogram (CH), Color coherence vector (CCV) and Dominant color descriptor (DCD). The proposed hybrid color descriptor (Co-CMCAC) is utilized for the withdrawal of color features with Cascade forward back propagation neural network (CFBPNN) is used as a classifier on four benchmark datasets namely Corel-1K, Corel-5K and Corel-10K and Oxford flower.

Download Full-text