Multiple Instance Learning Convolutional Neural Networks for Fine-Grained Aircraft Recognition

The key to fine-grained aircraft recognition is discovering the subtle traits that can distinguish different subcategories. Early approaches leverage part annotations of fine-grained objects to derive rich representations. However, manual labeling part information is cumbersome. In response to this issue, previous CNN-based methods reuse the backbone network to extract part-discrimination features, the inference process of which consumes much time. Therefore, we introduce generalized multiple instance learning (MIL) into fine-grained recognition. In generalized MIL, an aircraft is assumed to consist of multiple instances (such as head, tail, and body). Firstly, instance-level representations are obtained by the feature extractor and instance conversion component. Secondly, the obtained instance features are scored by an MIL classifier, which can yield high-level part semantics. Finally, a fine-grained object label is inferred by a MIL pooling function that aggregates multiple instance scores. The proposed approach is trained end-to-end without part annotations and complex location networks. Experimental evidence is conducted to prove the feasibility and effectiveness of our approach on combined aircraft images (CAIs).

Download Full-text

Interpretation of Swedish Sign Language Using Convolutional Neural Networks and Transfer Learning

SN Computer Science ◽

10.1007/s42979-021-00612-w ◽

2021 ◽

Vol 2 (3) ◽

Author(s):

Gustaf Halvardsson ◽

Johanna Peterson ◽

César Soto-Valero ◽

Benoit Baudry

Keyword(s):

Neural Networks ◽

Sign Language ◽

Transfer Learning ◽

Convolutional Neural Networks ◽

Web Application ◽

Training Dataset ◽

Motion Processing ◽

Image Perception ◽

Sign Languages ◽

High Level

AbstractThe automatic interpretation of sign languages is a challenging task, as it requires the usage of high-level vision and high-level motion processing systems for providing accurate image perception. In this paper, we use Convolutional Neural Networks (CNNs) and transfer learning to make computers able to interpret signs of the Swedish Sign Language (SSL) hand alphabet. Our model consists of the implementation of a pre-trained InceptionV3 network, and the usage of the mini-batch gradient descent optimization algorithm. We rely on transfer learning during the pre-training of the model and its data. The final accuracy of the model, based on 8 study subjects and 9400 images, is 85%. Our results indicate that the usage of CNNs is a promising approach to interpret sign languages, and transfer learning can be used to achieve high testing accuracy despite using a small training dataset. Furthermore, we describe the implementation details of our model to interpret signs as a user-friendly web application.

Download Full-text

Firearm Detection via Convolutional Neural Networks: Comparing a Semantic Segmentation Model Against End-to-End Solutions

2020 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata50022.2020.9377745 ◽

2020 ◽

Author(s):

Alexander Egiazarov ◽

Fabio Massimo Zennaro ◽

Vasileios Mavroeidis

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Semantic Segmentation ◽

End To End

Download Full-text

An Automated End-To-End Pipeline for Fine-Grained Video Annotation using Deep Neural Networks

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval - ICMR '16 ◽

10.1145/2911996.2912028 ◽

2016 ◽

Cited By ~ 2

Author(s):

Baptist Vandersmissen ◽

Lucas Sterckx ◽

Thomas Demeester ◽

Azarakhsh Jalalvand ◽

Wesley De Neve ◽

...

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Video Annotation ◽

Fine Grained ◽

End To End

Download Full-text

An End-to-End Real-Time Face Identification and Attendance System using Convolutional Neural Networks

2019 IEEE 16th India Council International Conference (INDICON) ◽

10.1109/indicon47234.2019.9029001 ◽

2019 ◽

Author(s):

Aashish Rai ◽

Rashmi Karnani ◽

Vishal Chudasama ◽

Kishor Upla

Keyword(s):

Neural Networks ◽

Real Time ◽

Convolutional Neural Networks ◽

Face Identification ◽

End To End

Download Full-text

A Modular Software Library for Effective High Level Synthesis of Convolutional Neural Networks

Applied Reconfigurable Computing. Architectures, Tools, and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-030-44534-8_16 ◽

2020 ◽

pp. 211-220

Author(s):

Hector Gerardo Munoz Hernandez ◽

Safdar Mahmood ◽

Marcelo Brandalero ◽

Michael Hübner

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

High Level Synthesis ◽

Software Library ◽

Modular Software ◽

High Level

Download Full-text

Leukocyte Segmentation via End-to-End Learning of Deep Convolutional Neural Networks

Intelligence Science and Big Data Engineering. Visual Data Engineering - Lecture Notes in Computer Science ◽

10.1007/978-3-030-36189-1_16 ◽

2019 ◽

pp. 191-200 ◽

Cited By ~ 1

Author(s):

Yan Lu ◽

Haoyi Fan ◽

Zuoyong Li

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks ◽

End To End ◽

Leukocyte Segmentation

Download Full-text

Fully Automatic Cell Segmentation with Fourier Descriptors

10.1101/2021.12.17.472408 ◽

2021 ◽

Author(s):

Dominik Hirling ◽

Peter Horvath

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Fundamental Problem ◽

Cell Segmentation ◽

Fourier Descriptors ◽

Control Points ◽

Coefficient Vector ◽

Fully Automatic ◽

High Level ◽

And Control

Cell segmentation is a fundamental problem in biology for which convolutional neural networks yield the best results nowadays. In this paper, we present HarmonicNet, a network, which is a modification of the popular StarDist and SplineDist architectures. While StarDist and SplineDist describe an object by the lengths of equiangular rays and control points respectively, our network utilizes Fourier descriptors, predicting a coefficient vector for every pixel on the image, which implicitly define the resulting segmentation. We evaluate our model on three different datasets, and show that Fourier descriptors can achieve a high level of accuracy with a small number of coefficients. HarmonicNet is also capable of accurately segmenting objects that are not star-shaped, a case where StarDist performs suboptimally according to our experiments.

Download Full-text