Deep learning is combined with massive-scale citizen science to improve large-scale image classification

Deep learning models have demonstrated improved efficacy in image classification since the ImageNet Large Scale Visual Recognition Challenge started since 2010. Classification of images has further augmented in the field of computer vision with the dawn of transfer learning. To train a model on huge dataset demands huge computational resources and add a lot of cost to learning. Transfer learning allows to reduce on cost of learning and also help avoid reinventing the wheel. There are several pretrained models like VGG16, VGG19, ResNet50, Inceptionv3, EfficientNet etc which are widely used. This paper demonstrates image classification using pretrained deep neural network model VGG16 which is trained on images from ImageNet dataset. After obtaining the convolutional base model, a new deep neural network model is built on top of it for image classification based on fully connected network. This classifier will use features extracted from the convolutional base model.

Download Full-text

Active Learning Plus Deep Learning Can Establish Cost-Effective and Robust Model for Multichannel Image: A Case on Hyperspectral Image Classification

Sensors ◽

10.3390/s20174975 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4975

Author(s):

Fangyu Shi ◽

Zhaodi Wang ◽

Menghan Hu ◽

Guangtao Zhai

Keyword(s):

Deep Learning ◽

Active Learning ◽

Image Classification ◽

Large Scale ◽

Hyperspectral Image ◽

Image Annotation ◽

Learning Algorithm ◽

Magnetic Resonance Images ◽

Biological Engineering ◽

Hyperspectral Image Classification

Relying on large scale labeled datasets, deep learning has achieved good performance in image classification tasks. In agricultural and biological engineering, image annotation is time-consuming and expensive. It also requires annotators to have technical skills in specific areas. Obtaining the ground truth is difficult because natural images are expensive. In addition, images in these areas are usually stored as multichannel images, such as computed tomography (CT) images, magnetic resonance images (MRI), and hyperspectral images (HSI). In this paper, we present a framework using active learning and deep learning for multichannel image classification. We use three active learning algorithms, including least confidence, margin sampling, and entropy, as the selection criteria. Based on this framework, we further introduce an “image pool” to make full advantage of images generated by data augmentation. To prove the availability of the proposed framework, we present a case study on agricultural hyperspectral image classification. The results show that the proposed framework achieves better performance compared with the deep learning model. Manual annotation of all the training sets achieves an encouraging accuracy. In comparison, using active learning algorithm of entropy and image pool achieves a similar accuracy with only part of the whole training set manually annotated. In practical application, the proposed framework can remarkably reduce labeling effort during the model development and upadting processes, and can be applied to multichannel image classification in agricultural and biological engineering.

Download Full-text

Deep learning identification for citizen science surveillance of tiger mosquitoes

Scientific Reports ◽

10.1038/s41598-021-83657-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Balint Armin Pataki ◽

Joan Garriga ◽

Roger Eritja ◽

John R. B. Palmer ◽

Frederic Bartumeus ◽

...

Keyword(s):

Deep Learning ◽

Data Collection ◽

Citizen Science ◽

Large Scale ◽

Mosquito Species ◽

Characteristic Curve ◽

Science System ◽

Public Health Agencies ◽

Manual Inspection ◽

Long Run

AbstractGlobal monitoring of disease vectors is undoubtedly becoming an urgent need as the human population rises and becomes increasingly mobile, international commercial exchanges increase, and climate change expands the habitats of many vector species. Traditional surveillance of mosquitoes, vectors of many diseases, relies on catches, which requires regular manual inspection and reporting, and dedicated personnel, making large-scale monitoring difficult and expensive. New approaches are solving the problem of scalability by relying on smartphones and the Internet to enable novel community-based and digital observatories, where people can upload pictures of mosquitoes whenever they encounter them. An example is the Mosquito Alert citizen science system, which includes a dedicated mobile phone app through which geotagged images are collected. This system provides a viable option for monitoring the spread of various mosquito species across the globe, although it is partly limited by the quality of the citizen scientists’ photos. To make the system useful for public health agencies, and to give feedback to the volunteering citizens, the submitted images are inspected and labeled by entomology experts. Although citizen-based data collection can greatly broaden disease-vector monitoring scales, manual inspection of each image is not an easily scalable option in the long run, and the system could be improved through automation. Based on Mosquito Alert’s curated database of expert-validated mosquito photos, we trained a deep learning model to find tiger mosquitoes (Aedes albopictus), a species that is responsible for spreading chikungunya, dengue, and Zika among other diseases. The highly accurate 0.96 area under the receiver operating characteristic curve score promises not only a helpful pre-selector for the expert validation process but also an automated classifier giving quick feedback to the app participants, which may help to keep them motivated. In the paper, we also explored the possibilities of using the model to improve future data collection quality as a feedback loop.

Download Full-text

Combining citizen science and deep learning for large-scale estimation of outdoor nitrogen dioxide concentrations

Environmental Research ◽

10.1016/j.envres.2020.110389 ◽

2020 ◽

pp. 110389

Author(s):

Scott Weichenthal ◽

Evi Dons ◽

Kris Y. Hong ◽

Pedro O. Pinheiro ◽

Filip J.R. Meysman

Keyword(s):

Deep Learning ◽

Nitrogen Dioxide ◽

Citizen Science ◽

Large Scale ◽

Scale Estimation

Download Full-text

Combining citizen science and deep learning to amplify expertise in neuroimaging

10.1101/363382 ◽

2018 ◽

Cited By ~ 11

Author(s):

Anisha Keshavan ◽

Jason D. Yeatman ◽

Ariel Rokem

Keyword(s):

Decision Making ◽

Quality Control ◽

Big Data ◽

Deep Learning ◽

Citizen Science ◽

Large Scale ◽

Learning Algorithm ◽

Area Under The Curve ◽

Magnetic Resonance Images ◽

Complex Datasets

AbstractResearch in many fields has become increasingly reliant on large and complex datasets. “Big Data” holds untold promise to rapidly advance science by tackling new questions that cannot be answered with smaller datasets. While powerful, research with Big Data poses unique challenges, as many standard lab protocols rely on experts examining each one of the samples. This is not feasible for large-scale datasets because manual approaches are time-consuming and hence difficult to scale. Meanwhile, automated approaches lack the accuracy of examination by highly trained scientists and this may introduce major errors, sources of noise, and unforeseen biases into these large and complex datasets. Our proposed solution is to 1) start with a small, expertly labelled dataset, 2) amplify labels through web-based tools that engage citizen scientists, and 3) train machine learning on amplified labels to emulate expert decision making. As a proof of concept, we developed a system to quality control a large dataset of three-dimensional magnetic resonance images (MRI) of human brains. An initial dataset of 200 brain images labeled by experts were amplified by citizen scientists to label 722 brains, with over 80,000 ratings done through a simple web interface. A deep learning algorithm was then trained to predict data quality, based on a combination of the citizen scientist labels that accounts for differences in the quality of classification by different citizen scientists. In an ROC analysis (on left out test data), the deep learning network performed as well as a state-of-the-art, specialized algorithm (MRIQC) for quality control of T1-weighted images, each with an area under the curve of 0.99. Finally, as a specific practical application of the method, we explore how brain image quality relates to the replicability of a well established relationship between brain volume and age over development. Combining citizen science and deep learning can generalize and scale expert decision making; this is particularly important in emerging disciplines where specialized, automated tools do not already exist.

Download Full-text

Large-scale Landsat image classification based on deep learning methods

APSIPA Transactions on Signal and Information Processing ◽

10.1017/atsip.2019.18 ◽

2019 ◽

Vol 8 ◽

Author(s):

Xuemei Zhao ◽

Lianru Gao ◽

Zhengchao Chen ◽

Bing Zhang ◽

Wenzhi Liao

Keyword(s):

Deep Learning ◽

Image Classification ◽

Large Scale ◽

Texture Features ◽

Henan Province ◽

Small Samples ◽

Landsat Images ◽

Remote Sensing Image Classification ◽

Training Samples ◽

Geographical Regions

Deep learning has demonstrated its superiority in computer vision. Landsat images have specific characteristics compared with natural images. The spectral and texture features of the same class vary along with the imaging conditions. In this paper, we extend the use of deep learning to remote sensing image classification to large geographical regions, and explore a way to make deep learning classifiers transferable for different regions. We take Jingjinji region and Henan province in China as the study areas, and choose FCN, ResNet, and PSPNet as classifiers. The models are trained by different proportions of training samples from Jingjinji region. Then we use the trained models to predict results of the study areas. Experimental results show that the overall accuracy decreases when trained by small samples, but the recognition ability on mislabeled areas increases. All methods can obtain great performance when used to Jingjinji region while they all need to be fine-tuned with new training samples from Henan province, due to the reason that images of Henan province have different spectral features from the original trained area.

Download Full-text

HyAdamC: A New Adam-Based Hybrid Optimization Algorithm for Convolution Neural Networks

Sensors ◽

10.3390/s21124054 ◽

2021 ◽

Vol 21 (12) ◽

pp. 4054

Author(s):

Kyung-Soo Kim ◽

Yong-Suk Choi

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Robust Optimization ◽

Image Classification ◽

Large Scale ◽

Optimization Methods ◽

Optimization Method ◽

Hybrid Optimization ◽

Computation Method ◽

Convolution Neural Networks

As the performance of devices that conduct large-scale computations has been rapidly improved, various deep learning models have been successfully utilized in various applications. Particularly, convolution neural networks (CNN) have shown remarkable performance in image processing tasks such as image classification and segmentation. Accordingly, more stable and robust optimization methods are required to effectively train them. However, the traditional optimizers used in deep learning still have unsatisfactory training performance for the models with many layers and weights. Accordingly, in this paper, we propose a new Adam-based hybrid optimization method called HyAdamC for training CNNs effectively. HyAdamC uses three new velocity control functions to adjust its search strength carefully in term of initial, short, and long-term velocities. Moreover, HyAdamC utilizes an adaptive coefficient computation method to prevent that a search direction determined by the first momentum is distorted by any outlier gradients. Then, these are combined into one hybrid method. In our experiments, HyAdamC showed not only notable test accuracies but also significantly stable and robust optimization abilities when training various CNN models. Furthermore, we also found that HyAdamC could be applied into not only image classification and image segmentation tasks.

Download Full-text

DeepME: Deep Mixture Experts for Large-scale Image Classification

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/100 ◽

2021 ◽

Author(s):

Ming He ◽

Guangyi Lv ◽

Weidong He ◽

Jianping Fan ◽

Guihua Zeng

Keyword(s):

Deep Learning ◽

Image Classification ◽

Spectral Clustering ◽

Large Scale ◽

Clustering Method ◽

Task Groups ◽

Large Numbers ◽

Deep Networks ◽

Classification Tasks ◽

Spectral Clustering Method

Although deep learning has demonstrated its outstanding performance on image classification, most well-known deep networks make efforts to optimize both their structures and their node weights for recognizing fewer (e.g., no more than 1000) object classes. Therefore, it is attractive to extend or mixture such well-known deep networks to support large-scale image classification. According to our best knowledge, how to adaptively and effectively fuse multiple CNNs for large-scale image classification is still under-explored. On this basis, a deep mixture algorithm is developed to support large-scale image classification in this paper. First, a soft spectral clustering method is developed to construct a two-layer ontology (group layer and category layer) by assigning large numbers of image categories into a set of groups according to their inter-category semantic correlations, where the semantically-related image categories under the neighbouring group nodes may share similar learning complexities. Then, such two-layer ontology is further used to generate the task groups, in which each task group contains partial image categories with similar learning complexities and one particular base deep network is learned. Finally, a gate network is learned to combine all base deep networks with fewer diverse outputs to generate a mixture network with larger outputs. Our experimental results on ImageNet10K have demonstrated that our proposed deep mixture algorithm can achieve very competitive results (top 1 accuracy: 32.13%) on large-scale image classification tasks.

Download Full-text

Identifikasi Foto Wanita Berhijab dari Majalah Untuk Pembuatan Katalog Busana Muslim Otomatis Memanfaatkan Convolutional Neural Network

Journal of Intelligent System and Computation ◽

10.52985/insyst.v1i2.87 ◽

2019 ◽

Vol 1 (2) ◽

pp. 85-91

Author(s):

M. Najamudin Ridha ◽

Endang Setyati ◽

Yosi Kristian

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Convolutional Neural Network ◽

Image Classification ◽

Large Scale ◽

Data Augmentation ◽

Rectified Linear Unit

Abstrak—Perkembangan Fashion Muslim di Indonesia terus meningkat, disisi lain terobosan baru pada Deep Learning dengan memadukan arsitektur seperti dropout regularizations dan Rectified Linear Unit (ReLU) sebagai fungsi aktivasi dan data augmentation, mampu mencapai terobosan pada large scale image classification. Penelitian ini menggunakan metode deteksi objek wajah dengan Haar Cascades Classification untuk mendapatkan sample dataset wajah dan preprocessing data testing untuk dilanjutkan pada metode machine learning untuk klasifikasi citra dengan Convolutional Neural Network. Dataset yang digunakan adalah kumpulan katalog busana online, dataset yang sudah di preprocessing dibagi menjadi dua kategori, yaitu Hijab untuk semua citra wanita berhijab, dan Non Hijab untuk citra yang bukan wanita berhijab. selanjutnya klasifikasi citra menggunakan data ujicoba majalah digital terbitan Hijabella, Joy Indonesia dan Scarf Indonesia. Semakin besar resolusi citra input untuk preprocessing pada majalah digital, maka akan semakin banyak objek citra yang terdeteksi, dengan meningkatkan jumlah dataset untuk training dan validasi, mampu menambah hasil akurasi yang didapatkan, terjadi peningkatan akurasi pada dataset 2.500 wajah perkategori ke 5.000 wajah perkategori dengan resolusi 720p meningkat dari rata-rata 81.30% menjadi 82.31%, peningkatan rata-rata 1.01% dan tertinggi 2.14%, sedangkan resolusi 1080p meningkat dari rata-rata 83.03% menjadi 83.68%, peningkatan rata-rata 0.65% dan tertinggi 1.73%, akurasi tertinggi adalah sebesar 84.72% menggunakan model dataset 5.000 secara acak perkategori.

Download Full-text

A Survey on Computer Vision Architectures for Large Scale Image Classification using Deep Learning

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2021.0121013 ◽

2021 ◽

Vol 12 (10) ◽

Author(s):

D. Dakshayani Himabindu ◽

S. Praveen Kumar

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Image Classification ◽

Large Scale

Download Full-text