Korean Tourist Spot Multi-Modal Dataset for Deep Learning Applications

Data ◽  
2019 ◽  
Vol 4 (4) ◽  
pp. 139
Author(s):  
Changhoon Jeong ◽  
Sung-Eun Jang ◽  
Sanghyuck Na ◽  
Juntae Kim

Recently, deep learning-based methods for solving multi-modal tasks such as image captioning, multi-modal classification, and cross-modal retrieval have attracted much attention. Applying deep learning to such tasks requires large amounts of training data. However, while several Korean single-modal datasets exist, Korean multi-modal datasets remain scarce. In this paper, we introduce the KTS (Korean tourist spot) dataset for Korean multi-modal deep-learning research. The KTS dataset has four modalities (image, text, hashtags, and likes) and consists of 10 classes related to Korean tourist spots. All data were extracted from Instagram and preprocessed. We performed two experiments with the dataset, image classification and image captioning, both of which produced reasonable results. We hope that many researchers will use this dataset for multi-modal deep-learning research.
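The four modalities above can be bundled into a simple per-sample record. A minimal sketch in Python, assuming hypothetical field names and paths (the actual KTS schema may differ):

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical record structure for one KTS sample; the field names and
# file layout are assumptions, not the dataset's published schema.
@dataclass
class KTSSample:
    image_path: str
    caption: str
    hashtags: List[str] = field(default_factory=list)
    likes: int = 0
    label: str = ""  # one of the 10 tourist-spot classes

def filter_by_class(samples, label):
    """Return only the samples belonging to a given tourist-spot class."""
    return [s for s in samples if s.label == label]

samples = [
    KTSSample("img/0001.jpg", "Sunset over the palace", ["#seoul"], 120, "gyeongbokgung"),
    KTSSample("img/0002.jpg", "Beach walk", ["#busan", "#sea"], 45, "haeundae"),
]
print(len(filter_by_class(samples, "haeundae")))  # 1
```

Grouping all four modalities in one record makes it straightforward to feed matched image/text/metadata triples into a multi-modal training loop.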


Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-7
Author(s):  
Apeksha Aggarwal ◽  
Vibhav Sharma ◽  
Anshul Trivedi ◽  
Mayank Yadav ◽  
Chirag Agrawal ◽  
...  

Millions of memes are created and shared every day on social media platforms. Memes are a great tool for spreading humour. However, some people use them to target an individual or a group, generating offensive content in a polite and sarcastic way. Lack of moderation of such memes spreads hatred and can lead to depression-like psychological conditions. Many successful studies have analyzed language, such as sentiment analysis, or images, such as image classification. However, most of these studies rely on only one of these components. Since meme classification cannot be solved by relying on either aspect alone, the present work identifies, addresses, and ensembles both aspects for analyzing such data. In this research, we propose a solution in which the classification depends on more than one model. This paper proposes two different approaches to the problem of identifying hateful memes. The first approach uses sentiment analysis based on image captioning and the text written on the meme. The second approach combines features from different modalities. These approaches utilize a combination of GloVe embeddings, an encoder-decoder model, and OCR, trained with the Adamax optimizer. The Facebook Hateful Memes Challenge dataset, which contains approximately 8500 meme images, is utilized. Both approaches were evaluated in Facebook's live challenge competition and produced quite acceptable results. Both approaches were also tested on the validation dataset, and the results are promising for both models.
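The second approach, combining features from different modalities, can be illustrated with a toy late-fusion step: concatenate a text embedding (e.g. averaged GloVe vectors of the OCR'd text) with an image feature vector, then score the fused vector with a logistic classifier. The dimensions and random features below are placeholders, not the paper's actual model:

```python
import numpy as np

rng = np.random.default_rng(0)

def fuse_features(text_vec, image_vec):
    """Late fusion: concatenate the two modality embeddings."""
    return np.concatenate([text_vec, image_vec])

def hate_score(fused, weights, bias):
    """Logistic score in [0, 1]; a score above 0.5 would flag the meme."""
    return 1.0 / (1.0 + np.exp(-(fused @ weights + bias)))

text_vec = rng.normal(size=50)    # stand-in for averaged GloVe vectors
image_vec = rng.normal(size=128)  # stand-in for an image-encoder output
weights = rng.normal(size=178)    # 50 + 128 fused dimensions

fused = fuse_features(text_vec, image_vec)
score = hate_score(fused, weights, 0.0)
print(fused.shape)  # (178,)
```

In a trained system the weights would come from fitting on labeled memes; the point here is only the fusion shape arithmetic.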



Author(s):  
Sumit Kaur

Abstract- Deep learning is an emerging research area in the machine learning and pattern recognition field, presented with the goal of drawing machine learning nearer to one of its original objectives: artificial intelligence. It tries to mimic the human brain, which is capable of processing and learning from complex input data and solving many kinds of complicated tasks well. Deep learning (DL) is based on a set of supervised and unsupervised algorithms that attempt to model higher-level abstractions in data and learn hierarchical representations for classification. In recent years, it has attracted much attention due to its state-of-the-art performance in diverse areas such as object perception, speech recognition, computer vision, collaborative filtering, and natural language processing. This paper presents a survey of different deep learning techniques for remote sensing image classification.
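The idea of modeling higher-level abstractions through hierarchical representations can be sketched as a stack of layers, each transforming the previous layer's output. A toy NumPy forward pass with random weights, purely illustrative and not a remote-sensing model:

```python
import numpy as np

def relu(x):
    """Non-linearity that lets stacked layers build non-trivial abstractions."""
    return np.maximum(0.0, x)

def forward(x, layers):
    """Pass the input through each (weights, bias) layer in turn; each layer's
    output is a progressively higher-level representation of the input."""
    for w, b in layers:
        x = relu(x @ w + b)
    return x

rng = np.random.default_rng(1)
layers = [
    (rng.normal(size=(8, 16)), np.zeros(16)),  # low-level features
    (rng.normal(size=(16, 4)), np.zeros(4)),   # higher-level features
]
pixels = rng.normal(size=8)  # stand-in for a flattened image patch
features = forward(pixels, layers)
print(features.shape)  # (4,)
```

Training would fit the weights by backpropagation; the sketch only shows how depth composes representations.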





Author(s):  
Zhineng Chen ◽  
Shanshan Ai ◽  
Caiyan Jia




2021 ◽  
Vol 11 (9) ◽  
pp. 3863
Author(s):  
Ali Emre Öztürk ◽  
Ergun Erçelebi

A large amount of training image data is required to solve image classification problems using deep learning (DL) networks. In this study, we aimed to train DL networks with synthetic images generated using a game engine and to determine how such networks perform on real-image classification problems. The study presents the results of using corner detection and nearest three-point selection (CDNTS) layers to classify bird and rotary-wing unmanned aerial vehicle (RW-UAV) images, provides a comprehensive comparison of two experimental setups, and highlights the significant performance improvements that the CDNTS layer brings to deep-learning-based networks. In experiment 1, commonly used deep-learning-based networks were trained with synthetic data and tested on real-image classification. In experiment 2, the CDNTS layer and commonly used deep-learning-based networks were trained with synthetic data and tested on real-image classification. In experiment 1, the best area under the curve (AUC) value for image classification test accuracy was 72%. In experiment 2, using the CDNTS layer, the best AUC value was 88.9%. A total of 432 training combinations were investigated across the experimental setups: various DL networks were trained with four different optimizers over all combinations of the batch size, learning rate, and dropout hyperparameters. The test accuracy AUC values for the networks in experiment 1 ranged from 55% to 74%, whereas those for the experiment 2 networks with a CDNTS layer ranged from 76% to 89.9%. The CDNTS layer thus has a considerable effect on the image classification accuracy of deep-learning-based networks. AUC, F-score, and test accuracy measures were used to validate the success of the networks.
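One plausible reading of the "nearest three-point selection" step is picking the three detected corner points closest to a reference point (for instance, the image center). The sketch below assumes corner coordinates have already been extracted by a corner detector and may not match the paper's exact selection criterion:

```python
import numpy as np

def nearest_three(corners, ref):
    """Select the three corner points nearest to a reference point.
    This is an illustrative reading of 'nearest three-point selection';
    the CDNTS paper's actual criterion may differ."""
    corners = np.asarray(corners, dtype=float)
    dists = np.linalg.norm(corners - np.asarray(ref, dtype=float), axis=1)
    return corners[np.argsort(dists)[:3]]

# Hypothetical corner coordinates from a detector, plus a reference point.
corners = [(0, 0), (10, 10), (5, 4), (6, 5), (50, 50)]
sel = nearest_three(corners, ref=(5, 5))
print(sel.shape)  # (3, 2)
```

Reducing each image to a small, geometrically selected set of keypoints is one way such a layer could make synthetic-to-real transfer less sensitive to texture differences.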



IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Yang Liu ◽  
Zelin Zhang ◽  
Xiang Liu ◽  
Lei Wang ◽  
Xuhui Xia


Information ◽  
2021 ◽  
Vol 12 (6) ◽  
pp. 249
Author(s):  
Xin Jin ◽  
Yuanwen Zou ◽  
Zhongbing Huang

The cell cycle is an important process in cellular life. In recent years, some image processing methods have been developed to determine the cell cycle stages of individual cells. However, in most of these methods, cells have to be segmented and their features extracted. During feature extraction, some important information may be lost, resulting in lower classification accuracy. Thus, we used a deep learning method to retain all cell features. To address the insufficient number and imbalanced distribution of original images, we used the Wasserstein generative adversarial network with gradient penalty (WGAN-GP) for data augmentation. At the same time, a residual network (ResNet), one of the most widely used deep-learning classification networks, was used for image classification. Our method classified cell cycle images more effectively, reaching an accuracy of 83.88%; compared with the 79.40% accuracy of previous experiments, this is an improvement of 4.48 percentage points. On another dataset used to verify the model, accuracy improved by 12.52 percentage points over previous results. The results showed that our new cell cycle image classification system based on WGAN-GP and ResNet is useful for the classification of imbalanced images. Moreover, our method could potentially resolve the low classification accuracy in biomedical images caused by insufficient numbers and imbalanced distributions of original images.
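The gradient penalty in WGAN-GP penalizes the critic's gradient norm at random interpolates between real and fake samples, pushing it toward 1. The sketch below uses a toy linear critic D(x) = w · x so the gradient is available in closed form; a real implementation would compute it by automatic differentiation:

```python
import numpy as np

rng = np.random.default_rng(0)

def gradient_penalty(real, fake, w, lam=10.0):
    """WGAN-GP term: lam * mean((||grad_x D(x_hat)|| - 1)^2) over random
    interpolates x_hat. For the toy linear critic D(x) = w @ x used here,
    grad_x D is simply w at every point, so the norm is constant; with a
    neural critic, each interpolate would get its own gradient."""
    eps = rng.uniform(size=(real.shape[0], 1))
    x_hat = eps * real + (1.0 - eps) * fake   # random interpolates (unused
                                              # by the linear critic's grad)
    grad_norms = np.full(real.shape[0], np.linalg.norm(w))
    return lam * np.mean((grad_norms - 1.0) ** 2)

real = rng.normal(size=(4, 8))  # stand-in for real cell images (flattened)
fake = rng.normal(size=(4, 8))  # stand-in for generator output
w = rng.normal(size=8)
gp = gradient_penalty(real, fake, w)
print(gp >= 0.0)  # True
```

This penalty is added to the critic loss during training; it is what distinguishes WGAN-GP from weight-clipped WGAN and stabilizes the augmentation GAN described above.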


