scholarly journals A localization strategy combined with transfer learning for image annotation

PLoS ONE ◽  
2021 ◽  
Vol 16 (12) ◽  
pp. e0260758
Author(s):  
Zhiqiang Chen ◽  
Leelavathi Rajamanickam ◽  
Jianfang Cao ◽  
Aidi Zhao ◽  
Xiaohui Hu

This study aims to solve the overfitting problem caused by insufficient labeled images in the automatic image annotation field. We propose a transfer learning model called CNN-2L that incorporates the label localization strategy described in this study. The model consists of an InceptionV3 network pretrained on the ImageNet dataset and a label localization algorithm. First, the pretrained InceptionV3 network extracts features from the target dataset that are used to train a specific classifier and fine-tune the entire network to obtain an optimal model. Then, the obtained model is used to derive the probabilities of the predicted labels. For this purpose, we introduce a squeeze and excitation (SE) module into the network architecture that augments the useful feature information, inhibits useless feature information, and conducts feature reweighting. Next, we perform label localization to obtain the label probabilities and determine the final label set for each image. During this process, the number of labels must be determined. The optimal K value is obtained experimentally and used to determine the number of predicted labels, thereby solving the empty label set problem that occurs when the predicted label values of images are below a fixed threshold. Experiments on the Corel5k multilabel image dataset verify that CNN-2L improves the labeling precision by 18% and 15% compared with the traditional multiple-Bernoulli relevance model (MBRM) and joint equal contribution (JEC) algorithms, respectively, and it improves the recall by 6% compared with JEC. Additionally, it improves the precision by 20% and 11% compared with the deep learning methods Weight-KNN and adaptive hypergraph learning (AHL), respectively. Although CNN-2L fails to improve the recall compared with the semantic extension model (SEM), it improves the comprehensive index of the F1 value by 1%. The experimental results reveal that the proposed transfer learning model based on a label localization strategy is effective for automatic image annotation and substantially boosts the multilabel image annotation performance.

2021 ◽  
Vol 10 (3) ◽  
pp. 137
Author(s):  
Youngok Kang ◽  
Nahye Cho ◽  
Jiyoung Yoon ◽  
Soyeon Park ◽  
Jiyeon Kim

Recently, as computer vision and image processing technologies have rapidly advanced in the artificial intelligence (AI) field, deep learning technologies have been applied in the field of urban and regional study through transfer learning. In the tourism field, studies are emerging to analyze the tourists’ urban image by identifying the visual content of photos. However, previous studies have limitations in properly reflecting unique landscape, cultural characteristics, and traditional elements of the region that are prominent in tourism. With the purpose of going beyond these limitations of previous studies, we crawled 168,216 Flickr photos, created 75 scenes and 13 categories as a tourist’ photo classification by analyzing the characteristics of photos posted by tourists and developed a deep learning model by continuously re-training the Inception-v3 model. The final model shows high accuracy of 85.77% for the Top 1 and 95.69% for the Top 5. The final model was applied to the entire dataset to analyze the regions of attraction and the tourists’ urban image in Seoul. We found that tourists feel attracted to Seoul where the modern features such as skyscrapers and uniquely designed architectures and traditional features such as palaces and cultural elements are mixed together in the city. This work demonstrates a tourist photo classification suitable for local characteristics and the process of re-training a deep learning model to effectively classify a large volume of tourists’ photos.


2012 ◽  
Vol 39 (12) ◽  
pp. 11011-11021 ◽  
Author(s):  
Hugo Jair Escalante ◽  
Manuel Montes ◽  
L. Enrique Sucar

IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Myasar Mundher Adnan ◽  
Mohd Shafry Mohd Rahim ◽  
Amjad Rehman ◽  
Zahid Mehmood ◽  
Tanzila Saba ◽  
...  

2011 ◽  
Vol 4 (2) ◽  
pp. 88
Author(s):  
Peter Baggetta

The Teaching Games for Understanding (TGfU) model was first developed by Bunker and Thorpe in 1982 as a model for coaches to help players become more skillful players. Since then other versions of the model have been developed such as the tactical decision-learning model (Grehaigne, Godbout, & Bouthier, 2001) in France and the game–sense approach (Australian Sports Commission, 1991) in Australia and New Zealand. The key aspect of all the models is the design of well-structured conditioned and modified games that require players to make decisions to develop their game understanding and tactical awareness. However, both novice and experienced coaches often struggle with connecting theory to practice especially in the area of creating and developing contextualized games that actually transfer learning from training to performance in games. In order to effectively create and use games that transfer learning, coaches can use a Principles-Based approach to develop games. The Principles-Based approach removes the dichotomy of traditional drills versus games and instead combines the drills approach with a games-context approach that links principles to skills that allow for increased individual and team expertise development. This presentation will first describe a model for developing and connecting principles, policies, tactics and skills for team play. Following this the presentation will then describe how to use the principles to create contextualized games that connect practices with performance and progresses novice players toward becoming more competent performers.


Sign in / Sign up

Export Citation Format

Share Document