Visualization-Based Active Learning for Video Annotation

Active learning has been shown to be an effective way to reduce human labeling effort in multimedia annotation tasks. However, most existing active learning methods for video annotation are studied in a relatively simple setting, in which concepts are annotated sequentially with fixed effort and only a single modality is used. In practice, multiple modalities usually have to be handled, and annotating concepts sequentially without preference does not allocate annotation effort well. To address these two issues, this paper proposes a multi-concept multi-modality active learning method for video annotation, in which multiple concepts and multiple modalities are considered simultaneously. In each round of active learning, the method selects the concept expected to yield the highest performance gain, together with a batch of suitable samples to be annotated for that concept. Graph-based semi-supervised learning is then conducted on each modality for the selected concept. The proposed method makes fuller use of human effort by considering both the learnability of different concepts and the potential of different modalities. Experimental results on the TRECVID 2005 benchmark demonstrate its effectiveness and efficiency.
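
The per-round selection described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the expected performance gain is estimated, how samples are ranked, or how modalities are combined, so the sketch assumes the gain estimates are supplied by the caller, ranks samples by closeness of their score to 0.5, and averages per-modality scores. All function names are hypothetical.

```python
from statistics import mean


def select_concept(expected_gain):
    """Return the concept with the highest expected performance gain.

    expected_gain: {concept: estimated gain}. How the estimate is obtained
    is an assumption left to the caller.
    """
    return max(expected_gain, key=expected_gain.get)


def fuse_modalities(per_modality_scores):
    """Average each sample's score over modalities (assumed fusion rule).

    per_modality_scores: {modality: {sample_id: score in [0, 1]}}
    """
    sample_ids = next(iter(per_modality_scores.values())).keys()
    return {
        sid: mean(scores[sid] for scores in per_modality_scores.values())
        for sid in sample_ids
    }


def select_batch(fused_scores, batch_size):
    """Pick the samples whose fused score is closest to 0.5 (assumed criterion)."""
    ranked = sorted(fused_scores, key=lambda sid: abs(fused_scores[sid] - 0.5))
    return ranked[:batch_size]


def active_learning_round(expected_gain, scores, batch_size):
    """One round: choose a concept, then a batch of samples for that concept.

    scores: {concept: {modality: {sample_id: score}}}
    """
    concept = select_concept(expected_gain)
    fused = fuse_modalities(scores[concept])
    return concept, select_batch(fused, batch_size)
```

In a full loop, the returned batch would be labeled by the annotator, the per-modality learners for the chosen concept would be retrained, and the gain estimates would be refreshed before the next round.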

Multi-Concept Multi-Modality Active Learning for Interactive Video Annotation

International Conference on Semantic Computing (ICSC 2007), 2007. DOI: 10.1109/icosc.2007.4338365

Author(s): Meng Wang, Xian-Sheng Hua, Yan Song, Jinhui Tang, Li-Rong Dai

Keyword(s): Active Learning, Interactive Video, Video Annotation

Video Annotation by Active Learning and Cluster Tuning

2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06), 2006. DOI: 10.1109/cvprw.2006.211

Cited By ~ 2

Author(s): Guo-jun Qi, Yan Song, Xian-Sheng Hua, Hong-Jiang Zhang, Li-Rong Dai

Keyword(s): Active Learning, Video Annotation

Active Video Annotation

Semantic Mining Technologies for Multimedia Databases, 2011, pp. 298-322. DOI: 10.4018/978-1-60566-188-9.ch013

Author(s): Meng Wang, Xian-Sheng Hua, Jinhui Tang, Guo-Jun Qi

Keyword(s): Support Vector Machine, Active Learning, Sample Selection, Training Data, Video Annotation, Support Vector, Learning Approaches, Selection Strategies, Learning Scheme, Active Video

This chapter introduces the application of active learning to video annotation. Insufficient training data is a major obstacle in learning-based video annotation, and active learning is a promising way to deal with this difficulty. It iteratively annotates a selected set of the most informative samples, so that the resulting training set is more effective than one gathered randomly. The authors present a brief review of typical active learning approaches. They categorize the sample selection strategies in these methods under five criteria: risk reduction, uncertainty, positivity, density, and diversity. In particular, they introduce the Support Vector Machine (SVM)-based active learning scheme, which has been widely applied. They then analyze the deficiencies of existing active learning methods for video annotation: in most of these methods, the to-be-annotated concepts are treated equally without preference, and only one modality is used. To address these two issues, the authors introduce a multi-concept multi-modality active learning scheme. This scheme makes better use of human labeling effort by considering both the learnability of different concepts and the potential of different modalities.
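
The uncertainty criterion mentioned above is often illustrated, for SVM-based active learning, by selecting the unlabeled samples closest to the decision boundary. The following is a minimal sketch under that assumption, not the chapter's own scheme: it assumes a linear SVM whose weights `w` and bias `b` have already been trained elsewhere, and the function names are hypothetical.

```python
def decision_value(w, b, x):
    """Linear SVM decision function f(x) = w . x + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def most_uncertain(w, b, unlabeled, batch_size):
    """Return the ids of the samples with the smallest |f(x)|.

    unlabeled: {sample_id: feature vector}. A small |f(x)| means the sample
    lies near the separating hyperplane, which is taken here as the measure
    of uncertainty.
    """
    margins = {sid: abs(decision_value(w, b, x)) for sid, x in unlabeled.items()}
    return sorted(margins, key=margins.get)[:batch_size]
```

The selected samples would then be labeled and added to the training set before the classifier is retrained. The other criteria listed in the abstract (risk reduction, positivity, density, diversity) would need different scoring functions.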
