A Genetic Algorithm for Efficient Video Content Representation

In this paper, we present a novel scheme on video content representation by exploring the spatio-temporal information. A pseudo-object-based shot representation containing more semantics is proposed to measure shot similarity and force competition approach is proposed to group shots into scene based on content coherences between shots. Two content descriptors, color objects: Dominant Color Histograms (DCH) and Spatial Structure Histograms (SSH), are introduced. To represent temporal content variations, a shot can be segmented into several subshots that are of coherent content, and shot similarity measure is formulated as subshot similarity measure that serves to shot retrieval. With this shot representation, scene structure can be extracted by analyzing the splitting and merging force competitions at each shot boundary. Experimental results on real-world sports video prove that our proposed approach for video shot retrievals achieve the best performance on the average recall (AR) and average normalized modified retrieval rank (ANMRR), and Experiment on MPEG-7 test videos achieves promising results by the proposed scene extraction algorithm.

Download Full-text

Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/98 ◽

2018 ◽

Cited By ~ 10

Author(s):

Hehe Fan ◽

Zhongwen Xu ◽

Linchao Zhu ◽

Chenggang Yan ◽

Jianjun Ge ◽

...

Keyword(s):

Large Scale ◽

Sampling Rate ◽

Computational Cost ◽

Confidence Score ◽

Video Content ◽

Video Classification ◽

Video Frames ◽

Similar Accuracy ◽

Efficient Video

We aim to significantly reduce the computational cost for classification of temporally untrimmed videos while retaining similar accuracy. Existing video classification methods sample frames with a predefined frequency over entire video. Differently, we propose an end-to-end deep reinforcement approach which enables an agent to classify videos by watching a very small portion of frames like what we do. We make two main contributions. First, information is not equally distributed in video frames along time. An agent needs to watch more carefully when a clip is informative and skip the frames if they are redundant or irrelevant. The proposed approach enables the agent to adapt sampling rate to video content and skip most of the frames without the loss of information. Second, in order to have a confident decision, the number of frames that should be watched by an agent varies greatly from one video to another. We incorporate an adaptive stop network to measure confidence score and generate timely trigger to stop the agent watching videos, which improves efficiency without loss of accuracy. Our approach reduces the computational cost significantly for the large-scale YouTube-8M dataset, while the accuracy remains the same.

Download Full-text

A fuzzy video content representation for video summarization and content-based retrieval

Signal Processing ◽

10.1016/s0165-1684(00)00019-0 ◽

2000 ◽

Vol 80 (6) ◽

pp. 1049-1067 ◽

Cited By ~ 90

Author(s):

Anastasios D. Doulamis ◽

Nikolaos D. Doulamis ◽

Stefanos D. Kollias

Keyword(s):

Video Summarization ◽

Video Content ◽

Content Based Retrieval ◽

Content Representation

Download Full-text

A hybrid approach for image/video content representation and identification

2012 7th IEEE Conference on Industrial Electronics and Applications (ICIEA) ◽

10.1109/iciea.2012.6360863 ◽

2012 ◽

Cited By ~ 3

Author(s):

Lekha Chaisorn ◽

Zixiang Fu

Keyword(s):

Hybrid Approach ◽

Video Content ◽

Content Representation

Download Full-text

A New Method for Key Frame Based Video Content Representation

Series on Software Engineering and Knowledge Engineering - Image Databases and Multi-Media Search ◽

10.1142/9789812797988_0009 ◽

1998 ◽

pp. 97-107 ◽

Cited By ~ 14

Author(s):

Alan Hanjalic ◽

Reginald L. Lagendijk ◽

Jan Biemond

Keyword(s):

New Method ◽

Video Content ◽

Key Frame ◽

Content Representation

Download Full-text

Video content representation on tiny devices

2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763) ◽

10.1109/icme.2004.1394583 ◽

2005 ◽

Cited By ~ 3

Author(s):

Jun Wang ◽

M.J.T. Reinders ◽

R.L. Lagendijk ◽

J. Lindenberg ◽

M.S. Kankanhalli

Keyword(s):

Video Content ◽

Content Representation

Download Full-text

Erratum to: “A fuzzy video content representation for video summarization and content-based retrieval” [Signal Processing 80(6) (2000) 1049–1067]

Signal Processing ◽

10.1016/s0165-1684(01)00165-7 ◽

2002 ◽

Vol 82 (4) ◽

pp. 545

Author(s):

Anastasios D Doulamis ◽

Nikolaos D Doulamis ◽

Stefanos D Kollias

Keyword(s):

Signal Processing ◽

Video Summarization ◽

Video Content ◽

Content Based Retrieval ◽

Content Representation

Download Full-text

An Interactive Device for Quick Arabic News Story Browsing

International Journal of Mobile Computing and Multimedia Communications ◽

10.4018/jmcmc.2012100104 ◽

2012 ◽

Vol 4 (4) ◽

pp. 62-82

Author(s):

Hichem Karray ◽

Monji Kherallah ◽

Mohamed Ben Halima ◽

Adel M. Alimi

Keyword(s):

Genetic Algorithm ◽

Recognition System ◽

Text Recognition ◽

News Story ◽

Video Content ◽

Visual Coding ◽

Multimodal Analysis ◽

On Line ◽

Video Summaries ◽

Textual Content

The authors propose a framework for multimodal analysis of Arabic news broadcast which helps users of pervasive devices to browsing quickly into news archive; their solution integrating many aspects such as summarizing, indexing textual content and on on-line recognition of the handwriting. Firstly, the summarizing process is to accelerate the video content browsing based on genetic algorithm. Secondly, the indexing process, which operates on video summaries based on text recognition. Finally users communicate by writing keywords on PDA screen and keep only summaries speaking about this topic. This PDA contains an on line recognition system of Arabic of handwritten based on visual coding and genetic algorithm.

Download Full-text

Parameter Based Multi-Objective Optimization of Video CODECs

Applied Signal and Image Processing ◽

10.4018/978-1-60960-477-6.ch017 ◽

2011 ◽

pp. 288-308

Author(s):

F. Al-Abri ◽

E.A. Edirisinghe ◽

C. Grecos

Keyword(s):

Genetic Algorithm ◽

Mathematical Formulation ◽

Video Codec ◽

Video Content ◽

Multi Objective Optimization ◽

Nsga Ii ◽

Video Codecs ◽

Multi Objective ◽

On Demand

This chapter presents a generalised framework for multi-objective optimisation of video CODECs for use in off-line, on-demand applications. In particular, an optimization scheme is proposed to determine the optimum coding parameters for a H.264 AVC video codec in a memory and bandwidth constrained environment, which minimises codec complexity and video distortion. The encoding/decoding parameters that have a significant impact on the performance of the codec are initially obtained through experimental analysis. A mathematical formulation by means of regression is subsequently used to associate these parameters with the relevant objectives and define a Multi-Objective Optimization (MOO) problem. Solutions to the optimization problem are reached through a Non-dominated Sorting Genetic Algorithm (NSGA-II). It is shown that the proposed framework is flexible on the number of objectives that can jointly be optimized. Furthermore, any of the objectives can be included as constraints depending on the requirements of the services to be supported. Practical use of the proposed framework is described using a case study that involves video content transmission to a mobile hand.

Download Full-text

A Genetic Algorithm for Efficient Video Content Representation

Efficient video summarization based on a fuzzy video content representation

VIDEO CONTENT REPRESENTATION FOR SHOT RETRIEVAL AND SCENE EXTRACTION

Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification

A fuzzy video content representation for video summarization and content-based retrieval

A hybrid approach for image/video content representation and identification

A New Method for Key Frame Based Video Content Representation

Video content representation on tiny devices

Erratum to: “A fuzzy video content representation for video summarization and content-based retrieval” [Signal Processing 80(6) (2000) 1049–1067]

An Interactive Device for Quick Arabic News Story Browsing

Parameter Based Multi-Objective Optimization of Video CODECs

Export Citation Format