Spatio-temporal salient feature extraction for perceptual content based video retrieval

Author(s):  
Sameh Megrhi ◽  
Wided Souidene ◽  
Azeddine Beghdadi

Videos are recorded and uploaded daily to sites such as YouTube and Facebook from devices such as mobile phones and digital cameras, often with little or no associated metadata (semantic tags). This makes it extremely difficult to retrieve similar videos from metadata alone, without content-based semantic search. Content-based video retrieval is the problem of retrieving the videos most similar to a given query video, and it has a wide range of applications, such as video browsing, content filtering, and video indexing. Traditional video-level features are built from hand-engineered key-frame-level features, which do not exploit the rich dynamics present in a video. In this paper we propose a fast content-based video retrieval framework using compact spatio-temporal features learned by deep learning. Specifically, a deep CNN combined with an LSTM is deployed to learn spatio-temporal representations of a video. For fast retrieval, binary codes are generated by a hash-learning component of the framework. For fast and effective learning of the hash codes, the proposed framework is trained in two stages: the first stage learns the video dynamics, and the second stage learns a compact code from the temporal variations learned in the first stage. The proposed method is evaluated on the UCF101 dataset and compared with other hashing methods. The results show that our approach improves on the performance of existing methods.
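The retrieval stage described above can be sketched in miniature. The abstract does not give the learned hashing function, so this sketch substitutes random-hyperplane hashing (an LSH-style stand-in) to binarize feature vectors that stand in for the CNN+LSTM spatio-temporal descriptors; all names and dimensions are illustrative assumptions, not the paper's method.

```python
import random

random.seed(0)

def random_hyperplanes(dim, n_bits):
    # LSH-style random projections: an illustrative stand-in for the
    # learned hashing component described in the paper (assumption).
    return [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_bits)]

def hash_code(feature, planes):
    # Binarize a real-valued descriptor: one bit per hyperplane,
    # set by the sign of the projection.
    return tuple(1 if sum(f * w for f, w in zip(feature, p)) >= 0 else 0
                 for p in planes)

def hamming(a, b):
    # Hamming distance between two binary codes.
    return sum(x != y for x, y in zip(a, b))

# Toy "video descriptors" standing in for CNN+LSTM outputs (assumption).
dim, n_bits = 16, 8
planes = random_hyperplanes(dim, n_bits)
db = {name: [random.gauss(0, 1) for _ in range(dim)]
      for name in ["v1", "v2", "v3"]}
codes = {name: hash_code(feat, planes) for name, feat in db.items()}

# Query: a slightly perturbed copy of v1; its binary code should be
# nearest to v1's in Hamming space, giving a fast bitwise lookup.
query = [f + random.gauss(0, 0.05) for f in db["v1"]]
qcode = hash_code(query, planes)
best = min(codes, key=lambda name: hamming(qcode, codes[name]))
```

Matching in Hamming space is what makes retrieval fast here: comparing short binary codes is far cheaper than comparing the full real-valued spatio-temporal descriptors.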


Technological advances have brought about a revolution in multimedia, and video recording has grown by leaps and bounds. Retrieving a video from a huge database with existing text-based search is cumbersome: it demands considerable human effort, and its retrieval efficiency is poor. In view of these challenges, retrieval based on video content prevails over conventional methods. Content here means the actual video information, i.e. video features. The performance of Content-Based Video Retrieval (CBVR) depends on feature extraction and the matching of similar features. Because feature selection in existing algorithms is not effective, retrieval takes longer and efficiency suffers. Combined color and motion features are proposed for feature extraction, and a Spatio-Temporal Scale-Invariant Feature Transform is used for shot boundary detection. Since the color feature characterizes the visual video content and the motion feature the temporal content, these two features are significant for effective video retrieval. The performance of the CBVR system has been evaluated on the TRECVID dataset, and the retrieved videos demonstrate the effectiveness of the proposed algorithm.
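The color-plus-motion descriptor described above can be illustrated with a minimal sketch: a coarse per-channel color histogram (visual content) concatenated with mean frame-difference energy (temporal content). This is a simplified stand-in under assumed bin counts and toy frames, not the paper's exact feature design.

```python
import math

def color_histogram(frame, bins=4):
    # Coarse per-channel color histogram: the visual-content feature
    # (simplified stand-in for the paper's color descriptor).
    hist = [0.0] * (bins * 3)
    n = len(frame)
    for (r, g, b) in frame:
        for c, v in enumerate((r, g, b)):
            hist[c * bins + min(v * bins // 256, bins - 1)] += 1.0 / n
    return hist

def motion_energy(prev, curr):
    # Mean absolute intensity difference between consecutive frames:
    # the temporal-content feature (stand-in for the motion descriptor).
    return sum(abs(sum(p) - sum(q)) / 3 for p, q in zip(prev, curr)) / len(curr)

def video_descriptor(frames):
    # Average color histogram concatenated with average motion energy.
    hists = [color_histogram(f) for f in frames]
    avg_hist = [sum(col) / len(hists) for col in zip(*hists)]
    motions = [motion_energy(a, b) for a, b in zip(frames, frames[1:])]
    return avg_hist + [sum(motions) / len(motions)]

def distance(d1, d2):
    # Euclidean distance used for similar-feature matching.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(d1, d2)))

# Toy 4-pixel "frames": a static red clip vs. a red clip with a
# bright pixel that moves one position per frame.
static = [[(200, 0, 0)] * 4] * 3
moving = [[(200, 0, 0)] * i + [(255, 255, 255)] + [(200, 0, 0)] * (3 - i)
          for i in range(3)]
d_static = video_descriptor(static)
d_moving = video_descriptor(moving)
```

The two clips share nearly identical color statistics, so the motion component is what separates them; this is why combining both features retrieves more relevant videos than color alone.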

