Cross-Modal Learning for Audio-Visual Video Parsing

Mapping Intimacies ◽

10.21437/interspeech.2021-2135 ◽

2021 ◽

Author(s):

Jatin Lamba ◽

- Abhishek ◽

Jayaprakash Akula ◽

Rishabh Dabral ◽

Preethi Jyothi ◽

...

Keyword(s):

Download Full-text

Video Parsing, Retrieval and Browsing: An Integrated and Content-Based Solution

Readings in Multimedia Computing and Networking ◽

10.1016/b978-155860651-7/50116-9 ◽

2002 ◽

pp. 350-359

Author(s):

H.J. Zhang ◽

C.Y. Low ◽

S.W. Smoliar ◽

J.H. Wu

Keyword(s):

Download Full-text

An automatic news video parsing, indexing and browsing system

Proceedings of the fourth ACM international conference on Multimedia - MULTIMEDIA '96 ◽

10.1145/244130.244453 ◽

1996 ◽

Author(s):

Chien Yong Low ◽

Qi Tian ◽

Hongjiang Zhang

Keyword(s):

Video Parsing ◽

Download Full-text

Surveillance Video Parsing with Single Frame Supervision

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr.2017.114 ◽

2017 ◽

Author(s):

Si Liu ◽

Changhu Wang ◽

Ruihe Qian ◽

Han Yu ◽

Renda Bao ◽

...

Keyword(s):

Surveillance Video ◽

Single Frame ◽

Download Full-text

Audiovisual integration with Segment Models for tennis video parsing

Computer Vision and Image Understanding ◽

10.1016/j.cviu.2007.09.002 ◽

2008 ◽

Vol 111 (2) ◽

pp. 142-154 ◽

Author(s):

Manolis Delakis ◽

Guillaume Gravier ◽

Patrick Gros

Keyword(s):

Audiovisual Integration ◽

Download Full-text

Video parsing and camera pose estimation for 2D to 3D video conversion

10.5353/th_b5699957 ◽

2015 ◽

Author(s):

Tianrui Liu

Keyword(s):

Pose Estimation ◽

3D Video ◽

Camera Pose Estimation ◽

Camera Pose ◽

Video Parsing ◽

Download Full-text

TVParser: An automatic TV video parsing method

CVPR 2011 ◽

10.1109/cvpr.2011.5995681 ◽

2011 ◽

Author(s):

Chao Liang ◽

Changsheng Xu ◽

Jian Cheng ◽

Hanqing Lu

Keyword(s):

Download Full-text

Automatic video parsing using shot boundary detection and camera operation analysis

Pattern Recognition ◽

10.1016/s0031-3203(00)00007-8 ◽

2001 ◽

Vol 34 (3) ◽

pp. 711-719 ◽

Author(s):

Mee-Sook Lee ◽

Yun-Mo Yang ◽

Seong-Whan Lee

Keyword(s):

Boundary Detection ◽

Shot Boundary Detection ◽

Operation Analysis ◽

Shot Boundary ◽

Download Full-text

Content-based video parsing and indexing based on audio-visual interaction

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/76.915358 ◽

2001 ◽

Vol 11 (4) ◽

pp. 522-535 ◽

Author(s):

S. Tsekeridou ◽

I. Pitas

Keyword(s):

Visual Interaction ◽

Download Full-text

Compressed-domain video parsing using energy histograms of the lower-frequency DCT coefficients

10.1117/12.373561 ◽

1999 ◽

Author(s):

Oliver K. Bao ◽

Jose A. Lay ◽

Ling Guan

Keyword(s):

Compressed Domain ◽

Video Parsing ◽

Dct Coefficients

Download Full-text

Video Summarization by Redundancy Removing and Content Ranking

Computer Vision for Multimedia Applications ◽

10.4018/978-1-60960-024-2.ch006 ◽

2011 ◽

pp. 91-101

Author(s):

Tao Wang ◽

Yue Gao ◽

Patricia Wang ◽

Wei Hu ◽

Jianguo Li ◽

...

Keyword(s):

Video Summarization ◽

Time Constraint ◽

Key Frame ◽

Video Summary ◽

Video Parsing ◽

Content Ranking

Video summary is very important for users to grasp a whole video’s content quickly for efficient browsing and editing. In this chapter, we propose a novel video summarization approach based on redundancy removing and content ranking. Firstly, by video parsing and cast indexing, the approach constructs a story board to let user know about the main scenes and the main actors in the video. Then it removes redundant frames to generate a “story-constraint summary” by key frame clustering and repetitive segment detection. To shorten the video summary length to a target length, “time-constraint summary” is constructed by important factor based content ranking. Extensive experiments are carried out on TV series, movies, and cartoons. Good results demonstrate the effectiveness of the proposed method.

Download Full-text