scholarly journals Cross-Modal Learning for Audio-Visual Video Parsing

Author(s):  
Jatin Lamba ◽  
- Abhishek ◽  
Jayaprakash Akula ◽  
Rishabh Dabral ◽  
Preethi Jyothi ◽  
...  
Keyword(s):  
Author(s):  
Si Liu ◽  
Changhu Wang ◽  
Ruihe Qian ◽  
Han Yu ◽  
Renda Bao ◽  
...  

2008 ◽  
Vol 111 (2) ◽  
pp. 142-154 ◽  
Author(s):  
Manolis Delakis ◽  
Guillaume Gravier ◽  
Patrick Gros

Author(s):  
Chao Liang ◽  
Changsheng Xu ◽  
Jian Cheng ◽  
Hanqing Lu
Keyword(s):  

Author(s):  
Tao Wang ◽  
Yue Gao ◽  
Patricia Wang ◽  
Wei Hu ◽  
Jianguo Li ◽  
...  

Video summary is very important for users to grasp a whole video’s content quickly for efficient browsing and editing. In this chapter, we propose a novel video summarization approach based on redundancy removing and content ranking. Firstly, by video parsing and cast indexing, the approach constructs a story board to let user know about the main scenes and the main actors in the video. Then it removes redundant frames to generate a “story-constraint summary” by key frame clustering and repetitive segment detection. To shorten the video summary length to a target length, “time-constraint summary” is constructed by important factor based content ranking. Extensive experiments are carried out on TV series, movies, and cartoons. Good results demonstrate the effectiveness of the proposed method.


Sign in / Sign up

Export Citation Format

Share Document