Creating Sound Glyph Database for Video Subtitling

Author(s):  
Chitralekha Ganapati Bhat ◽  
Sunil Kumar Kopparapu

Accessibility of speech information in videos is a huge challenge for the hearing impaired, making a visual representation such as text subtitling essential. Unavailability of a good Automatic Speech Recognition (ASR) engine, makes automatic generation of text subtitles for resource deficient languages such as Indian languages, extremely difficult. Techniques to build such an ASR using audio and corresponding transcription in the form of broadcast news or audio books have been proposed; however, these techniques require transcriptions corresponding to the audio in editable text format, which are unavailable for resource deficient languages. In this chapter, a novel technique of building a sound-glyph database for a resource deficient language has been described. The sound-glyph database can be used effectively to subtitle videos in the same language script. Considering large volumes of data that need to be processed, we propose a parallel processing method in a multiresolution setup, harnessing the multi-core capacity of present day computers.

2019 ◽  
Vol 53 (5) ◽  
pp. 3673-3704
Author(s):  
Amitoj Singh ◽  
Virender Kadyan ◽  
Munish Kumar ◽  
Nancy Bassan

2018 ◽  
Author(s):  
Brij Mohan Lal Srivastava ◽  
Sunayana Sitaram ◽  
Rupesh Kumar Mehta ◽  
Krishna Doss Mohan ◽  
Pallavi Matani ◽  
...  

Author(s):  
Peter A. Heeman ◽  
Rebecca Lunsford ◽  
Andy McMillin ◽  
J. Scott Yaruss

Sign in / Sign up

Export Citation Format

Share Document