Generating Natural Video Descriptions via Multimodal Processing

Author(s): Qin Jin, Junwei Liang, Xiaozhu Lin

2015, Vol 95, pp. 107-117
Author(s): R.A. Otte, F.C.L. Donkers, M.A.K.A. Braeken, B.R.H. Van den Bergh

Target, 2020, Vol 32 (1), pp. 37-58
Author(s): Agnieszka Chmiel, Przemysław Janikowski, Agnieszka Lijewska

Abstract: The present study focuses on (in)congruence of input between the visual and the auditory modality in simultaneous interpreting with text. We asked twenty-four professional conference interpreters to simultaneously interpret an aurally and visually presented text with controlled incongruences in three categories (numbers, names and control words), while measuring interpreting accuracy and eye movements. The results provide evidence for the dominance of the visual modality, which goes against the professional standard of following the auditory modality in the case of incongruence. Numbers enjoyed the greatest accuracy across conditions, possibly due to simple cross-language semantic mappings. We found no evidence for a facilitation effect for congruent items, and identified an impeding effect of the presence of the visual text for incongruent items. These results might be interpreted either as evidence for the Colavita effect (in which visual stimuli take precedence over auditory ones) or as strategic behaviour applied by professional interpreters to avoid risk.
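The 2 (congruence) × 3 (item category) accuracy comparison described in this abstract can be made concrete with a small aggregation sketch. The trial records, field names, and values below are hypothetical illustrations, not the study's materials or results:

```python
# Minimal sketch of a 2 (congruence) x 3 (item category) accuracy analysis,
# in the spirit of the design above. All data here are illustrative.
from collections import defaultdict

# One record per scored item per interpreter: item category, whether the
# auditory and visual versions agreed, and whether the rendition was accurate.
trials = [
    {"category": "number", "congruent": False, "accurate": True},
    {"category": "name", "congruent": True, "accurate": True},
    {"category": "control", "congruent": False, "accurate": False},
    # ... further hypothetical records
]

def accuracy_by_cell(trials):
    """Mean accuracy for each (category, congruence) cell of the design."""
    counts = defaultdict(lambda: [0, 0])  # cell -> [accurate count, total]
    for t in trials:
        cell = (t["category"], t["congruent"])
        counts[cell][0] += t["accurate"]
        counts[cell][1] += 1
    return {cell: hits / n for cell, (hits, n) in counts.items()}

for (category, congruent), acc in sorted(accuracy_by_cell(trials).items()):
    label = "congruent" if congruent else "incongruent"
    print(f"{category:8s} {label:12s} accuracy = {acc:.2f}")
```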


Author(s): Tianyun Li, Bicheng Fan

This study sets out to describe simultaneous interpreters' attention-sharing initiatives when exposed to input from both a videotaped speech recording and real-time transcriptions. Dividing mental effort across sources of visual input accords with the human brain's statistical optimization principle, whereby the same property of an object is presented in diverse ways. To examine professional interpreters' initiatives, the authors invited five professional English-Chinese conference interpreters to simultaneously interpret a videotaped speech accompanied by real-time captions generated by a speech recognition engine, while monitoring their eye movements. The results indicate that the professional interpreters preferred to refer to the visually presented captions along with the speaker's facial expressions, and that low-frequency words, proper names, and numbers received greater attention than higher-frequency words. This phenomenon might be explained by working memory theory, in which the central executive enables redundancy gains retrieved from dual-channel information.
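The reported pattern (more attention to numbers, proper names, and low-frequency words in the captions) implies bucketing caption tokens by category and aggregating gaze time per bucket. Below is a hedged sketch of that step; the frequency norms, cutoff, heuristics, and fixation records are hypothetical, not the authors' actual pipeline:

```python
# Illustrative sketch: categorize caption tokens and compare mean fixation
# durations per category. All names and numbers here are assumptions.
import re

# Hypothetical frequency norms (tokens per million); a real analysis would
# use a corpus-derived frequency list.
FREQ_PER_MILLION = {"the": 50000.0, "delegation": 12.0, "Geneva": 3.0}
LOW_FREQ_CUTOFF = 20.0  # illustrative threshold, tokens per million

def categorize(token: str) -> str:
    """Bucket a caption token the way the reported effects are grouped."""
    if re.fullmatch(r"\d[\d.,%]*", token):
        return "number"
    if token[:1].isupper():  # crude proper-name heuristic
        return "proper name"
    if FREQ_PER_MILLION.get(token.lower(), 0.0) < LOW_FREQ_CUTOFF:
        return "low-frequency word"
    return "high-frequency word"

# Hypothetical gaze data: (caption token, total fixation duration in ms).
fixations = [("the", 80), ("delegation", 310), ("Geneva", 420), ("37", 390)]

totals: dict[str, list[int]] = {}
for token, duration in fixations:
    totals.setdefault(categorize(token), []).append(duration)

for category, durations in totals.items():
    mean_ms = sum(durations) / len(durations)
    print(f"{category:20s} mean fixation = {mean_ms:.0f} ms")
```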


Infancy, 2009, Vol 14 (5), pp. 563-578
Author(s): Faraz Farzin, Eric P. Charles, Susan M. Rivera

Neurocase, 2013, Vol 19 (3), pp. 302-312
Author(s): Monique Plaza, Laurent Capelle, Géraldine Maigret, Laurence Chaby
