Distortion Reduction via CAE and DenseNet Mixture Network for Low Bitrate Spatial Audio Object Coding

IEEE Multimedia ◽

10.1109/mmul.2022.3142752 ◽

2022 ◽

pp. 1-1

Author(s):

Yulin Wu ◽

Ruimin Hu ◽

Xiaochen Wang ◽

Chenhao Hu ◽

Shanfa Ke

Keyword(s):

Spatial Audio ◽

Object Coding ◽

Distortion Reduction


Multi-step Coding Structure of Spatial Audio Object Coding

MultiMedia Modeling - Lecture Notes in Computer Science ◽

10.1007/978-3-030-37731-1_54 ◽

2019 ◽

pp. 666-678

Author(s):

Chenhao Hu ◽

Ruimin Hu ◽

Xiaochen Wang ◽

Tingzhao Wu ◽

Dengshi Li

Keyword(s):

Spatial Audio ◽


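Perancangan dan Analisis Kinerja Rendering Matrix Analysis (RMA) Module untuk MPEG Spatial Audio Object Coding (SAOC) [Design and Performance Analysis of a Rendering Matrix Analysis (RMA) Module for MPEG Spatial Audio Object Coding (SAOC)]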

10.25077/1520952004 ◽

2017 ◽

Author(s):

Amirul Luthfi

Keyword(s):

Matrix Analysis ◽

Spatial Audio ◽


Mastering Signal Processing with Residual Coding Scheme in Spatial Audio Object Coding

2013 International Conference on Information Science and Applications (ICISA) ◽

10.1109/icisa.2013.6579396 ◽

2013 ◽

Author(s):

Kwangki Kim ◽

Byeong-ok Jang ◽

Sanghyun Park ◽

Yonggwan Won ◽

Jinsul Kim

Keyword(s):

Signal Processing ◽

Spatial Audio ◽

Coding Scheme ◽

Object Coding ◽

Residual Coding


Harmonic elimination structures for Karaoke mode in Spatial Audio Object Coding scheme

2011 IEEE International Conference on Consumer Electronics (ICCE) ◽

10.1109/icce.2011.5722879 ◽

2011 ◽

Author(s):

Jihoon Park ◽

Jungpyo Hong ◽

Kwangki Kim ◽

Minsoo Hahn

Keyword(s):

Spatial Audio ◽

Coding Scheme ◽

Harmonic Elimination ◽


Efficient Residual Coding Method of Spatial Audio Object Coding with Two-Step Coding Structure for Interactive Audio Services

IEICE Transactions on Information and Systems ◽

10.1587/transinf.2015edl8248 ◽

2016 ◽

Vol E99.D (7) ◽

pp. 1949-1952 ◽

Author(s):

Byonghwa LEE ◽

Kwangki KIM ◽

Minsoo HAHN

Keyword(s):

Spatial Audio ◽

Coding Method ◽

Object Coding ◽

Residual Coding


Spatial Audio Object Coding Based on Time-Frequency Shifting and Scheduling

2021 IEEE International Conference on Multimedia and Expo (ICME) ◽

10.1109/icme51207.2021.9428297 ◽

2021 ◽

Author(s):

Chenhao Hu ◽

Ruimin Hu ◽

Xiaochen Wang ◽

Yulin Wu

Keyword(s):

Spatial Audio ◽

Time Frequency ◽

Object Coding ◽

Frequency Shifting


Modified spatial audio object coding scheme with harmonic extraction and elimination structure for interactive audio service

10.21437/interspeech.2010-755 ◽

2010 ◽

Author(s):

Jihoon Park ◽

Kwangki Kim ◽

Jeongil Seo ◽

Minsoo Hahn

Keyword(s):

Spatial Audio ◽

Coding Scheme ◽

Harmonic Extraction ◽


Spatial Audio Object Coding With Two-Step Coding Structure for Interactive Audio Service

IEEE Transactions on Multimedia ◽

10.1109/tmm.2011.2168197 ◽

2011 ◽

Vol 13 (6) ◽

pp. 1208-1216 ◽

Author(s):

Kwangki Kim ◽

Jeongil Seo ◽

Seungkwon Beack ◽

Kyeongok Kang ◽

Minsoo Hahn

Keyword(s):

Spatial Audio ◽


Real-Time Audio-Visual Analysis for Multiperson Videoconferencing

Advances in Multimedia ◽

10.1155/2013/175745 ◽

2013 ◽

Vol 2013 ◽

pp. 1-21 ◽

Author(s):

Petr Motlicek ◽

Stefan Duffner ◽

Danil Korchagin ◽

Hervé Bourlard ◽

Carl Scheffler ◽

...

Keyword(s):

Real Time ◽

Video Processing ◽

Semantic Information ◽

Visual Analysis ◽

State Of The Art ◽

Spatial Audio ◽

Detection And Tracking ◽

Video Objects ◽

Object Coding ◽

Virtual Director

We describe the design of a system consisting of several state-of-the-art real-time audio and video processing components that enable multimodal stream manipulation (e.g., automatic online editing for multiparty videoconferencing applications) in open, unconstrained environments. The underlying algorithms are designed to allow multiple people to enter, interact in, and leave the observable scene with no constraints. They comprise continuous localisation of audio objects and its application to spatial audio object coding; detection and tracking of faces; estimation of head poses and visual focus of attention; detection and localisation of verbal and paralinguistic events; and the association and fusion of these different events. Combined, they represent multimodal streams with audio objects and semantic video objects and provide semantic information for stream manipulation systems (such as a virtual director). Various experiments have been performed to evaluate the performance of the system. The results demonstrate the effectiveness of the proposed design and of the individual algorithms, as well as the benefit of fusing different modalities in this scenario.


Variable Subband Analysis for High Quality Spatial Audio Object Coding

2008 10th International Conference on Advanced Communication Technology ◽

10.1109/icact.2008.4493981 ◽

2008 ◽

Author(s):

Kyungryeol Koo ◽

Kwangki Kim ◽

Jeongil Seo ◽

Kyeongok Kang ◽

Minsoo Hahn

Keyword(s):

Spatial Audio ◽

High Quality ◽
