perceptual coding
Recently Published Documents


TOTAL DOCUMENTS

96
(FIVE YEARS 10)

H-INDEX

17
(FIVE YEARS 1)

2021 ◽  
Vol 149 (4) ◽  
pp. A32-A33
Author(s):  
Kelly L. Whiteford ◽  
Angela Sim ◽  
Andrew J. Oxenham

Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1924
Author(s):  
Patrick Seeling ◽  
Martin Reisslein ◽  
Frank H. P. Fitzek

The Tactile Internet will require ultra-low latencies for combining machines and humans in systems where humans are in the control loop. Real-time and perceptual coding in these systems commonly require content-specific approaches. We present a generic approach based on deliberately reduced number accuracy and evaluate the trade-off between savings achieved and errors introduced with real-world data for kinesthetic movement and tele-surgery. Our combination of bitplane-level accuracy adaptability with perceptual threshold-based limits allows for great flexibility in broad application scenarios. Combining the attainable savings with the relatively small introduced errors enables the optimal selection of a working point for the method in actual implementations.
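
The idea of pairing bitplane-level accuracy reduction with a perceptual threshold can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the 16-bit non-negative fixed-point sample format, and the use of a relative-error bound as the perceptual limit are all assumptions made for illustration.

```python
def truncate_bitplanes(value: int, dropped: int) -> int:
    """Zero the `dropped` least-significant bitplanes of a non-negative integer."""
    return (value >> dropped) << dropped


def max_droppable_bitplanes(value: int, threshold: float, total_bits: int = 16) -> int:
    """Return the largest number of low bitplanes that can be zeroed while the
    relative error stays at or below `threshold` (a hypothetical perceptual limit)."""
    if value == 0:
        # Nothing to preserve: every bitplane can be dropped.
        return total_bits
    best = 0
    for dropped in range(total_bits + 1):
        error = value - truncate_bitplanes(value, dropped)
        # The truncation error never shrinks as more bitplanes are dropped,
        # so the search can stop at the first violation.
        if error / value <= threshold:
            best = dropped
        else:
            break
    return best


sample = 1000            # hypothetical fixed-point sensor reading
limit = 0.10             # hypothetical 10% perceptual threshold
k = max_droppable_bitplanes(sample, limit)
reduced = truncate_bitplanes(sample, k)
```

Under these assumptions, a larger threshold permits more dropped bitplanes (more savings) at the cost of a larger introduced error, which is the trade-off the abstract describes.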


2021 ◽  
Vol 127 (1) ◽  
pp. 20-34
Author(s):  
Robin James

I argue that sound-centric scholarship can be of use to feminist theorists if and only if it begins from a non-ideal theory of sound; this article develops such a theory. To do this, I first develop more fully my claim that perceptual coding is a good metaphor for the ways that neoliberal market logics (re)produce relations of domination and subordination, such as white supremacist patriarchy. Because it was developed to facilitate the enclosure of the audio bandwidth, perceptual coding is especially helpful in centring the ways that patriarchal racial capitalism structures our concepts and experiences of both sound and technology. The first section identifies sonic cyberfeminist practices that function as a kind of perceptual coding because they subject ‘sound’ and/or ‘women’ to enclosure and accumulation by dispossession. The second section identifies a type of sonic cyberfeminism that tunes into the parts of the spectrum that this perceptual coding discards, building models of community and aesthetic value that do not rely on the exclusion of women, especially black women, from both humanist and posthuman concepts of personhood. Here I focus especially on Alexander Weheliye’s ‘phonographic’ approach to sound, technology and theoretical text. This approach, which he develops in his 2005 book of that title and in recent work in collaboration with Katherine McKittrick, avoids fetishising tech and self-transformation and focuses on practices that build registers of existence that hegemonic institutions perceptually code out of circulation. I conclude with examples of such phonographic compression, including Masters At Work’s ballroom classic ‘The Ha Dance’ and Nicki Minaj’s ‘Anaconda’.


2021 ◽  
Vol 108 ◽  
pp. 102903
Author(s):  
Xin Cui ◽  
Zongju Peng ◽  
Gangyi Jiang ◽  
Fen Chen ◽  
Mei Yu ◽  
...  

Author(s):  
Zhengyi Luo ◽  
Chen Zhu ◽  
Yan Huang ◽  
Rong Xie ◽  
Li Song ◽  
...  

2020 ◽  
Author(s):  
Gavin M. Bidelman ◽  
Claire Pearson ◽  
Ashleigh Harrison

Categorical judgments of otherwise identical phonemes are biased toward hearing words (i.e., the “Ganong effect”), suggesting that lexical context influences perception of even basic speech primitives. Lexical biasing could manifest via late-stage post-perceptual mechanisms related to decision or, alternatively, via top-down linguistic inference that acts on early perceptual coding. Here, we exploited the temporal sensitivity of EEG to resolve the spatiotemporal dynamics of these context-related influences on speech categorization. Listeners rapidly classified sounds from a /gi/ - /ki/ gradient presented in opposing word-nonword contexts (GIFT-kift vs. giss-KISS), designed to bias perception toward lexical items. Phonetic perception shifted toward the direction of words, establishing a robust Ganong effect behaviorally. ERPs revealed a neural analog of lexical biasing emerging within ∼200 ms. Source analyses uncovered a distributed neural network supporting the Ganong effect, including middle temporal gyrus (MTG), inferior parietal lobe (IPL), and middle frontal cortex. Yet, among Ganong-sensitive regions, only left MTG and IPL predicted behavioral susceptibility to lexical influence. Our findings confirm that lexical status rapidly constrains sub-lexical categorical representations for speech within several hundred milliseconds but likely does so outside the purview of canonical “auditory-linguistic” brain areas.


Entropy ◽  
2019 ◽  
Vol 21 (2) ◽  
pp. 165
Author(s):  
Xiantao Jiang ◽  
Tian Song ◽  
Daqi Zhu ◽  
Takafumi Katayama ◽  
Lu Wang

Perceptual video coding (PVC) can provide a lower bitrate at the same visual quality compared with traditional H.265/high efficiency video coding (HEVC). In this work, a novel H.265/HEVC-compliant PVC framework is proposed based on a video saliency model. First, an effective and efficient spatiotemporal saliency model is used to generate a video saliency map. Second, a perceptual coding scheme is developed based on the saliency map, and a saliency-based quantization control algorithm is proposed to reduce the bitrate. Finally, simulation results demonstrate that the proposed perceptual coding scheme performs better in objective and subjective tests, achieving up to a 9.46% bitrate reduction with negligible subjective and objective quality loss. An advantage of the proposed method is that it preserves high quality in high-definition video applications.
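
As a rough illustration of saliency-based quantization control, the Python sketch below maps a per-block saliency value to a quantization parameter (QP). It is not the algorithm proposed in the paper: the linear mapping, the `max_offset` parameter, and the function name are assumptions for illustration. The only fixed fact used is that H.265/HEVC QP values for 8-bit video lie in the range 0 to 51.

```python
HEVC_QP_MIN = 0
HEVC_QP_MAX = 51  # valid QP range for 8-bit H.265/HEVC


def saliency_to_qp(base_qp: int, saliency: float, max_offset: int = 4) -> int:
    """Map a block saliency in [0, 1] to a QP (hypothetical linear rule).

    Salient blocks get a lower QP (finer quantization, more bits);
    non-salient blocks get a higher QP (coarser quantization, fewer bits).
    """
    s = min(1.0, max(0.0, saliency))            # clamp saliency to [0, 1]
    offset = round(max_offset * (1.0 - 2.0 * s))  # +max_offset .. -max_offset
    return max(HEVC_QP_MIN, min(HEVC_QP_MAX, base_qp + offset))


# Hypothetical saliency map for one row of coding blocks.
saliency_row = [0.0, 0.5, 1.0]
qp_row = [saliency_to_qp(32, s) for s in saliency_row]
```

With this kind of rule, bits are shifted away from regions viewers are less likely to attend to, which is how a saliency map can lower the bitrate with little perceived quality loss.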

