Asymptotic Closed-Loop Design Of Transform Modes For The Inter-Prediction Residual In Video Coding

Author(s):  
Bharath Vishwanath ◽  
Shunyao Li ◽  
Kenneth Rose
Author(s):  
Srinivas Bachu ◽  
N. Ramya Teja

Due to the advancement of multimedia and its requirement of communication over the network, video compression has received much attention among the researchers. One of the popular video codings is scalable video coding, referred to as H.264/AVC standard. The major drawback in the H.264 is that it performs the exhaustive search over the interlayer prediction to gain the best rate-distortion performance. To reduce the computation overhead due to exhaustive search on mode prediction process, this paper presents a new technique for inter prediction mode selection based on the fuzzy holoentropy. This proposed scheme utilizes the pixel values and probabilistic distribution of pixel symbols to decide the mode. The adaptive mode selection is introduced here by analyzing the pixel values of the current block to be coded with those of a motion compensated reference block using fuzzy holoentropy. The adaptively selected mode decision can reduce the computation time without affecting the visual quality of frames. Experimentation of the proposed scheme is evaluated by utilizing five videos, and from the analysis, it is evident that proposed scheme has overall high performance with values of 41.367 dB and 0.992 for PSNR and SSIM respectively.


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Yaning Zhu

There is often noise in spoken machine English, which affects the accuracy of pronunciation. Therefore, how to accurately detect the noise in machine English spoken language and give standard spoken pronunciation is very important and meaningful. The traditional machine-oriented spoken English speech noise detection technology is limited to the improvement of software algorithm, mainly including speech enhancement technology and speech endpoint detection technology. Based on this, this paper will develop a wireless sensor network based on machine English oral pronunciation noise based on air and nonair conduction, reasonably design and configure air sensors, and nonair conduction sensors to deal with machine English oral pronunciation noise, so as to improve the naturalness and intelligibility of machine English speech. At the hardware level, this paper mainly optimizes the AD sampling, sensor matching layout, and internal hardware circuit board layout of the two types of sensors, so as to solve the compatibility problem between them and further reduce the hardware power consumption. In order to further verify or evaluate the performance of the machine spoken English speech noise detection sensor designed in this paper, a machine spoken English training system based on Android platform is designed. Compared with the traditional system, the training system can improve the intelligence of machine oriented oral English noise detection algorithm, so as to continuously improve the accuracy of system detection. The machine English pronunciation is adjusted and corrected by combining the data sensed by the sensor, so as to form a closed-loop design. The experimental results show that the wireless sensor sample proposed in this paper has obvious advantages in detecting the accuracy of machine English oral pronunciation, and its good closed-loop system is helpful to further improve the accuracy of machine English oral pronunciation.


Sign in / Sign up

Export Citation Format

Share Document