A novel algorithm for real-time H.264 video encoding

Author(s):  
Youwei Yuan ◽  
Chengwu Zhou ◽  
Wenlong Ye ◽  
Lamei Yan
Author(s):  
Bernardo Breve ◽  
Stefano Cirillo ◽  
Mariano Cuofano ◽  
Domenico Desiato

AbstractGestural expressiveness plays a fundamental role in the interaction with people, environments, animals, things, and so on. Thus, several emerging application domains would exploit the interpretation of movements to support their critical designing processes. To this end, new forms to express the people’s perceptions could help their interpretation, like in the case of music. In this paper, we investigate the user’s perception associated with the interpretation of sounds by highlighting how sounds can be exploited for helping users in adapting to a specific environment. We present a novel algorithm for mapping human movements into MIDI music. The algorithm has been implemented in a system that integrates a module for real-time tracking of movements through a sample based synthesizer using different types of filters to modulate frequencies. The system has been evaluated through a user study, in which several users have participated in a room experience, yielding significant results about their perceptions with respect to the environment they were immersed.


Author(s):  
Gaurav Chaurasia ◽  
Arthur Nieuwoudt ◽  
Alexandru-Eugen Ichim ◽  
Richard Szeliski ◽  
Alexander Sorkine-Hornung

We present an end-to-end system for real-time environment capture, 3D reconstruction, and stereoscopic view synthesis on a mobile VR headset. Our solution allows the user to use the cameras on their VR headset as their eyes to see and interact with the real world while still wearing their headset, a feature often referred to as Passthrough. The central challenge when building such a system is the choice and implementation of algorithms under the strict compute, power, and performance constraints imposed by the target user experience and mobile platform. A key contribution of this paper is a complete description of a corresponding system that performs temporally stable passthrough rendering at 72 Hz with only 200 mW power consumption on a mobile Snapdragon 835 platform. Our algorithmic contributions for enabling this performance include the computation of a coarse 3D scene proxy on the embedded video encoding hardware, followed by a depth densification and filtering step, and finally stereoscopic texturing and spatio-temporal up-sampling. We provide a detailed discussion and evaluation of the challenges we encountered, as well as algorithm and performance trade-offs in terms of compute and resulting passthrough quality.;AB@The described system is available to users as the Passthrough+ feature on Oculus Quest. We believe that by publishing the underlying system and methods, we provide valuable insights to the community on how to design and implement real-time environment sensing and rendering on heavily resource constrained hardware.


2013 ◽  
Vol 22 (06) ◽  
pp. 1360019
Author(s):  
DAMON BLANCHETTE ◽  
EMMANUEL AGU

Spectral rendering, or the synthesis of images by taking into account the constituent wavelengths of white light, enables the rendering of iridescent colors caused by phenomena such as dispersion, diffraction, interference and scattering. Caustics, the focusing and defocusing of light through a refractive medium, can be interpreted as a special case of dispersion where all the wavelengths travel along the same paths. In this paper we extend Adaptive Caustic Mapping (ACM), a previously proposed caustics mapping algorithm, to handle physically-based dispersion. Because ACM can display caustics in real-time, it is amenable to extension to handle the more general case of dispersion. We also present a novel algorithm for filling in the gaps that occur due to discrete sampling of the spectrum. Our proposed method runs in screen-space, and is fast enough to display plausible dispersion phenomena at real-time and interactive frame rates.


2011 ◽  
Vol 383-390 ◽  
pp. 5028-5033
Author(s):  
Xue Mei Xu ◽  
Qin Mo ◽  
Lan Ni ◽  
Qiao Yun Guo ◽  
An Li

In the video encoding system, motion estimation plays an important role at the front-end of encoder, which can eliminate inter redundancy efficiently and improve encoding efficiency. However, traditional motion estimation algorithm can’t be used in real-time application like video monitoring due to its computational complexity. In order to improve real-time efficiency, an improved motion estimation algorithm is proposed in this paper. The essential ideas consist of early termination rules, prediction of initial search point, and determination of motion type. Furthermore, our algorithm adopts different search patterns for certain motion activity. Experimental result shows that the improved algorithm reduces the computation time significantly while maintaining the image quality, and satisfies real time requirement in monitoring system.


Sensors ◽  
2022 ◽  
Vol 22 (2) ◽  
pp. 516
Author(s):  
Alessandro Rizzo ◽  
Francesco Cardellini ◽  
Claudio Poggi ◽  
Enrico Borra ◽  
Luca Ciciani ◽  
...  

Nowadays, radon gas exposure is considered one of the main health concerns for the population because, by carrying about half the total dose due to environmental radioactivity, it is the second cause of lung cancer after smoking. Due to a relatively long half-life of 3.82 days, the chemical inertia and since its parent Ra-226 is largely diffuse on the earthrgb]0,0,1’s crust and especially in the building materials, radon can diffuse and potentially saturate human habitats, with a concentration that can suddenly change during the 24 h day depending on temperature, pressure, and relative humidity. For such reasons, `real-time’ measurements performed by an active detector, possibly of small dimensions and a handy configuration, can play an important role in evaluating the risk and taking the appropriate countermeasures to mitigate it. In this work, a novel algorithm for pattern recognition was developed to exploit the potentialities of silicon active detectors with a pixel matrix structure to measure radon through the α emission, in a simple measurement configuration, where the device is placed directly in air with no holder, no collection filter or electrostatic field to drift the radon progenies towards the detector active area. This particular measurement configuration (dubbed as bare) requires an α/β-discrimination method that is not based on spectroscopic analysis: as the gas surrounds the detector the α particles are emitted at different distances from it, so they lose variable energy amount in air depending on the traveled path-length which implies a variable deposited energy in the active area. The pixels matrix structure allows overcoming this issue because the interaction of α, β and γ particles generate in the active area of the detector clusters (group of pixels where a signal is read) of different shape and energy dispersion. The novel algorithm that exploits such a phenomenon was developed using a pixelated silicon detector of the TimePix family with a compact design. An α(Am-241) and a β(Sr-90) source were used to calibrate the algorithm and to evaluate its performances in terms of β rejection capability and α recognition efficiency. Successively, the detector was exposed to different radon concentrations at the ENEA-INMRI radon facility in `bare’ configuration, in order to check the linearity of the device response over a radon concentration range. The results for this technique are presented and discussed, highlighting the potential applications especially the possibility to exploit small and handy detectors to perform radon active measurements in the simplest configuration.


2014 ◽  
Vol 1003 ◽  
pp. 249-253
Author(s):  
Hao Fang ◽  
Ai Hua Li ◽  
Yan Fei Liu

To solve the difficulty of traditional video monitoring system in system upgrade and expansion, an solution of embedded video monitoring system based on DaVinci technology was put forward in this paper. By building the monitoring platform by DM6437 and DSP/BIOS in the solution, TVP5151 was used for receiving video signal in PAL/NTSC formats, and an JPEG Baseline Profile Encoder was integrated for video encoding, and the 10/100M Ethernet transmission function was realized based on NDK. Finally, the system is tested and the result shows that the system can capture and transmit D1 format signal in 25f/s and met the real-time requirement. At the same time, the system is easy to use and expand with a bright application prospect.


2012 ◽  
Vol 50 (2) ◽  
pp. 280-286 ◽  
Author(s):  
Nikolaus Karpinsky ◽  
Song Zhang
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document