Dynamic Scene: Recently Published Documents

Total documents: 377 (five years: 102)
H-index: 21 (five years: 5)

Author(s): Tianlin Zhang, Jinjiang Li, Hui Fan

Abstract: Deblurring images of dynamic scenes is challenging because the blur arises from a combination of many factors. In recent years, multi-scale pyramid methods for recovering high-resolution sharp images have been studied extensively. We address the weak detail recovery of cascaded structures with a network that progressively integrates its data streams. Our new multi-scale structure and edge-feature-aware design handle blur that varies across spatial scales and increase the network's sensitivity to blurred edges. The coarse-to-fine architecture restores the image structure by first performing global adjustments and then local refinement. In this way the network accounts for global correlation while also exploiting residual information, markedly improving restoration and enhancing texture detail. Experimental results show quantitative and qualitative improvements over existing methods.
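
To make the coarse-to-fine idea above concrete, here is a minimal PyTorch sketch that restores an image from the coarsest pyramid level upward, feeding each estimate into the next finer scale as a residual starting point. The block design and all names are illustrative assumptions, not the authors' actual network.

```python
# Hypothetical sketch of a coarse-to-fine multi-scale deblurring pass.
import torch
import torch.nn.functional as F
from torch import nn

class ScaleBlock(nn.Module):
    """One refinement stage; structure is illustrative, not the paper's."""
    def __init__(self, ch=32):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3 + 3, ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(ch, 3, 3, padding=1),
        )

    def forward(self, blurred, prev_estimate):
        # Residual prediction: refine the upsampled coarser estimate.
        return prev_estimate + self.body(torch.cat([blurred, prev_estimate], dim=1))

def coarse_to_fine(blurred, stages, num_scales=3):
    """Restore from the coarsest scale upward, reusing each estimate."""
    pyramid = [F.interpolate(blurred, scale_factor=0.5 ** s, mode="bilinear",
                             align_corners=False) for s in range(num_scales)]
    estimate = pyramid[-1]  # start from the coarsest blurred image
    for s, stage in zip(reversed(range(num_scales)), stages):
        estimate = F.interpolate(estimate, size=pyramid[s].shape[-2:],
                                 mode="bilinear", align_corners=False)
        estimate = stage(pyramid[s], estimate)
    return estimate

# Usage: sharp = coarse_to_fine(img, [ScaleBlock() for _ in range(3)])
```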


2022, Vol. 2022, pp. 1-12
Author(s): Langping An, Xianfei Pan, Tingting Li, Mang Wang

Real-time, robust state estimation for pedestrians is a challenging problem in satellite-denied environments. The zero-velocity-aided foot-mounted inertial navigation system (INS), a conventional method for estimating pose relative to a known origin, suffers from an unobservable heading, error accumulation, and poorly adaptable parameters. Visual-inertial fusion has been a popular state-estimation technology over the past decades, but it cannot fully exploit the movement characteristics of pedestrians. In this paper, we propose a novel visual-aided inertial navigation algorithm for pedestrians that improves robustness in dynamic environments and across pedestrian motion types. The proposed algorithm combines the zero-velocity-aided INS with visual odometry to obtain more accurate pose estimates in a variety of environments. The INS parameters are then adjusted adaptively by treating the errors between the fused estimates and the INS outputs as observations in a factor graph. We evaluate the performance of our system in real-world experiments. The results, compared against other algorithms, show that the proposed algorithm greatly improves absolute trajectory accuracy, especially in dynamic-scene and multi-motion trials.
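
For readers unfamiliar with zero-velocity aiding, the following is a minimal sketch of the stance-phase detector such a foot-mounted INS relies on; the window size and thresholds are illustrative assumptions, not the paper's values.

```python
# Hedged sketch: a stance-phase (zero-velocity) detector for a
# foot-mounted INS. All thresholds are illustrative assumptions.
import numpy as np

GRAVITY = 9.81  # m/s^2

def detect_zero_velocity(accel, gyro, win=5, acc_var_thresh=0.5, gyro_thresh=0.6):
    """Flag samples where the foot is likely stationary.

    accel, gyro: (N, 3) arrays of specific force [m/s^2] and rate [rad/s].
    Returns a boolean array of length N; True marks likely stance phase.
    """
    n = len(accel)
    acc_norm = np.linalg.norm(accel, axis=1)
    gyro_norm = np.linalg.norm(gyro, axis=1)
    zupt = np.zeros(n, dtype=bool)
    for i in range(win, n - win):
        window = acc_norm[i - win:i + win]
        # Stationary foot: acceleration magnitude close to gravity with
        # low variance, and low angular rate.
        if (abs(window.mean() - GRAVITY) < 1.0
                and window.var() < acc_var_thresh
                and gyro_norm[i] < gyro_thresh):
            zupt[i] = True
    return zupt
```

During the flagged samples, the filter can apply a zero-velocity pseudo-measurement to bound INS drift, which is the aiding mechanism the abstract refers to.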


Author(s): Bingcai Wei, Liye Zhang, Kangtao Wang, Qun Kong, Zhuang Wang

Abstract: Extracting traffic information from images plays an increasingly significant role in the Internet of Vehicles (IoV). However, because of the vehicle's high-speed movement and bumps, images are blurred during acquisition. In addition, on rainy days, rain attached to the lens occludes the target and distorts the image. These problems are serious obstacles to extracting key information from transportation images; they degrade the vehicle control system's real-time judgment of road conditions, can lead to decision-making errors, and may even cause traffic accidents. In this paper, we propose a motion-blur restoration and rain-removal algorithm for the IoV based on a generative adversarial network and transfer learning. Dynamic scene deblurring and image de-raining are both challenging classical research directions in low-level vision. For both tasks, firstly, instead of using ReLU in a conventional residual block, we designed a residual block containing three 256-channel convolutional layers with the Leaky-ReLU activation function. Secondly, we used generative adversarial networks with our residual blocks for both the image deblurring and the image de-raining task. Thirdly, experimental results on the synthetic blur dataset GOPRO and the real blur dataset RealBlur confirm the effectiveness of our model for image deblurring. Finally, treating image de-raining as a transfer-learning task, we fine-tune the pre-trained model with less training data and show good results on several image rain-removal datasets.
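
A minimal PyTorch sketch of the modified residual block described above (three 256-channel convolutions activated by Leaky-ReLU instead of ReLU) might look as follows; the kernel size, negative slope, and activation placement are assumptions, since the abstract does not specify them.

```python
# Sketch of a residual block with three 256-channel convolutions and
# Leaky-ReLU activations; hyperparameters are illustrative assumptions.
import torch
from torch import nn

class ResBlock256(nn.Module):
    def __init__(self, channels=256, negative_slope=0.2):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.LeakyReLU(negative_slope, inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.LeakyReLU(negative_slope, inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
        )

    def forward(self, x):
        # Standard skip connection around the three-convolution body.
        return x + self.body(x)

# Usage: y = ResBlock256()(torch.randn(1, 256, 64, 64))
```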


2021, Vol. 40 (12-14), pp. 1510-1546
Author(s): Antoni Rosinol, Andrew Violette, Marcus Abate, Nathan Hughes, Yun Chang, ...

Humans are able to form a complex mental model of the environment they move in. This mental model captures geometric and semantic aspects of the scene, describes the environment at multiple levels of abstraction (e.g., objects, rooms, buildings), and includes static and dynamic entities and their relations (e.g., a person is in a room at a given time). In contrast, current robots' internal representations still provide a partial and fragmented understanding of the environment, either in the form of a sparse or dense set of geometric primitives (e.g., points, lines, planes, and voxels) or as a collection of objects. This article attempts to reduce the gap between robot and human perception by introducing a novel representation, a 3D dynamic scene graph (DSG), that seamlessly captures metric and semantic aspects of a dynamic environment. A DSG is a layered graph where nodes represent spatial concepts at different levels of abstraction and edges represent spatiotemporal relations among nodes. Our second contribution is Kimera, the first fully automatic method to build a DSG from visual–inertial data. Kimera includes accurate algorithms for visual–inertial simultaneous localization and mapping (SLAM), metric–semantic 3D reconstruction, object localization, human pose and shape estimation, and scene parsing. Our third contribution is a comprehensive evaluation of Kimera on real-life datasets and photo-realistic simulations, including a newly released dataset, uHumans2, which simulates a collection of crowded indoor and outdoor scenes. Our evaluation shows that Kimera achieves competitive performance in visual–inertial SLAM, estimates an accurate 3D metric–semantic mesh model in real time, and builds a DSG of a complex indoor environment with tens of objects and humans in minutes. Our final contribution is to showcase how to use a DSG for real-time hierarchical semantic path-planning. The core modules of Kimera have been released open source.
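
As a rough illustration of the layered-graph idea (not Kimera's actual API), a minimal DSG could be represented like this, with hypothetical layer names, fields, and relation labels:

```python
# Minimal sketch of a layered 3D dynamic scene graph: nodes are spatial
# concepts at different abstraction layers, edges are spatio-temporal
# relations. Layer names and fields are illustrative assumptions.
from dataclasses import dataclass, field
from typing import Optional

LAYERS = ("mesh", "objects_agents", "places", "rooms", "buildings")

@dataclass
class DSGNode:
    node_id: str
    layer: str                                      # one of LAYERS
    attributes: dict = field(default_factory=dict)  # e.g., pose, semantic label

@dataclass
class DSGEdge:
    source: str
    target: str
    relation: str                       # e.g., "is_in", "adjacent_to"
    timestamp: Optional[float] = None   # dynamic relations carry a time

class DynamicSceneGraph:
    """Layered graph: edges may link nodes within a layer (adjacency)
    or across layers (abstraction, e.g., an object belongs to a room)."""

    def __init__(self):
        self.nodes = {}
        self.edges = []

    def add_node(self, node):
        assert node.layer in LAYERS, f"unknown layer {node.layer}"
        self.nodes[node.node_id] = node

    def relate(self, src, dst, relation, t=None):
        self.edges.append(DSGEdge(src, dst, relation, t))

# Example: a person (dynamic agent) is in a room at t = 12.3 s.
g = DynamicSceneGraph()
g.add_node(DSGNode("room_1", "rooms", {"label": "kitchen"}))
g.add_node(DSGNode("human_0", "objects_agents", {"label": "person"}))
g.relate("human_0", "room_1", "is_in", t=12.3)
```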


2021, Vol. 14 (2)
Author(s): Suvi Holm, Tuomo Häikiö, Konstantin Olli, Johanna Kaakinen

The role of individual differences during dynamic scene viewing was explored. Participants (N=38) watched a gameplay video of a first-person shooter (FPS) videogame while their eye movements were recorded. In addition, the participants' skills in three visual attention tasks (attentional blink, visual search, and multiple object tracking) were assessed. The results showed that individual differences in the visual attention tasks were associated with the eye movement patterns observed while viewing the gameplay video. The differences appeared in four eye movement measures: number of fixations, fixation durations, saccade amplitudes, and fixation distances from the center of the screen. The individual differences emerged during specific events of the video as well as across the video as a whole. The results highlight that an unedited, fast-paced, and cluttered dynamic scene can reveal individual differences in dynamic scene viewing.
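
For illustration, the four measures could be computed from a fixation list roughly as follows; the field names, pixel units, and screen size are assumptions, not the study's processing pipeline.

```python
# Illustrative computation of the four eye-movement measures from a list
# of detected fixations; all names and units here are assumptions.
import numpy as np

SCREEN_CENTER = np.array([960.0, 540.0])  # assumed 1920x1080 display

def summarize_fixations(fixations):
    """fixations: list of dicts with 'x', 'y' (px) and 'duration' (ms)."""
    pts = np.array([[f["x"], f["y"]] for f in fixations])
    durations = np.array([f["duration"] for f in fixations])
    # Saccade amplitude approximated as the distance between consecutive
    # fixations (in pixels; studies usually report degrees of visual angle).
    amplitudes = np.linalg.norm(np.diff(pts, axis=0), axis=1)
    return {
        "n_fixations": len(fixations),
        "mean_fixation_duration_ms": durations.mean(),
        "mean_saccade_amplitude_px": amplitudes.mean(),
        "mean_distance_from_center_px":
            np.linalg.norm(pts - SCREEN_CENTER, axis=1).mean(),
    }
```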


2021
Author(s): Yizhen Lao, Jie Yang, Xinying Wang, Jianxin Lin, Yu Cao, ...

2021
Author(s): Bingcai Wei, Liye Zhang, Kangtao Wang, Qun Kong, Zhuang Wang

Abstract: Extracting traffic information from images plays an important role in the Internet of Vehicles (IoV). However, because of the vehicle's high-speed movement and bumpiness, motion blur occurs during image acquisition. In addition, on rainy days, rain attached to the lens occludes the target and distorts the image. These problems are serious obstacles to extracting key information from transportation images; they degrade the vehicle control system's real-time judgment of road conditions, can lead to decision-making errors, and may even cause traffic accidents. In this paper, we propose a motion-blur restoration and rain-removal algorithm for the IoV based on a Generative Adversarial Network (GAN) and transfer learning. Dynamic scene deblurring and image de-raining are both challenging classical tasks in low-level vision. For both tasks, firstly, instead of using ReLU in a conventional residual block, we designed a residual block containing three 256-channel convolutional layers with the Leaky-ReLU activation function. Secondly, we used generative adversarial networks with our residual block for both the image deblurring and the image de-raining task. Thirdly, experimental results on the synthetic blur dataset GOPRO and the real blur dataset RealBlur confirm the effectiveness of our model for image deblurring. Finally, we use the pre-trained model for the transfer-learning-based image de-raining task and show good results on several datasets.


Signals, 2021, Vol. 2 (3), pp. 604-618
Author(s): Paritosh Parmar, Brendan Morris

Spatiotemporal representations learned with 3D convolutional neural networks (CNNs) currently underpin state-of-the-art approaches to action-related tasks. However, 3D-CNNs are notorious for being memory- and compute-intensive compared with simpler 2D-CNN architectures. We propose to hallucinate the spatiotemporal representations of a 3D-CNN teacher with a 2D-CNN student. By requiring the 2D-CNN to predict the future and intuit upcoming activity, it is encouraged to gain a deeper understanding of actions and how they evolve. The hallucination task is treated as an auxiliary task that can be combined with any other action-related task in a multitask learning setting. Through thorough experimental evaluation, we show that the hallucination task indeed helps improve performance on action recognition, action quality assessment, and dynamic scene recognition. From a practical standpoint, hallucinating spatiotemporal representations without an actual 3D-CNN enables deployment in resource-constrained scenarios, such as those with limited compute power and/or lower bandwidth. We also observed that our hallucination task has utility not only during the training phase, but also during the pre-training phase.
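
A minimal sketch of the hallucination setup described above: a 2D-CNN student regresses the teacher's spatiotemporal features as an auxiliary loss next to the main task head. The tiny backbone, feature dimension, and loss weight are assumptions for illustration, not the paper's architecture.

```python
# Sketch: 2D-CNN student hallucinating 3D-CNN teacher features as an
# auxiliary (multitask) objective; all sizes and weights are assumptions.
import torch
from torch import nn

class Student2D(nn.Module):
    def __init__(self, feat_dim=512, num_classes=10):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 64, 7, stride=2, padding=3), nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.hallucinate = nn.Linear(64, feat_dim)  # predicts teacher features
        self.classify = nn.Linear(64, num_classes)  # main action-related head

    def forward(self, frame):
        h = self.backbone(frame)
        return self.classify(h), self.hallucinate(h)

def multitask_loss(logits, hallucinated, labels, teacher_feats, alpha=0.5):
    # Main task loss plus auxiliary hallucination (feature regression) loss.
    task = nn.functional.cross_entropy(logits, labels)
    halluc = nn.functional.mse_loss(hallucinated, teacher_feats)
    return task + alpha * halluc
```

At deployment time only the 2D-CNN student runs, which is what enables the resource-constrained scenarios the abstract mentions.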

