MemX

This work presents MemX: a biologically-inspired attention-aware eyewear system developed with the goal of pursuing the long-awaited vision of a personalized visual Memex. MemX captures human visual attention on the fly, analyzes the salient visual content, and records moments of personal interest in the form of compact video snippets. Accurate attentive scene detection and analysis on resource-constrained platforms is challenging because these tasks are computation and energy intensive. We propose a new temporal visual attention network that unifies human visual attention tracking and salient visual content analysis. Attention tracking focuses computation-intensive video analysis on salient regions, while video analysis makes human attention detection and tracking more accurate. Using the YouTube-VIS dataset and 30 participants, we experimentally show that MemX significantly improves the attention tracking accuracy over the eye-tracking-alone method, while maintaining high system energy efficiency. We have also conducted 11 in-field pilot studies across a range of daily usage scenarios, which demonstrate the feasibility and potential benefits of MemX.


Content-Based Video Semantic Analysis

Semantic Mining Technologies for Multimedia Databases ◽

10.4018/978-1-60566-188-9.ch009 ◽

2011 ◽

pp. 211-235 ◽

Cited By ~ 1

Author(s):

Shuqiang Jiang ◽

Yonghong Tian ◽

Qingming Huang ◽

Tiejun Huang ◽

Wen Gao

Keyword(s):

Visual Attention ◽

Object Detection ◽

Video Analysis ◽

Event Detection ◽

Semantic Analysis ◽

Research Area ◽

Video Data ◽

Computing Power ◽

Detection And Tracking ◽

Video Semantic Analysis

With the explosive growth in the amount of video data and rapid advance in computing power, extensive research efforts have been devoted to content-based video analysis. In this chapter, the authors will give a broad discussion on this research area by covering different topics such as video structure analysis, object detection and tracking, event detection, visual attention analysis, and so forth. In the meantime, different video representation and indexing models are also presented.


Visual Attention Distribution and its Application in the Gesture Interaction System

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.713-715.2185 ◽

2015 ◽

Vol 713-715 ◽

pp. 2185-2188

Author(s):

Na Na He ◽

Zhi Quan Feng ◽

Zhong Zhu Huang ◽

Xue Wen Yang

Keyword(s):

Visual Attention ◽

Cognitive Model ◽

Distribution Model ◽

Tracking Accuracy ◽

Gesture Interaction ◽

Visual Attention Model ◽

Attention Model ◽

Gesture Tracking ◽

Basic Characteristics ◽

Human Visual Attention

To make the computer simulation of human visual attention behavior more realistic, a gesture tracking algorithm based on a distribution model of visual attention is put forward, starting from an analysis of the operator's cognitive model. First, a visual attention model is built by analyzing changes in the operator's line of sight. Second, the basic characteristics of the visual attention model are studied. Finally, the three Gauss formula is used to describe the model. Experimental results show that the algorithm effectively improves the speed and tracking accuracy of gesture interaction.


A unified framework for automated person re-indentification

Transport and Communication Science Journal ◽

10.25073/tcsj.71.7.11 ◽

2020 ◽

Vol 71 (7) ◽

pp. 868-880

Author(s):

Nguyen Hong-Quan ◽

Nguyen Thuy-Binh ◽

Tran Duc-Long ◽

Le Thi-Lan

Keyword(s):

Deep Learning ◽

Video Analysis ◽

Camera Network ◽

Unified Framework ◽

Person Detection ◽

Practical Applications ◽

Detection And Tracking ◽

Analysis System ◽

Bounding Boxes

With the rapid development of camera networks, video analysis systems have become increasingly popular and have been applied in various practical applications. In this paper, we focus on the person re-identification (person ReID) task, which is a crucial step in video analysis systems. The purpose of person ReID is to associate multiple images of a given person moving through a non-overlapping camera network. Much effort has been devoted to person ReID. However, most studies on person ReID deal only with well-aligned bounding boxes that are detected manually and treated as perfect inputs. In fact, when building a fully automated person ReID system, the quality of the two preceding steps, person detection and tracking, may strongly affect person ReID performance. The contribution of this paper is two-fold. First, a unified framework for person ReID based on deep learning models is proposed. This framework couples a deep neural network for person detection with a deep-learning-based tracking method. In addition, features extracted from an improved ResNet architecture are proposed for person representation to achieve higher ReID accuracy. Second, our self-built dataset is introduced and used to evaluate all three steps of the fully automated person ReID framework.


Saliency-Based Gaze Visualization for Eye Movement Analysis

Sensors ◽

10.3390/s21155178 ◽

2021 ◽

Vol 21 (15) ◽

pp. 5178

Author(s):

Sangbong Yoo ◽

Seongmin Jeong ◽

Seokyeon Kim ◽

Yun Jang

Keyword(s):

Eye Movements ◽

Visual Attention ◽

Eye Movement ◽

Visual Stimuli ◽

Saliency Map ◽

Gaze Behavior ◽

The Gaze ◽

Movement Data ◽

Visual Clues ◽

Human Visual Attention

Gaze movement and visual stimuli have been utilized to analyze human visual attention intuitively. Gaze behavior studies mainly present statistical analyses of eye movements and human visual attention. During these analyses, eye movement data and the saliency map are presented to the analysts as separate views or merged views. However, the analysts become frustrated when they need to memorize all of the separate views or when the eye movements obscure the saliency map in the merged views. As a result, it is not easy to analyze how visual stimuli affect gaze movements, since existing techniques focus excessively on the eye movement data. In this paper, we propose a novel visualization technique for analyzing gaze behavior that uses saliency features as visual clues to express the visual attention of an observer. The visual clues that represent visual attention are analyzed to reveal which saliency features are prominent for the visual stimulus analysis. We visualize the gaze data together with the saliency features to interpret the visual attention. We analyze gaze behavior with the proposed visualization to evaluate whether embedding saliency features within the visualization helps us understand the visual attention of an observer.


Content-Based Filtering Enhanced by Human Visual Attention Applied to Clothing Recommendation

2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI) ◽

10.1109/ictai.2015.98 ◽

2015 ◽

Cited By ~ 3

Author(s):

Ernani Viriato de Melo ◽

Emilia Alves Nogueira ◽

Denise Guliato

Keyword(s):

Visual Attention ◽

Content Based Filtering ◽

Human Visual Attention


Evaluation Method of Camouflage Effect Based on Human Visual Attention Mechanism

10.1109/aeeca52519.2021.9574326 ◽

2021 ◽

Author(s):

Qi Jia ◽

Mingxin Liu ◽

Yang Yi ◽

Liyan Zhu

Keyword(s):

Visual Attention ◽

Evaluation Method ◽

Attention Mechanism ◽

Visual Attention Mechanism ◽

Human Visual Attention


Rethinking human visual attention: Spatial cueing effects and optimality of decisions by honeybees, monkeys and humans

Vision Research ◽

10.1016/j.visres.2012.12.011 ◽

2013 ◽

Vol 85 ◽

pp. 5-19 ◽

Cited By ~ 27

Author(s):

Miguel P. Eckstein ◽

Stephen C. Mack ◽

Dorion B. Liston ◽

Lisa Bogush ◽

Randolf Menzel ◽

...

Keyword(s):

Visual Attention ◽

Spatial Cueing ◽

Human Visual Attention


I Know Where You Are Secretly Attending! The topography of human visual attention revealed with fMRI

Journal of Vision ◽

10.1167/10.7.9 ◽

2010 ◽

Vol 10 (7) ◽

pp. 9-9

Author(s):

E. DeYoe ◽

R. Datta

Keyword(s):

Visual Attention ◽

Human Visual Attention


Bottom-up visual attention model for still image: a preliminary study

International Journal of Advances in Intelligent Informatics ◽

10.26555/ijain.v6i1.469 ◽

2020 ◽

Vol 6 (1) ◽

pp. 82

Author(s):

Adhi Prahara ◽

Murinto Murinto ◽

Dewi Pramudi Ismi

Keyword(s):

Visual Attention ◽

Object Detection ◽

Video Compression ◽

Saliency Map ◽

Bottom Up ◽

Attention Model ◽

Intrinsic Cues ◽

Preliminary Study ◽

Segmentation Image ◽

Human Visual Attention

Human visual attention is explained scientifically in cognitive psychology and neuroscience, and modeled computationally in computer science and engineering. Visual attention models have been applied in computer vision tasks such as object detection, object recognition, image segmentation, image and video compression, action recognition, and visual tracking. This work studies bottom-up visual attention, namely human fixation prediction and salient object detection models. The preliminary study briefly covers the biological perspective of visual attention, including the visual pathway and the theory of visual attention, and then the computational model of bottom-up visual attention that generates a saliency map. The study compares several models at each stage and observes whether the stage is inspired by the biological architecture, concept, or behavior of human visual attention. The study finds that low-level features, the center-surround mechanism, sparse representation, and higher-level guidance with intrinsic cues dominate bottom-up visual attention approaches. The study also highlights the correlation between bottom-up visual attention and curiosity.
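As a rough illustration of the center-surround mechanism mentioned in this abstract, the sketch below computes a single-channel saliency map as the absolute difference between a fine-scale and a coarse-scale Gaussian blur of an intensity image. It is not taken from the paper: the function names, the two sigma values, and the use of a difference of Gaussians are assumptions chosen for brevity, and the models the study surveys combine many more features and scales.

```python
import numpy as np


def gaussian_kernel(sigma):
    """Return a normalized 1-D Gaussian kernel truncated at about 3 sigma."""
    radius = max(1, int(3 * sigma))
    xs = np.arange(-radius, radius + 1)
    kernel = np.exp(-(xs ** 2) / (2.0 * sigma ** 2))
    return kernel / kernel.sum()


def blur(image, sigma):
    """Separable Gaussian blur with reflect padding (output keeps the input shape)."""
    kernel = gaussian_kernel(sigma)
    pad = len(kernel) // 2
    padded = np.pad(image, pad, mode="reflect")
    # Filter along rows, then along columns.
    rows = np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(
        lambda c: np.convolve(c, kernel, mode="valid"), 0, rows)


def center_surround_saliency(intensity, center_sigma=1.0, surround_sigma=4.0):
    """Hypothetical helper: |center - surround| response, scaled to [0, 1].

    The sigma defaults are illustrative assumptions, not values from the paper.
    """
    intensity = np.asarray(intensity, dtype=float)
    center = blur(intensity, center_sigma)
    surround = blur(intensity, surround_sigma)
    response = np.abs(center - surround)
    peak = response.max()
    # Avoid amplifying floating-point noise on (nearly) uniform images.
    return response / peak if peak > 1e-12 else np.zeros_like(response)


if __name__ == "__main__":
    image = np.zeros((32, 32))
    image[15:18, 15:18] = 1.0  # a small bright patch on a dark background
    saliency = center_surround_saliency(image)
    print(np.unravel_index(np.argmax(saliency), saliency.shape))
```

A region that differs from its neighborhood produces a large response, while uniform regions produce a response near zero, which is the basic intuition behind center-surround saliency.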


Convolutional Neural Network Using Kalman Filter for Human Detection and Tracking on RGB-D Video

CommIT (Communication and Information Technology) Journal ◽

10.21512/commit.v12i2.4890 ◽

2018 ◽

Vol 12 (2) ◽

pp. 105 ◽

Cited By ~ 1

Author(s):

Jovin Angelico ◽

Ken Ratri Retno Wardani

Keyword(s):

Neural Network ◽

Kalman Filter ◽

Convolutional Neural Network ◽

Human Detection ◽

Detection Accuracy ◽

Tracking Accuracy ◽

Lighting Condition ◽

Detection And Tracking ◽

Computation Efficiency ◽

The Impact

The ability of computers to detect humans through computer vision is still being improved in both accuracy and computation time. In low-lighting conditions, detection accuracy is usually low. In addition to the RGB channels, this research uses a depth map that shows each object's distance relative to the camera. It integrates a Cascade Classifier (CC) to localize potential objects, a Convolutional Neural Network (CNN) to classify images as human or nonhuman, and a Kalman filter to track human movement. For training and testing, two RGB-D datasets with different points of view and lighting conditions are used. Images containing heavy noise and occlusion were removed from both datasets so that the training process is more focused. Using these integrated techniques, detection and tracking accuracy reaches 77.7%. Using the Kalman filter increases computation efficiency by 41%.
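The tracking stage described in this abstract can be illustrated with a generic constant-velocity Kalman filter over 2-D detection centers. This is a minimal sketch under stated assumptions, not the authors' implementation: the state layout `[x, y, vx, vy]`, the noise variances, and the function names are hypothetical, and a missing detection (`None`) is handled by running the prediction step alone.

```python
import numpy as np


def make_cv_model(dt=1.0, process_var=1e-2, meas_var=1.0):
    """Build constant-velocity model matrices (noise variances are assumed values)."""
    F = np.array([[1, 0, dt, 0],
                  [0, 1, 0, dt],
                  [0, 0, 1, 0],
                  [0, 0, 0, 1]], dtype=float)  # state transition
    H = np.array([[1, 0, 0, 0],
                  [0, 1, 0, 0]], dtype=float)  # only position is observed
    Q = process_var * np.eye(4)                # process noise covariance
    R = meas_var * np.eye(2)                   # measurement noise covariance
    return F, H, Q, R


def kalman_step(x, P, z, F, H, Q, R):
    """One predict step, followed by an update when a measurement z is available."""
    x_pred = F @ x
    P_pred = F @ P @ F.T + Q
    if z is None:  # no detection in this frame: keep the prediction
        return x_pred, P_pred
    innovation = z - H @ x_pred
    S = H @ P_pred @ H.T + R
    K = P_pred @ H.T @ np.linalg.inv(S)        # Kalman gain
    x_new = x_pred + K @ innovation
    P_new = (np.eye(len(x)) - K @ H) @ P_pred
    return x_new, P_new


def track(detections, dt=1.0):
    """Filter a sequence of (x, y) detection centers; None marks a missed frame."""
    F, H, Q, R = make_cv_model(dt)
    first = next(d for d in detections if d is not None)
    x = np.array([first[0], first[1], 0.0, 0.0])  # start at rest at the first detection
    P = np.eye(4) * 10.0                          # large initial uncertainty
    positions = []
    for d in detections:
        z = None if d is None else np.asarray(d, dtype=float)
        x, P = kalman_step(x, P, z, F, H, Q, R)
        positions.append(x[:2].copy())
    return np.array(positions)


if __name__ == "__main__":
    # A target moving at (1.0, 0.5) pixels per frame, with one missed detection.
    dets = [(float(k), 0.5 * k) for k in range(30)]
    dets[10] = None
    print(track(dets)[-1])
```

Because the filter can predict the next position from its motion model, a tracker built this way can bridge frames in which the detector fails or is skipped; whether that is how the reported efficiency gain was obtained is not stated in the abstract.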
