Real world scene analysis in perspective

1975 ◽  
Author(s):  
Bruce L. Bullock


2010 ◽  
Vol 3 (3) ◽  
Author(s):  
Hsueh-Cheng Wang ◽  
Alex D. Hwang ◽  
Marc Pomplun

During text reading, the duration of eye fixations decreases with greater frequency and predictability of the currently fixated word (Rayner, 1998, 2009). However, it has not been tested whether these results also apply to scene viewing. We computed object frequency and predictability from both linguistic data and visual scene analysis (LabelMe; Russell et al., 2008), using Latent Semantic Analysis (Landauer et al., 1998) to estimate predictability. In a scene-viewing experiment, we found that, for small objects, linguistics-based frequency, but not scene-based frequency, affected first fixation duration, gaze duration, and total time. Both linguistic and scene-based predictability affected total time. As in reading, fixation durations decreased with higher frequency and predictability. For large objects, the direction of the effects was the inverse of that found in reading studies. These results suggest that the recognition of small objects in scene viewing shares some characteristics with the recognition of words in reading.
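As a rough illustration of the predictability measure, the sketch below estimates an object's predictability as the Latent Semantic Analysis cosine similarity between the object's label and the labels of its scene context. The toy corpus, the label sets, and the use of a mean context vector are assumptions for illustration, not the authors' actual pipeline.

    # Sketch: object predictability as LSA similarity between an object
    # label and its scene context. Corpus and labels are placeholders.
    import numpy as np
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.decomposition import TruncatedSVD

    corpus = [
        "kitchen counter sink faucet kettle toaster",
        "office desk monitor keyboard mouse lamp",
        "street car bus traffic light pedestrian",
    ]  # stand-in for a large scene-description corpus

    vectorizer = TfidfVectorizer()
    X = vectorizer.fit_transform(corpus)   # term-document matrix
    svd = TruncatedSVD(n_components=2)     # LSA = truncated SVD of that matrix
    svd.fit(X)
    term_vecs = svd.components_.T          # one LSA vector per vocabulary term
    vocab = vectorizer.vocabulary_

    def predictability(obj, context_words):
        """Cosine similarity between an object and its scene context."""
        v = term_vecs[vocab[obj]]
        ctx = np.mean([term_vecs[vocab[w]] for w in context_words], axis=0)
        return float(np.dot(v, ctx) / (np.linalg.norm(v) * np.linalg.norm(ctx)))

    print(predictability("kettle", ["sink", "faucet", "toaster"]))  # typically higher
    print(predictability("kettle", ["car", "bus", "pedestrian"]))   # typically lower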


Robotica ◽  
1992 ◽  
Vol 10 (5) ◽  
pp. 389-396 ◽  
Author(s):  
R. A. Jarvis

SUMMARY This paper argues the case for extracting as complete a set of sensory data as practicable from scenes consisting of complex assemblages of objects, with the goal of completing the task of scene analysis (placement, pose, identity, and relationships among the components) in a robust manner that supports goal-directed robotic action, including collision-free trajectory planning, grip site location, and manipulation of selected object classes. The emphasis of the paper is on the sensor fusion of range and surface colour data, including preliminary results on proximity, surface normal directionality, and colour-based scene segmentation through semantic-free clustering processes. The larger context is that of embedding the results of such analysis in a graphics world containing an articulated robotic manipulator and of carrying out experiments in that world prior to replicating safe manipulation sequences in the real world.
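The semantic-free clustering idea lends itself to a compact sketch: fuse range and colour cues into one feature vector per pixel and segment by clustering. K-means, the feature scaling, and the synthetic inputs below are stand-in assumptions, not the paper's actual clustering processes; surface normal directions could be appended as extra feature columns in the same way.

    # Sketch: semantic-free segmentation by clustering fused range (depth)
    # and colour features per pixel. Inputs are synthetic placeholders.
    import numpy as np
    from sklearn.cluster import KMeans

    H, W = 64, 64
    rng = np.random.default_rng(0)
    depth = rng.random((H, W))        # stand-in range image (proximity cue)
    rgb = rng.random((H, W, 3))       # stand-in surface colour image

    ys, xs = np.mgrid[0:H, 0:W]
    features = np.column_stack([
        xs.ravel() / W,               # normalised image coordinates keep
        ys.ravel() / H,               # clusters spatially coherent
        depth.ravel(),                # proximity
        rgb.reshape(-1, 3),           # colour
    ])

    labels = KMeans(n_clusters=6, n_init=10, random_state=0).fit_predict(features)
    segmentation = labels.reshape(H, W)   # one cluster id per pixel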


2011 ◽  
Vol 17 (2) ◽  
Author(s):  
Jennifer Iverson

Charles’s Ives’s collages, such as “Putnam’s Camp,”The Fourth of July, and selected movements of theFourth Symphony, present listeners with extraordinarily complex sound environments. This article uses Albert Bregman’sAuditory Scene Analysisas a source for methodology to analyze how listeners may parse and organize the chaotic surface of a musical collage. Since scene analysis problems in Ives’s collages often mimic real-world environments, Ives creates music that seems “spatial” or “pictorial” as a result. Finally, the article compares and contrasts the perception of space in Ives’s musical collages with their historical parallel in visual art, Cubist collage.


2021 ◽  
Author(s):  
Daniel Kaiser ◽  
Radoslaw M. Cichy

During natural vision, our brains are constantly exposed to complex, but regularly structured environments. Real-world scenes are defined by typical part-whole relationships, where the meaning of the whole scene emerges from configurations of localized information present in individual parts of the scene. Such typical part-whole relationships suggest that information from individual scene parts is not processed independently, but that there are mutual influences between the parts and the whole during scene analysis. Here, we review recent research that used a straightforward, but effective approach to study such mutual influences: by dissecting scenes into multiple arbitrary pieces, these studies provide new insights into how the processing of whole scenes is shaped by their constituent parts and, conversely, how the processing of individual parts is determined by their role within the whole scene. We highlight three facets of this research: First, we discuss studies demonstrating that the spatial configuration of multiple scene parts has a profound impact on the neural processing of the whole scene. Second, we review work showing that cortical responses to individual scene parts are shaped by the context in which these parts typically appear within the environment. Third, we discuss studies demonstrating that missing scene parts are interpolated from the surrounding scene context. Bridging these findings, we argue that efficient scene processing relies on an active use of the scene's part-whole structure, where the visual brain matches scene inputs with internal models of what the world should look like.
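The dissection approach reduces to a simple image manipulation: cut a scene into equal parts and recombine them in the typical versus a jumbled spatial configuration. The 2x2 grid and the particular permutation in this sketch are illustrative choices, not the stimuli of any specific study reviewed here.

    # Sketch: dissect a scene into a 2x2 grid of parts and reassemble them
    # in the original vs. a jumbled configuration to probe part-whole effects.
    import numpy as np

    def dissect(img, rows=2, cols=2):
        """Split an HxWxC image into a row-major list of equal parts."""
        h, w = img.shape[0] // rows, img.shape[1] // cols
        return [img[r*h:(r+1)*h, c*w:(c+1)*w]
                for r in range(rows) for c in range(cols)]

    def assemble(parts, order, rows=2, cols=2):
        """Reassemble parts in the given order into one image."""
        grid = [parts[i] for i in order]
        return np.vstack([np.hstack(grid[r*cols:(r+1)*cols])
                          for r in range(rows)])

    scene = np.random.default_rng(1).random((128, 128, 3))  # stand-in scene
    parts = dissect(scene)
    intact = assemble(parts, [0, 1, 2, 3])    # typical configuration
    jumbled = assemble(parts, [3, 0, 2, 1])   # disrupted configuration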


Electronics ◽  
2021 ◽  
Vol 10 (20) ◽  
pp. 2527
Author(s):  
Minji Jung ◽  
Heekyung Yang ◽  
Kyungha Min

The advancement and popularity of computer games have made game scene analysis one of the most interesting research topics in the computer vision community. Among the various computer vision techniques, we employ object detection algorithms for the analysis, since they can both recognize and localize objects in a scene. However, applying existing object detection algorithms to game scenes does not guarantee the desired performance, since the algorithms are trained on datasets collected from the real world. To achieve the desired performance on game scenes, we built a dataset of collected game scenes and retrained object detection algorithms that had been pre-trained on real-world datasets. We selected five object detection algorithms, namely YOLOv3, Faster R-CNN, SSD, FPN, and EfficientDet, and eight games from various genres, including first-person shooting, role-playing, sports, and driving. Pascal VOC and MS COCO were employed for the pre-training of the object detection algorithms. We demonstrated the improvement that our strategy yields in two respects: recognition and localization. The improvement in recognition performance was measured using mean average precision (mAP) and the improvement in localization using intersection over union (IoU).
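For reference, the localization measure reduces to a few lines. This sketch assumes boxes given as (x1, y1, x2, y2) corner coordinates, which is an assumption about representation rather than the paper's code.

    # Sketch: intersection over union (IoU) between two axis-aligned boxes,
    # each assumed to be (x1, y1, x2, y2) corner coordinates.
    def iou(box_a, box_b):
        ax1, ay1, ax2, ay2 = box_a
        bx1, by1, bx2, by2 = box_b
        # Overlap rectangle; width/height clamp to zero if boxes are disjoint.
        iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
        ih = max(0.0, min(ay2, by2) - max(ay1, by1))
        inter = iw * ih
        union = ((ax2 - ax1) * (ay2 - ay1)
                 + (bx2 - bx1) * (by2 - by1) - inter)
        return inter / union if union > 0 else 0.0

    print(iou((10, 10, 50, 50), (30, 30, 70, 70)))  # 400/2800 ~= 0.143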



