Visual target-object search using Gabor transform

1999 ◽  
Author(s):  
Dili Zhang ◽  
Isamu Matsuura ◽  
Yoshihiko Nomura


2018 ◽
Vol 71 (6) ◽  
pp. 1457-1468
Author(s):  
Péter Pongrácz ◽  
András Péter ◽  
Ádám Miklósi

A central problem of behavioural studies providing artificial visual stimuli for non-human animals is to determine how subjects perceive and process these stimuli. Especially in the case of videos, it is important to ascertain that animals perceive the actual content of the images and are not just reacting to the motion cues in the presentation. In this study, we set out to investigate how dogs process life-sized videos. We aimed to find out whether dogs perceive the actual content of video images or whether they only react to the videos as a set of dynamic visual elements. For this purpose, dogs were presented with an object search task where a life-sized projected human was hiding a target object. The videos were either normally oriented or displayed upside down, and we analysed dogs’ reactions towards the projector screen after the video presentations, and their performance in the search task. Results indicated that in the case of the normally oriented videos, dogs spontaneously perceived the actual content of the images. However, the ‘Inverted’ videos were first processed as a set of unrelated visual elements, and only after some exposure to these videos did the dogs show signs of perceiving the unusual configuration of the depicted scene. Our most important conclusion was that dogs process the same type of artificial visual stimuli in different ways, depending on the familiarity of the depicted scene, and that the processing mode can change with exposure to unfamiliar stimuli.


2009 ◽  
Vol 101 (4) ◽  
pp. 1699-1704 ◽  
Author(s):  
Jeremiah Y. Cohen ◽  
Richard P. Heitz ◽  
Geoffrey F. Woodman ◽  
Jeffrey D. Schall

Visual search for a target object among distractors often takes longer when more distractors are present. To understand the neural basis of this capacity limitation, we recorded activity from visually responsive neurons in the frontal eye field (FEF) of macaque monkeys searching for a target among distractors defined by form (randomly oriented T or L). To test the hypothesis that the increase in response time with an increasing number of distractors originates in delayed allocation of attention by FEF neurons, we manipulated the number of distractors presented with the search target. When monkeys were presented with more distractors, visual target selection was delayed and neuronal activity was reduced in proportion to the longer response times. These findings indicate that the time taken by FEF neurons to select the target contributes to the variation in visual search efficiency.


Author(s):  
Пилип Олександрович Приставка ◽  
Дмитро Ігорович Гісь ◽  
Артем Валерійович Чирков

2018 ◽  
Vol 43 (1) ◽  
pp. 123-152 ◽  
Author(s):  
Mohsen Kaboli ◽  
Kunpeng Yao ◽  
Di Feng ◽  
Gordon Cheng

2018 ◽  
Author(s):  
Noam Roth ◽  
Nicole C. Rust

Searching for a specific visual object requires our brain to compare the items in view with a remembered representation of the sought target to determine whether a target match is present. This comparison is thought to be implemented, in part, via the combination of top-down modulations reflecting target identity with feed-forward visual representations. However, it remains unclear whether top-down signals are integrated at a single locus within the ventral visual pathway (e.g. V4) or at multiple stages (e.g. both V4 and inferotemporal cortex, IT). To investigate, we recorded neural responses in V4 and IT as rhesus monkeys performed a task that required them to identify when a target object appeared across variation in position, size and background context. We found non-visual, task-specific signals in both V4 and IT. To evaluate whether V4 was the only locus for the integration of top-down signals, we evaluated several feed-forward accounts of processing from V4 to IT, including a model in which IT preferentially sampled from the best V4 units and a model that allowed for nonlinear IT computation. IT task-specific modulation was not accounted for by any of these feed-forward descriptions, suggesting that during object search, top-down signals are integrated directly within IT. NEW & NOTEWORTHY To find specific objects, the brain must integrate top-down, target-specific signals with visual information about objects in view. However, the exact route of this integration in the ventral visual pathway is unclear. In the first study to systematically compare V4 and IT during an invariant object search task, we demonstrate that top-down signals found in IT cannot be described as being inherited from V4, but rather must be integrated directly within IT itself.


Author(s):  
Zhen Zeng ◽  
Adrian Röfer ◽  
Odest Chadwicke Jenkins

We aim for mobile robots to function in a variety of common human environments, which requires them to efficiently search for previously unseen target objects. We can exploit background knowledge about common spatial relations between landmark objects and target objects to narrow down the search space. In this paper, we propose an active visual object search strategy through the introduction of the Semantic Linking Maps (SLiM) model. SLiM simultaneously maintains the belief over a target object's location as well as landmark objects' locations, while accounting for probabilistic inter-object spatial relations. Based on SLiM, we describe a hybrid search strategy that selects the next-best-view pose for searching for the target object based on the maintained belief. We demonstrate the efficiency of our SLiM-based search strategy through comparative experiments in simulated environments. We further demonstrate the real-world applicability of SLiM-based search in scenarios with a Fetch mobile manipulation robot.
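As a rough illustration of the kind of belief-driven, next-best-view loop such a strategy implies, the sketch below maintains a probability map over a discretized grid of candidate target locations, ties it to a landmark through a simple "near" relation, and greedily picks the view covering the most target probability. The grid size, Gaussian relation, candidate-view sampler, and noisy-detector update are illustrative assumptions, not details taken from the SLiM paper.

```python
import numpy as np

# Hypothetical, simplified illustration of a belief-driven next-best-view loop.
# Grid cells index candidate target locations; beliefs are probability maps.
GRID = 20
rng = np.random.default_rng(0)

# Made-up landmark location and a Gaussian "near the landmark" prior on the target.
landmark = (12, 7)
rows, cols = np.indices((GRID, GRID))
dist2 = (rows - landmark[0]) ** 2 + (cols - landmark[1]) ** 2
target_belief = np.exp(-dist2 / (2 * 3.0 ** 2))
target_belief /= target_belief.sum()

def candidate_views(n=8, fov=5):
    """Random candidate view poses, each observing a square region of the grid."""
    corners = rng.integers(0, GRID - fov, size=(n, 2))
    return [(int(r), int(c), fov) for r, c in corners]

def expected_mass(belief, pose):
    """Target probability mass covered by a candidate view."""
    r, c, fov = pose
    return belief[r:r + fov, c:c + fov].sum()

def next_best_view(belief, poses):
    """Greedy criterion: pick the view covering the most target probability."""
    return max(poses, key=lambda p: expected_mass(belief, p))

def update_belief(belief, pose, detected, miss_rate=0.1):
    """Bayesian update of the target belief after a noisy observation."""
    r, c, fov = pose
    likelihood = np.full_like(belief, miss_rate if detected else 1.0)
    likelihood[r:r + fov, c:c + fov] = (1.0 - miss_rate) if detected else miss_rate
    posterior = belief * likelihood
    return posterior / posterior.sum()

# One step of the search loop: choose a view, fail to detect, update the belief.
pose = next_best_view(target_belief, candidate_views())
target_belief = update_belief(target_belief, pose, detected=False)
```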


2019 ◽  
Vol 122 (6) ◽  
pp. 2522-2540 ◽  
Author(s):  
Noam Roth ◽  
Nicole C. Rust

Searching for a specific visual object requires our brain to compare the items in view with a remembered representation of the sought target to determine whether a target match is present. This comparison is thought to be implemented, in part, via the combination of top-down modulations reflecting target identity with feed-forward visual representations. However, it remains unclear whether top-down signals are integrated at a single locus within the ventral visual pathway (e.g., V4) or at multiple stages [e.g., both V4 and inferotemporal cortex (IT)]. To investigate, we recorded neural responses in V4 and IT as rhesus monkeys performed a task that required them to identify when a target object appeared across variation in position, size, and background context. We found nonvisual, task-specific signals in both V4 and IT. To evaluate whether V4 was the only locus for the integration of top-down signals, we evaluated several feed-forward accounts of processing from V4 to IT, including a model in which IT preferentially sampled from the best V4 units and a model that allowed for nonlinear IT computation. IT task-specific modulation was not accounted for by any of these feed-forward descriptions, suggesting that during object search, top-down signals are integrated directly within IT. NEW & NOTEWORTHY To find specific objects, the brain must integrate top-down, target-specific signals with visual information about objects in view. However, the exact route of this integration in the ventral visual pathway is unclear. In the first study to systematically compare V4 and inferotemporal cortex (IT) during an invariant object search task, we demonstrate that top-down signals found in IT cannot be described as being inherited from V4 but rather must be integrated directly within IT itself.
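To make the logic of such a feed-forward "inheritance" test concrete, here is a minimal sketch: fit a weighted sum of V4 responses to an IT unit and ask whether the V4-based prediction reproduces the IT unit's target-match modulation on held-out trials. The simulated data, unit counts, plain least-squares fit, and modulation measure are assumptions for illustration only; they are not the analysis pipeline used in the paper.

```python
import numpy as np
from numpy.linalg import lstsq

# Hypothetical sketch of one feed-forward "inheritance" test: can an IT unit's
# response be predicted as a weighted sum of V4 responses, and does that
# prediction carry the same target-match modulation as the IT unit itself?
rng = np.random.default_rng(1)
n_trials, n_v4 = 400, 60
v4 = rng.normal(size=(n_trials, n_v4))            # simulated V4 population responses
is_match = rng.integers(0, 2, size=n_trials)      # 1 = target match, 0 = distractor

# Simulated IT unit: partly V4-driven, plus an extra match signal absent from V4.
w_true = rng.normal(size=n_v4)
it = v4 @ w_true + 1.5 * is_match + rng.normal(scale=0.5, size=n_trials)

# Fit the feed-forward model on half the trials, evaluate on the other half.
train = np.arange(n_trials) < n_trials // 2
test = ~train
w_hat, *_ = lstsq(v4[train], it[train], rcond=None)
it_pred = v4[test] @ w_hat

def match_modulation(responses, labels):
    """Mean response difference between target-match and distractor trials."""
    return responses[labels == 1].mean() - responses[labels == 0].mean()

observed = match_modulation(it[test], is_match[test])
predicted = match_modulation(it_pred, is_match[test])
print(f"IT match modulation: {observed:.2f}; V4-predicted: {predicted:.2f}")
# A large gap suggests IT task modulation that is not inherited from V4.
```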


Author(s):  
Tan Yu ◽  
Jingjing Meng ◽  
Junsong Yuan

This paper addresses the problem of video-level object instance search, which aims to retrieve the videos in a database that contain a given query object instance. Without prior knowledge about "when" and "where" an object of interest may appear in a video, determining "whether" a video contains the target object is computationally prohibitive, as it requires exhaustively matching the query against all possible spatial-temporal locations in each video where an object may appear. To alleviate the computational and memory cost, we propose the Reconstruction-based Object SEarch (ROSE) method. It encodes the huge corpus of features from all possible spatial-temporal locations in a video into the parameters of a reconstruction model. Since the memory cost of storing the reconstruction model is much less than that of storing the features of all possible spatial-temporal locations in the video, the efficiency of the search is significantly boosted. Comprehensive experiments on three benchmark datasets demonstrate the promising performance of the proposed ROSE method.
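The sketch below illustrates the general reconstruction-based idea under simplifying assumptions (synthetic features and a linear, SVD-based reconstruction basis); it is not the paper's actual ROSE model. Each video's many location features are compressed into a small set of basis parameters, and a query is scored by how well that compact model reconstructs it.

```python
import numpy as np

# Illustrative sketch only: compress a video's many spatial-temporal features
# into a small linear reconstruction basis, then score a query by how well the
# compact model reconstructs it. Dimensions and the SVD basis are assumptions.
rng = np.random.default_rng(2)
feat_dim, n_locations, n_components = 128, 5000, 16

# Features of all candidate spatial-temporal locations in one video (too large
# to keep around for every video at search time).
video_feats = rng.normal(size=(n_locations, feat_dim))

# "Train" the reconstruction model: keep only the top principal directions.
# Storing (n_components x feat_dim) is far cheaper than (n_locations x feat_dim).
mean = video_feats.mean(axis=0)
_, _, vt = np.linalg.svd(video_feats - mean, full_matrices=False)
basis = vt[:n_components]                  # the reconstruction model's parameters

def reconstruction_error(query, basis, mean):
    """How poorly the video's compact model explains the query feature."""
    centered = query - mean
    recon = (centered @ basis.T) @ basis
    return float(np.linalg.norm(centered - recon))

# At search time, rank videos by how well their models reconstruct the query:
# lower error suggests the video is more likely to contain the query object.
query_feat = rng.normal(size=feat_dim)
score = reconstruction_error(query_feat, basis, mean)
```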

