A Computational Model of Saliency Map Read-Out during Visual Search

Author(s):  
Mia Šetić ◽  
Dražen Domijan


2020 ◽  
Author(s):  
Julian Jara-Ettinger ◽  
Paula Rubio-Fernandez

A foundational assumption of human communication is that speakers ought to say as much as necessary, but no more. How speakers determine what is necessary in a given context, however, is unclear. In studies of referential communication, this expectation is often formalized as the idea that speakers should construct reference by selecting the shortest, sufficiently informative, description. Here we propose that reference production is, instead, a process whereby speakers adopt listeners’ perspectives to facilitate their visual search, without concern for utterance length. We show that a computational model of our proposal predicts graded acceptability judgments with quantitative accuracy, systematically outperforming brevity models. Our model also explains crosslinguistic differences in speakers’ propensity to over-specify in different visual contexts. Our findings suggest that reference production is best understood as driven by a cooperative goal to help the listener understand the intended message, rather than by an egocentric effort to minimize utterance length.
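
To make the contrast concrete, the following toy sketch (Python) compares a brevity-based scorer with a listener-search scorer over a hypothetical visual context. The cost function, the verification weights, and the example display are illustrative assumptions, not the authors' published model; they only show how a redundant colour word can reduce a listener's search effort even though it lengthens the description.

# Toy sketch: brevity account vs. listener-search account of reference
# production. The context, cost function and verification weights below are
# assumptions for illustration only.

CONTEXT = [
    {"color": "blue", "shape": "mug"},   # index 0: intended target
    {"color": "blue", "shape": "plate"},
    {"color": "red",  "shape": "bowl"},
    {"color": "red",  "shape": "cup"},
    {"color": "red",  "shape": "vase"},
]
TARGET = 0

# Assumed per-object verification effort: colour is checked quickly ("pops
# out"), whereas verifying an object's category takes longer.
VERIFY_COST = {"color": 0.2, "shape": 1.0}

def is_sufficient(words, context, target):
    """True if the target is the only object matching every word."""
    matches = [i for i, obj in enumerate(context)
               if all(obj[dim] == val for dim, val in words)]
    return matches == [target]

def brevity_score(words, context, target):
    """Brevity account: prefer the shortest sufficient description."""
    return -len(words) if is_sufficient(words, context, target) else float("-inf")

def search_cost(words, context):
    """Listener-search account: cost of verifying each word, in the order it
    is heard, against the objects still compatible with the description."""
    candidates = list(context)
    cost = 0.0
    for dim, val in words:
        cost += VERIFY_COST[dim] * len(candidates)
        candidates = [obj for obj in candidates if obj[dim] == val]
    return cost

# English-like word order: adjective before noun.
minimal = [("shape", "mug")]                           # "the mug"
overspecified = [("color", "blue"), ("shape", "mug")]  # "the blue mug"

for words in (minimal, overspecified):
    print(words,
          "sufficient:", is_sufficient(words, CONTEXT, TARGET),
          "brevity:", brevity_score(words, CONTEXT, TARGET),
          "search cost:", round(search_cost(words, CONTEXT), 1))
# Both descriptions identify the target; the brevity account prefers "mug",
# while the search account favours "blue mug" because colour pre-filters the
# scene. With postnominal adjectives (noun heard first), that benefit shrinks,
# which is one way such an account could yield crosslinguistic differences in
# over-specification.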


Author(s):  
Athanasios Drigas ◽  
Maria Karyotaki

Motivation, affect and cognition are interrelated. However, modeling the control of attentional deployment, and more specifically providing a complete account of the interactions between the dorsal and ventral processing streams, remains a challenge. The interaction between overt and covert attention is particularly important for models of visual search. Further modeling of such interactions can help scrutinize mechanisms such as saccadic suppression, dynamic remapping of the saliency map and inhibition of return, covert pre-selection of targets for overt saccades, and online understanding of complex visual scenes.
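
For readers unfamiliar with these mechanisms, the sketch below (Python/NumPy) illustrates the winner-take-all read-out with inhibition of return used in classic saliency-map models in the spirit of Itti and Koch; the random map and the suppression radius are arbitrary assumptions, not any specific model discussed here.

# Minimal sketch of winner-take-all read-out plus inhibition of return (IOR)
# over a saliency map; map values and suppression radius are arbitrary.
import numpy as np

rng = np.random.default_rng(0)
H, W = 32, 32
saliency = rng.random((H, W))            # stand-in for a computed saliency map
sigma = 3.0                              # spatial extent of inhibition of return

yy, xx = np.mgrid[0:H, 0:W]

def next_fixation(sal):
    """Winner-take-all: the most salient location attracts the next saccade."""
    return np.unravel_index(np.argmax(sal), sal.shape)

def apply_ior(sal, fix, sigma=sigma):
    """Inhibition of return: suppress the neighbourhood of the fixated
    location so attention is released to the next most salient item."""
    y, x = fix
    suppression = np.exp(-((yy - y) ** 2 + (xx - x) ** 2) / (2 * sigma ** 2))
    return sal * (1.0 - suppression)

scanpath = []
for _ in range(5):                       # simulate five attentional shifts
    fix = next_fixation(saliency)
    scanpath.append(fix)
    saliency = apply_ior(saliency, fix)

print("simulated scanpath:", scanpath)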


2013 ◽  
Vol 13 (3) ◽  
pp. 29-29 ◽  
Author(s):  
A. Haji-Abolhassani ◽  
J. J. Clark

2009 ◽  
Vol 102 (6) ◽  
pp. 3481-3491 ◽  
Author(s):  
Koorosh Mirpour ◽  
Fabrice Arcizet ◽  
Wei Song Ong ◽  
James W. Bisley

In everyday life, we efficiently find objects in the world by moving our gaze from one location to another. The efficiency of this process is brought about by ignoring items that are dissimilar to the target and remembering which target-like items have already been examined. We trained two animals on a visual foraging task in which they had to find a reward-loaded target among five task-irrelevant distractors and five potential targets. We found that both animals performed the task efficiently, ignoring the distractors and rarely examining a particular target twice. We recorded the single-unit activity of 54 neurons in the lateral intraparietal area (LIP) while the animals performed the task. The responses of the neurons differentiated between targets and distractors throughout the trial. Further, targets that had already been fixated were marked off by a reduction in activity. This reduction acted like inhibition of return in saliency map models; items that had been fixated would no longer be represented by high enough activity to draw an eye movement. The reduction could also be seen as a correlate of reward expectancy: after a target had been identified as not containing the reward, its activity was reduced. Within a trial, responses to the remaining targets did not increase as they became more likely to yield a result, suggesting that only activity related to an event is updated on a moment-by-moment basis. Together, our data show that all the neural activity required to guide efficient search is present in LIP. Because LIP activity is known to correlate with saccade goal selection, we propose that LIP plays a significant role in the guidance of efficient visual search.
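
The guidance scheme described here can be caricatured in a few lines: items carry a priority value that is high for targets, low for distractors, and is knocked down once an item has been fixated without yielding the reward. The sketch below (Python, with assumed priority values) is only an illustration of that logic, not a model of the recorded LIP data.

# Illustrative priority-based foraging: targets start with high priority,
# distractors with low priority, and unrewarded fixations suppress an item's
# priority so it is not examined again. All numeric values are assumptions.
import random

random.seed(1)

# Five potential targets and five task-irrelevant distractors, as in the task.
items = [{"kind": "target", "priority": 1.0} for _ in range(5)] + \
        [{"kind": "distractor", "priority": 0.2} for _ in range(5)]
random.shuffle(items)
rewarded = random.choice([i for i, it in enumerate(items) if it["kind"] == "target"])

fixations = []
for _ in range(len(items)):
    # Saccade goal selection: fixate the item with the highest remaining priority.
    nxt = max(range(len(items)), key=lambda i: items[i]["priority"])
    fixations.append(nxt)
    if nxt == rewarded:
        break
    # After an unrewarded fixation, that item's activity drops (an IOR-like /
    # reward-expectancy-like reduction), so it will not draw another saccade.
    items[nxt]["priority"] = 0.0

print("items fixated before finding the reward:", fixations)
print("distractors fixated:",
      sum(1 for i in fixations if items[i]["kind"] == "distractor"))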


2019 ◽  
Vol 5 (1) ◽  
Author(s):  
Alejandro Lleras ◽  
Zhiyuan Wang ◽  
Anna Madison ◽  
Simona Buetti

Recently, Wang, Buetti and Lleras (2017) developed an equation to predict search performance in heterogeneous visual search scenes (i.e., multiple types of non-target objects simultaneously present) based on parameters observed when participants perform search in homogeneous scenes (i.e., when all non-target objects are identical to one another). The equation was based on a computational model in which every item in the display is processed with unlimited capacity and independently of the others, with the goal of determining whether the item is likely to be a target or not. The model was tested in two experiments using real-world objects. Here, we extend those findings by testing the predictive power of the equation with simpler objects. Further, we compare the model’s performance under two stimulus arrangements: spatially intermixed displays (items randomly placed around the scene) and spatially segregated displays (identical items presented near each other). This comparison allowed us to isolate and quantify the facilitatory effect of processing displays that contain identical items (homogeneity facilitation), a factor that improves performance in visual search above and beyond target-distractor dissimilarity. The results suggest that homogeneity facilitation effects in search arise from local item-to-item interactions (rather than from rejecting items as “groups”) and that the strength of those interactions may be determined by stimulus complexity, with simpler stimuli producing stronger interactions and thus stronger homogeneity facilitation effects.
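
As a rough illustration of how such an equation could work, the sketch below (Python) combines hypothetical per-distractor-type logarithmic costs, of the kind estimated from homogeneous displays, into a prediction for a heterogeneous display. The functional form and the parameter values are assumptions for illustration only; the published equation and fitted parameters are given in Wang, Buetti and Lleras (2017).

# Hedged sketch: predicting heterogeneous-display search times from
# parameters estimated in homogeneous displays. The logarithmic form and all
# numeric values below are illustrative assumptions.
import math

# Hypothetical log-slopes (ms per log-unit), one per distractor type; larger
# values correspond to distractors that are more similar to the target.
D = {"very_dissimilar": 10.0, "moderate": 25.0, "similar": 45.0}
BASELINE_RT = 450.0   # assumed intercept (ms): target-only response time

def predict_heterogeneous_rt(counts, D=D, baseline=BASELINE_RT):
    """Predict RT for a heterogeneous display by summing each distractor
    type's logarithmic contribution, estimated from homogeneous displays."""
    return baseline + sum(D[kind] * math.log(n + 1) for kind, n in counts.items())

# Example heterogeneous display: 4 dissimilar, 4 moderate and 4 similar lures.
display = {"very_dissimilar": 4, "moderate": 4, "similar": 4}
print(f"predicted RT: {predict_heterogeneous_rt(display):.0f} ms")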


2013 ◽  
Vol 280 (1768) ◽  
pp. 20131729 ◽  
Author(s):  
Kepu Chen ◽  
Bin Zhou ◽  
Shan Chen ◽  
Sheng He ◽  
Wen Zhou

Attention is intrinsic to our perceptual representations of sensory inputs. Best characterized in the visual domain, it is typically depicted as a spotlight moving over a saliency map that topographically encodes the strengths of visual features and feedback modulations over the visual scene. By introducing smells to two well-established attentional paradigms, the dot-probe and the visual-search paradigms, we find that a smell reflexively directs attention to the congruent visual image and facilitates visual search for that image without the mediation of visual imagery. Furthermore, this effect is independent of, and can override, top-down bias. We thus propose that smell quality acts as an object feature whose presence enhances the perceptual saliency of that object, thereby guiding the spotlight of visual attention. Our discoveries provide robust empirical evidence for a multimodal saliency map that weighs not only visual but also olfactory inputs.
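
One way to picture the proposed multimodal saliency map is as a weighted combination of a visual salience channel and an olfactory congruence channel, as in the toy sketch below (Python/NumPy); the object set and channel weights are arbitrary assumptions, not the authors' model.

# Toy multimodal saliency map: an olfactory congruence signal boosts the
# saliency of the odour-congruent object. Weights and values are assumed.
import numpy as np

objects = ["orange", "rose", "cup", "book"]
visual_saliency = np.array([0.5, 0.6, 0.55, 0.5])   # bottom-up visual salience
presented_smell = "orange"

# Olfactory channel: 1 for the object congruent with the ambient smell.
olfactory = np.array([1.0 if name == presented_smell else 0.0 for name in objects])

w_visual, w_olfactory = 1.0, 0.4     # assumed channel weights
multimodal = w_visual * visual_saliency + w_olfactory * olfactory

attended = objects[int(np.argmax(multimodal))]
print(dict(zip(objects, np.round(multimodal, 2))))
print("attention drawn to:", attended)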


2021 ◽  
Vol 11 (2) ◽  
pp. 1-25
Author(s):  
Moritz Spiller ◽  
Ying-Hsang Liu ◽  
Md Zakir Hossain ◽  
Tom Gedeon ◽  
Julia Geissler ◽  
...  

Information visualizations are an efficient means of supporting users in understanding large amounts of complex, interconnected data; user comprehension, however, depends on individual factors such as their cognitive abilities. The research literature provides evidence that user-adaptive information visualizations positively impact users’ performance in visualization tasks. This study attempts to contribute toward the development of a computational model that predicts users’ success in visual search tasks from eye gaze data and thereby drives such user-adaptive systems. State-of-the-art deep learning models for time series classification were trained on sequential eye gaze data obtained from 40 study participants’ interaction with a circular and an organizational graph. The results suggest that such models yield higher accuracy than a baseline classifier and than models previously used for this purpose. In particular, a Multivariate Long Short-Term Memory Fully Convolutional Network shows encouraging performance for use in online user-adaptive systems. Given this finding, such a computational model can infer users’ need for support during interaction with a graph and trigger appropriate interventions in user-adaptive information visualization systems. This facilitates the design of such systems, since further interaction data such as mouse clicks are not required.
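
As a rough sketch of the architecture family named above, the PyTorch code below combines a recurrent branch and a 1D-convolutional branch over multivariate gaze sequences and classifies whether a user needs support. The input channels, layer sizes, and the omission of components such as attention or squeeze-and-excitation blocks are simplifying assumptions, not the study's exact configuration.

# Simplified LSTM + fully-convolutional network (an MLSTM-FCN-style sketch)
# for classifying multivariate eye gaze sequences. Dimensions are assumed.
import torch
import torch.nn as nn

class GazeLSTMFCN(nn.Module):
    def __init__(self, n_features=4, n_classes=2, hidden=64):
        super().__init__()
        # Recurrent branch: models the temporal order of gaze samples.
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        # Convolutional branch: extracts local temporal patterns per channel.
        self.conv = nn.Sequential(
            nn.Conv1d(n_features, 128, kernel_size=8, padding="same"),
            nn.BatchNorm1d(128), nn.ReLU(),
            nn.Conv1d(128, 128, kernel_size=5, padding="same"),
            nn.BatchNorm1d(128), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),          # global pooling over time
        )
        self.classifier = nn.Linear(hidden + 128, n_classes)

    def forward(self, x):                     # x: (batch, time, features)
        _, (h, _) = self.lstm(x)              # final hidden state of the LSTM
        conv_out = self.conv(x.transpose(1, 2)).squeeze(-1)
        return self.classifier(torch.cat([h[-1], conv_out], dim=1))

# Example: a batch of 8 gaze sequences, 500 samples long, with 4 channels
# (e.g., x, y, pupil size, fixation flag -- a hypothetical feature choice).
model = GazeLSTMFCN()
logits = model(torch.randn(8, 500, 4))
print(logits.shape)                           # torch.Size([8, 2])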

