Evidence for the intrinsically nonlinear nature of receptive fields in vision

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Marcelo Bertalmío ◽  
Alex Gomez-Villa ◽  
Adrián Martín ◽  
Javier Vazquez-Corral ◽  
David Kane ◽  
...  

The responses of visual neurons, as well as visual perception phenomena in general, are highly nonlinear functions of the visual input, while most vision models are grounded in the notion of a linear receptive field (RF). The linear RF has a number of inherent problems: it changes with the input, it presupposes a set of basis functions for the visual system, and it conflicts with recent studies on dendritic computations. Here we propose to model the RF in a nonlinear manner, introducing the intrinsically nonlinear receptive field (INRF). Apart from being more physiologically plausible and embodying the efficient representation principle, the INRF has a key property of wide-ranging implications: for several vision science phenomena where a linear RF must vary with the input in order to predict responses, the INRF can remain constant under different stimuli. We also prove that Artificial Neural Networks with INRF modules instead of linear filters have a remarkably improved performance and better emulate basic human perception. Our results suggest a change of paradigm for vision science as well as for artificial intelligence.

2021 ◽  
Author(s):  
Yue Zhang ◽  
Ruoyu Huang ◽  
Wiebke Nörenberg ◽  
Aristides Arrenberg

The perception of optic flow is essential for any visually guided behavior of a moving animal. To mechanistically predict behavior and understand the emergence of self-motion perception in vertebrate brains, it is essential to systematically characterize the motion receptive fields (RFs) of optic flow processing neurons. Here, we present the fine-scale RFs of thousands of motion-sensitive neurons studied in the diencephalon and the midbrain of zebrafish. We found neurons that serve as linear filters and robustly encode directional and speed information of translation-induced optic flow. These neurons are topographically arranged in pretectum according to translation direction. The unambiguous encoding of translation enables the decomposition of translational and rotational self-motion information from mixed optic flow. In behavioral experiments, we successfully demonstrated the predicted decomposition in the optokinetic and optomotor responses. Together, our study reveals the algorithm and the neural implementation for self-motion estimation in a vertebrate visual system.


1997 ◽  
Vol 14 (6) ◽  
pp. 1015-1027 ◽  
Author(s):  
R. C. Reid ◽  
J. D. Victor ◽  
R. M. Shapley

We have used Sutter's (1987) spatiotemporal m-sequence method to map the receptive fields of neurons in the visual system of the cat. The stimulus consisted of a grid of 16 × 16 square regions, each of which was modulated in time by a pseudorandom binary signal, known as an m-sequence. Several strategies for displaying the m-sequence stimulus are presented. The results of the method are illustrated with two examples. For both geniculate neurons and cortical simple cells, the measurement of first-order response properties with the m-sequence method provided a detailed characterization of classical receptive-field structures. First, we measured a spatiotemporal map of both the center and surround of a Y-cell in the lateral geniculate nucleus (LGN). The time course of the center response was biphasic: OFF at short latencies, ON at longer latencies. The surround was also biphasic (ON, then OFF) but somewhat slower. Second, we mapped the response properties of an area 17 directional simple cell. The response dynamics of the ON and OFF subregions varied considerably; the time to peak ranged over more than a factor of two. This spatiotemporal inseparability is related to the cell's directional selectivity (Reid et al., 1987, 1991; McLean & Palmer, 1989; McLean et al., 1994). The detail with which the time course of response can be measured at many different positions is one of the strengths of the m-sequence method.
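The pseudorandom binary signal mentioned above can be illustrated with a short sketch. This is not the authors' stimulus code: it is a minimal example, assuming a 4-bit linear-feedback shift register with taps (4, 3), which is one standard choice that yields a maximal-length sequence of period 15. The function names `m_sequence` and `circular_autocorr` are hypothetical. The sketch also checks the property that makes such sequences convenient for receptive-field mapping: in ±1 form, the circular autocorrelation is flat at every nonzero lag.

```python
def m_sequence(taps, nbits, seed=1):
    """Return one period (2**nbits - 1 values of 0/1) of an LFSR sequence.

    taps are 1-indexed register positions XORed to form the feedback bit.
    The period is maximal only if the taps correspond to a primitive
    polynomial; (4, 3) with nbits=4 is one such choice (an assumption of
    this sketch, not a parameter taken from the abstract).
    """
    state = [(seed >> i) & 1 for i in range(nbits)]
    if not any(state):
        raise ValueError("seed must be nonzero")
    out = []
    for _ in range(2 ** nbits - 1):
        out.append(state[-1])            # emit the last register bit
        feedback = 0
        for t in taps:                   # XOR the tapped positions
            feedback ^= state[t - 1]
        state = [feedback] + state[:-1]  # shift the register by one
    return out


def circular_autocorr(bits, lag):
    """Circular autocorrelation of the sequence mapped from 0/1 to -1/+1."""
    signal = [2 * b - 1 for b in bits]
    n = len(signal)
    return sum(signal[i] * signal[(i + lag) % n] for i in range(n))


seq = m_sequence(taps=(4, 3), nbits=4)
print(seq)
print([circular_autocorr(seq, lag) for lag in range(4)])
```

Because the autocorrelation is the same small value at all nonzero lags, cross-correlating a response with the stimulus isolates the first-order contribution of each time lag with little interference from the others. A real mapping experiment would use a much longer register than this toy example.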


Receptive fields of simple cells in the cat visual cortex have recently been discussed in relation to the ‘theory of communication’ proposed by Gabor (1946). A number of investigators have suggested that the line-weighting functions, as measured orthogonal to the preferred orientation, may be best described as the product of a Gaussian envelope and a sinusoid (i.e. a Gabor function). Following Gabor’s theory of ‘basis’ functions, it has also been suggested that simple cells can be categorized into even- and odd-symmetric categories. Based on the receptive field profiles of 46 simple cells recorded from cat visual cortex, our analysis provides a quantitative description of both the receptive-field envelope and the receptive-field ‘symmetry’ of each of the 46 cells. The results support the notion that, to a first approximation, Gabor functions with three free parameters (envelope width, carrier frequency and carrier phase) provide a good description of the receptive-field profiles. However, our analysis does not support the notion that simple cells generally fit into even- and odd-symmetric categories.
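The three-parameter description above can be written down directly. The following is a minimal sketch, assuming the envelope width is the standard deviation of the Gaussian and the carrier is a cosine; the abstract does not specify either convention, and the function name `gabor` and its parameter names are hypothetical.

```python
import math


def gabor(x, width, frequency, phase):
    """One-dimensional Gabor profile: Gaussian envelope times a sinusoid.

    width     -- standard deviation of the Gaussian envelope (assumed convention)
    frequency -- carrier frequency in cycles per unit of x
    phase     -- carrier phase in radians
    """
    envelope = math.exp(-(x * x) / (2.0 * width * width))
    carrier = math.cos(2.0 * math.pi * frequency * x + phase)
    return envelope * carrier


# Phase 0 gives an even-symmetric profile, phase -pi/2 an odd-symmetric one.
# Intermediate phases give profiles that are neither.
for phase in (0.0, -math.pi / 2, -math.pi / 4):
    left = gabor(-0.3, width=1.0, frequency=0.5, phase=phase)
    right = gabor(0.3, width=1.0, frequency=0.5, phase=phase)
    print(round(left, 4), round(right, 4))
```

With phase as a free parameter, even and odd symmetry are just two points on a continuum, which is the sense in which a fitted phase can test whether cells cluster into the two categories.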


1980 ◽  
Vol 43 (3) ◽  
pp. 595-611 ◽  
Author(s):  
R. W. Rhoades ◽  
L. M. Chalupa

1. Monocular enucleation in infant hamsters results in a marked expansion of the normally very limited ipsilateral retinotectal projection (13). In 34 hamsters subjected to removal of one eye within 12 h of birth, the receptive-field characteristics of superior collicular neurons ipsilateral and contralateral to the remaining eye were investigated quantitatively and compared to those of normal animals. In six additional neonatal enucleates, the density of the expanded retinotectal projection was studied with the autoradiographic method and an attempt was made to relate the anatomical reorganization with the electrophysiological findings. 2. The response characteristics of visual cells in the colliculus contralateral to the remaining eye were not significantly different from those observed in normal animals. In the ipsilateral tectum, however, numerous changes were observed. Visual receptive fields were abnormally large. The incidence of directional selectivity was markedly reduced, as were the magnitudes of the discharges elicited by either flashed or moving stimuli. Fewer cells were activated by small flashed spots and most of the units that were responsive to such stimulation failed to exhibit the surround suppression typical for the majority of tectal neurons in normal hamsters. Most cells in the ipsilateral colliculus responded only to relatively low (less than 50 degrees/s) stimulus velocities and response decrements resulting from repeated stimulation also occurred much more readily for the neurons tested on this side. 3. The results of additional experiments in neonatal enucleates (n = 8), which were also subjected to acute bilateral removal of the visual cortex, demonstrated that such damage resulted in a marked reduction in the incidence of directional selectivity in the colliculus contralateral to the remaining eye but had no effect on the responses of cells innervated by the aberrant ipsilateral pathway. 4. A correlation between the relative density of the ipsilateral retinal projection at different points in the colliculus, as demonstrated by autoradiography, and the nature of the visual responses obtained in different portions of the structure indicated that receptive-field size was negatively correlated with the density of the aberrant retinotectal projection and that absolute responsivity (number of impulses elicited by an optimal stimulus) was positively correlated with autoradiographic grain density. 5. These findings demonstrate that while the aberrant retinocollicular projection can, along with the other visual inputs to the tectum, result in the organization of normal response properties for a small number of tectal neurons, the majority of the visual cells innervated by this pathway have responses that are appreciably different from normal.


2020 ◽  
Vol 34 (07) ◽  
pp. 12821-12828 ◽  
Author(s):  
Lei Zhang ◽  
Zhiqiang Lang ◽  
Peng Wang ◽  
Wei Wei ◽  
Shengcai Liao ◽  
...  

Spectral super-resolution (SSR) aims at generating a hyperspectral image (HSI) from a given RGB image. Recently, a promising direction is to learn a complicated mapping function from the RGB image to the HSI counterpart using a deep convolutional neural network. This essentially involves mapping the RGB context within a size-specific receptive field centered at each pixel to its spectrum in the HSI. The focus thereon is to appropriately determine the receptive field size and establish the mapping function from RGB context to the corresponding spectrum. Due to their differences in category or spatial position, pixels in HSIs often require different-sized receptive fields and distinct mapping functions. However, little effort has been made to exploit this prior explicitly. To address this problem, we propose a pixel-aware deep function-mixture network for SSR, which is composed of a new class of modules, termed function-mixture (FM) blocks. Each FM block is equipped with some basis functions, i.e., parallel subnets of different-sized receptive fields. Besides, it incorporates an extra subnet as a mixing function to generate pixel-wise weights, and then linearly mixes the outputs of all basis functions with those generated weights. This enables us to determine the receptive field size and the mapping function on a per-pixel basis. Moreover, we stack several such FM blocks to further increase the flexibility of the network in learning the pixel-wise mapping. To encourage feature reuse, intermediate features generated by the FM blocks are fused at a late stage, which proves to be effective for boosting the SSR performance. Experimental results on three benchmark HSI datasets demonstrate the superiority of the proposed method.
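The linear mixing step inside an FM block can be sketched in a few lines. This is an illustration of the idea only, not the authors' network: it assumes each basis function yields a single scalar per pixel (a real block would produce feature vectors), it takes the pixel-wise weights as given rather than computing them with a subnet, and the function name `mix_basis_outputs` is hypothetical. The abstract does not say whether the weights are normalized, so no normalization is applied here.

```python
def mix_basis_outputs(basis_outputs, weights):
    """Linearly mix per-pixel outputs of several basis functions.

    basis_outputs[k][p] -- output of basis function k at pixel p
    weights[k][p]       -- mixing weight for basis function k at pixel p
    Returns a list with one mixed value per pixel.
    """
    if len(basis_outputs) != len(weights):
        raise ValueError("need one weight map per basis function")
    n_pixels = len(basis_outputs[0])
    return [
        sum(w[p] * b[p] for b, w in zip(basis_outputs, weights))
        for p in range(n_pixels)
    ]


# Two basis functions (e.g. a small and a large receptive field), two pixels.
basis_outputs = [[1.0, 2.0], [10.0, 20.0]]
# Pixel 0 relies entirely on the first basis; pixel 1 leans on the second.
weights = [[1.0, 0.25], [0.0, 0.75]]
print(mix_basis_outputs(basis_outputs, weights))
```

Because the weights differ from pixel to pixel, each pixel effectively selects its own blend of receptive field sizes, which is the pixel-aware behaviour the abstract describes.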


1993 ◽  
Vol 90 (23) ◽  
pp. 11142-11146 ◽  
Author(s):  
S Bisti ◽  
C Trimarchi

Prenatal unilateral enucleation in mammals causes an extensive anatomical reorganization of visual pathways. The remaining eye innervates the entire extent of visual subcortical and cortical areas. Electrophysiological recordings have shown that the retino-geniculate connections are retinotopically organized and geniculate neurones have normal receptive field properties. In area 17 all neurons respond to stimulation of the remaining eye and retinotopy, orientation columns, and direction selectivity are maintained. The only detectable change is a reduction in receptive field size. Are these changes reflected in the visual behavior? We studied visual performance in cats unilaterally enucleated 3 weeks before birth (gestational age at enucleation, 39-42 days). We tested behaviorally the development of visual acuity and, in the adult, the extension of the visual field and the contrast sensitivity. We found no difference between prenatally monocularly enucleated cats and controls in their ability to orient to targets in different positions of the visual field or in their visual acuity (at any age). The major difference between enucleated and control animals was in contrast sensitivity: prenatally enucleated cats present a loss in sensitivity for gratings of low spatial frequency (below 0.5 cycle per degree) as well as a slight increase in sensitivity at middle frequencies. We conclude that prenatal unilateral enucleation causes a selective change in the spatial performance of the remaining eye. We suggest that this change is the result of a reduction in the number of neurones with large receptive fields, possibly due to a severe impairment of the Y system.


Of the many possible functions of the macaque monkey primary visual cortex (striate cortex, area 17) two are now fairly well understood. First, the incoming information from the lateral geniculate bodies is rearranged so that most cells in the striate cortex respond to specifically oriented line segments, and, second, information originating from the two eyes converges upon single cells. The rearrangement and convergence do not take place immediately, however: in layer IVc, where the bulk of the afferents terminate, virtually all cells have fields with circular symmetry and are strictly monocular, driven from the left eye or from the right, but not both; at subsequent stages, in layers above and below IVc, most cells show orientation specificity, and about half are binocular. In a binocular cell the receptive fields in the two eyes are on corresponding regions in the two retinas and are identical in structure, but one eye is usually more effective than the other in influencing the cell; all shades of ocular dominance are seen. These two functions are strongly reflected in the architecture of the cortex, in that cells with common physiological properties are grouped together in vertically organized systems of columns. In an ocular dominance column all cells respond preferentially to the same eye. By four independent anatomical methods it has been shown that these columns have the form of vertically disposed alternating left-eye and right-eye slabs, which in horizontal section form alternating stripes about 400 μm thick, with occasional bifurcations and blind endings. 
Cells of like orientation specificity are known from physiological recordings to be similarly grouped in much narrower vertical sheet-like aggregations, stacked in orderly sequences so that on traversing the cortex tangentially one normally encounters a succession of small shifts in orientation, clockwise or counterclockwise; a 1 mm traverse is usually accompanied by one or several full rotations through 180°, broken at times by reversals in direction of rotation and occasionally by large abrupt shifts. A full complement of columns, of either type, left-plus-right eye or a complete 180° sequence, is termed a hypercolumn. Columns (and hence hypercolumns) have roughly the same width throughout the binocular part of the cortex. The two independent systems of hypercolumns are engrafted upon the well known topographic representation of the visual field. The receptive fields mapped in a vertical penetration through cortex show a scatter in position roughly equal to the average size of the fields themselves, and the area thus covered, the aggregate receptive field, increases with distance from the fovea. A parallel increase is seen in reciprocal magnification (the number of degrees of visual field corresponding to 1 mm of cortex). Over most or all of the striate cortex a movement of 1-2 mm, traversing several hypercolumns, is accompanied by a movement through the visual field about equal in size to the local aggregate receptive field. Thus any 1-2 mm block of cortex contains roughly the machinery needed to subserve an aggregate receptive field. In the cortex the fall-off in detail with which the visual field is analysed, as one moves out from the foveal area, is accompanied not by a reduction in thickness of layers, as is found in the retina, but by a reduction in the area of cortex (and hence the number of columnar units) devoted to a given amount of visual field: unlike the retina, the striate cortex is virtually uniform morphologically but varies in magnification.
In most respects the above description fits the newborn monkey just as well as the adult, suggesting that area 17 is largely genetically programmed. The ocular dominance columns, however, are not fully developed at birth, since the geniculate terminals belonging to one eye occupy layer IVc throughout its length, segregating out into separate columns only after about the first 6 weeks, whether or not the animal has visual experience. If one eye is sutured closed during this early period the columns belonging to that eye become shrunken and their companions correspondingly expanded. This would seem to be at least in part the result of interference with normal maturation, though sprouting and retraction of axon terminals are not excluded.


2015 ◽  
Vol 114 (6) ◽  
pp. 3076-3096 ◽  
Author(s):  
Ryan M. Peters ◽  
Phillip Staibano ◽  
Daniel Goldreich

The ability to resolve the orientation of edges is crucial to daily tactile and sensorimotor function, yet the means by which edge perception occurs is not well understood. Primate cortical area 3b neurons have diverse receptive field (RF) spatial structures that may participate in edge orientation perception. We evaluated five candidate RF models for macaque area 3b neurons, previously recorded while an oriented bar contacted the monkey's fingertip. We used a Bayesian classifier to assign each neuron a best-fit RF structure. We generated predictions for human performance by implementing an ideal observer that optimally decoded stimulus-evoked spike counts in the model neurons. The ideal observer predicted a saturating reduction in bar orientation discrimination threshold with increasing bar length. We tested 24 humans on an automated, precision-controlled bar orientation discrimination task and observed performance consistent with that predicted. We next queried the ideal observer to discover the RF structure and number of cortical neurons that best matched each participant's performance. Human perception was matched with a median of 24 model neurons firing throughout a 1-s period. The 10 lowest-performing participants were fit with RFs lacking inhibitory sidebands, whereas 12 of the 14 higher-performing participants were fit with RFs containing inhibitory sidebands. Participants whose discrimination improved as bar length increased to 10 mm were fit with longer RFs; those who performed well on the 2-mm bar, with narrower RFs. These results suggest plausible RF features and computational strategies underlying tactile spatial perception and may have implications for perceptual learning.


1995 ◽  
Vol 74 (5) ◽  
pp. 2100-2125 ◽  
Author(s):  
D. M. Snodderly ◽  
M. Gur

1. In alert macaque monkeys, multiunit activity is encountered in an alternating sequence of silent and spontaneously active zones as an electrode is lowered through the striate cortex (V1). 2. Individual neurons that are spontaneously active in the dark usually have a maintained discharge in the light. Because both types of discharge occur in the absence of deliberate stimulation, we call them the "ongoing" activity. The zones with ongoing activity correspond to the cytochrome oxidase (CytOx)-rich geniculorecipient layers 4A, 4C, and 6, whereas the adjacent layers 2/3, 4B, and 5 have little ongoing activity. 3. The widths of receptive field activating regions (ARs) are positively correlated with the cells' ongoing activity. Cells with larger ARs are preferentially located in the CytOx-rich (input) layers, and many are unselective for stimulus orientation. However, approximately 90% of the cells in the silent layers are orientation selective, and they often have small ARs. 4. The laminar distribution of selectivity for orientation and direction of movement in alert animals is consistent with earlier results from anesthetized animals, but the laminar distribution of AR widths differs. In alert macaques, the ARs of direction-selective cells in layer 4B and of orientation-selective cells in layer 5 are among the smallest in V1. 5. Our findings indicate that the input layers of V1 (4A, 4C, and 6) have a diversity of AR widths, including large ones. Cortical processing produces receptive fields in some of the output layers (4B and 5) that are restricted to small ARs with high resolution of spatial position. These results imply potent lateral and/or interlaminar interactions in alert animals in early cortical processing. The diversity of AR widths generated in V1 may contribute to detection of fine detail in the presence of contrasting backgrounds--the early stages of figure-ground discrimination.


1998 ◽  
Vol 80 (6) ◽  
pp. 2882-2892 ◽  
Author(s):  
Christopher I. Moore ◽  
Sacha B. Nelson

Moore, Christopher I. and Sacha B. Nelson. Spatio-temporal subthreshold receptive fields in the vibrissa representation of rat primary somatosensory cortex. J. Neurophysiol. 80: 2882–2892, 1998. Whole cell recordings of synaptic responses evoked by deflection of individual vibrissae were obtained from neurons within adult rat primary somatosensory cortex. To define the spatial and temporal properties of subthreshold receptive fields, the spread, amplitude, latency to onset, rise time to half peak amplitude, and the balance of excitation and inhibition of subthreshold input were quantified. The convergence of information onto single neurons was found to be extensive: inputs were consistently evoked by vibrissae one- and two-away from the vibrissa that evoked the largest response (the “primary vibrissa”). Latency to onset, rise time, and the incidence and strength of inhibitory postsynaptic potentials (IPSPs) varied as a function of position within the receptive field and the strength of evoked excitatory input. Nonprimary vibrissae evoked smaller amplitude subthreshold responses [primary vibrissa, 9.1 ± 0.84 (SE) mV, n = 14; 1-away, 5.1 ± 0.5 mV, n = 38; 2-away, 3.7 ± 0.59 mV, n = 22; 3-away, 1.3 ± 0.70 mV, n = 8] with longer latencies (primary vibrissa, 10.8 ± 0.80 ms; 1-away, 15.0 ± 1.2 ms; 2-away, 15.7 ± 2.0 ms). Rise times were significantly faster for inputs that could evoke action potential responses (suprathreshold, 4.1 ± 1.3 ms, n = 8; subthreshold, 12.4 ± 1.5 ms, n = 61). In a subset of cells, sensory evoked IPSPs were examined by deflecting vibrissae during injection of hyperpolarizing and depolarizing current. The strongest IPSPs were evoked by the primary vibrissa (n = 5/5), but smaller IPSPs also were evoked by nonprimary vibrissae (n = 8/13). Inhibition peaked by 10–20 ms after the onset of the fastest excitatory input to the cortex. This pattern of inhibitory activity led to a functional reversal of the center of the receptive field and to suppression of later-arriving and slower-rising nonprimary inputs. Together, these data demonstrate that subthreshold receptive fields are on average large, and the spatio-temporal dynamics of these receptive fields vary as a function of position within the receptive field and strength of excitatory input. These findings constrain models of suprathreshold receptive field generation, multivibrissa interactions, and cortical plasticity.

