scholarly journals Optimized but not maximized cue integration for 3D visual perception

2019 ◽  
Author(s):  
Ting-Yu Chang ◽  
Byounghoon Kim ◽  
Lowell Thompson ◽  
Adhira Sunkara ◽  
Raymond Doudlah ◽  
...  

AbstractReconstructing three-dimensional (3D) scenes from two-dimensional (2D) retinal images is an ill-posed problem. Despite this, our 3D perception of the world based on 2D retinal images is seemingly accurate and precise. The integration of distinct visual cues is essential for robust 3D perception in humans, but it is unclear if this mechanism is conserved in non-human primates, and how the underlying neural architecture constrains 3D perception. Here we assess 3D perception in macaque monkeys using a surface orientation discrimination task. We find that perception is generally accurate, but precision depends on the spatial pose of the surface and available cues. The results indicate that robust perception is achieved by dynamically reweighting the integration of stereoscopic and perspective cues according to their pose-dependent reliabilities. They further suggest that 3D perception is influenced by a prior for the 3D orientation statistics of natural scenes. We compare the data to simulations based on the responses of 3D orientation selective neurons. The results are explained by a model in which two independent neuronal populations representing stereoscopic and perspective cues (with perspective signals from the two eyes combined using nonlinear canonical computations) are optimally integrated through linear summation. Perception of combined-cue stimuli is optimal given this architecture. However, an alternative architecture in which stereoscopic cues and perspective cues detected by each eye are represented by three independent populations yields two times greater precision than observed. This implies that, due to canonical computations, cue integration for 3D perception is optimized but not maximized.Author summaryOur eyes only sense two-dimensional projections of the world (like a movie on a screen), yet we perceive the world in three dimensions. To create reliable 3D percepts, the human visual system integrates distinct visual signals according to their reliabilities, which depend on conditions such as how far away an object is located and how it is oriented. Here we find that non-human primates similarly integrate different 3D visual signals, and that their perception is influenced by the 3D orientation statistics of natural scenes. Cue integration is thus a conserved mechanism for creating robust 3D percepts by the primate brain. Using simulations of neural population activity, based on neuronal recordings from the same animals, we show that some computations which occur widely in the brain facilitate 3D perception, while others hinder perception. This work addresses key questions about how neural systems solve the difficult problem of generating 3D percepts, identifies a plausible neural architecture for implementing robust 3D vision, and reveals how neural computation can simultaneously optimize and curb perception.


2020 ◽  
Vol 2020 (10) ◽  
Author(s):  
Jin Chen ◽  
Chao-Hsiang Sheu ◽  
Mikhail Shifman ◽  
Gianni Tallarita ◽  
Alexei Yung

Abstract We study two-dimensional weighted $$ \mathcal{N} $$ N = (2) supersymmetric ℂℙ models with the goal of exploring their infrared (IR) limit. 𝕎ℂℙ(N,$$ \tilde{N} $$ N ˜ ) are simplified versions of world-sheet theories on non-Abelian strings in four-dimensional $$ \mathcal{N} $$ N = 2 QCD. In the gauged linear sigma model (GLSM) formulation, 𝕎ℂℙ(N,$$ \tilde{N} $$ N ˜ ) has N charges +1 and $$ \tilde{N} $$ N ˜ charges −1 fields. As well-known, at $$ \tilde{N} $$ N ˜ = N this GLSM is conformal. Its target space is believed to be a non-compact Calabi-Yau manifold. We mostly focus on the N = 2 case, then the Calabi-Yau space is a conifold. On the other hand, in the non-linear sigma model (NLSM) formulation the model has ultra-violet logarithms and does not look conformal. Moreover, its metric is not Ricci-flat. We address this puzzle by studying the renormalization group (RG) flow of the model. We show that the metric of NLSM becomes Ricci-flat in the IR. Moreover, it tends to the known metric of the resolved conifold. We also study a close relative of the 𝕎ℂℙ model — the so called zn model — which in actuality represents the world sheet theory on a non-Abelian semilocal string and show that this zn model has similar RG properties.



Author(s):  
Skye Lee Pazuchanics ◽  
Douglas J. Gillan

Virtual depth displays depend on static, monocular cues. Models of integrating monocular cues may be continuous (additive) or discontinuous. Previous research using simple displays and a small number of cues supported continuous cue integration. The present research is designed to expand the understanding of how the visual system integrates information from multiple pictorial cues by investigating combinations of one to ten pictorial cues in visually-rich, two-dimensional displays (paintings and photographs). Participants estimated depth in target paintings and photographs relative to a standard two dimensional display. Certain results suggest that the visual system integrates cues in a largely additive way, but after a number of cues are present there may be an additional boost in perceived depth resulting in a best-fittingdiscontinuous model of cue combination. However, this discontinuous effect may be due to designdecisions made by the painters rather than exclusively to the perceptual processes of the viewers. Analyses of these design decisions provide lessons for the design of two-dimensional displays.



Author(s):  
José Lages ◽  
Justin Loye ◽  
Célestin Coquidé ◽  
Guillaume Rollin

The worldwide football transfer market is analyzed as a directed complex network: the football clubs are the network nodes and the directed edges are weighted by the total amount of money transferred from a club to another. The Google matrix description allows to treat every club independently of their richness and allows to measure for a given club the efficiency of player sales and player acquisitions. The PageRank algorithm, developed initially for the World Wide Web, naturally characterizes the ability of a club to import players. The CheiRank algorithm, also developed to analyze large scale directed complex networks, characterizes the ability of a club to export players. The analysis in the two-dimensional PageRank-CheiRank plan permits to determine the transfer balance of the clubs in a more subtle manner than the traditional import-export scheme. We investigate the 2017-2018 mercato concerning 2296 clubs, 6698 player transfers, and 147 player nationalities. The transfer balance is determined globally for different types of player trades (defender, midfielder, forward, …) and for different national football leagues. Although, on average, the network transfer flows from and to clubs are balanced, the discrimination by player type draws a specific portrait of each football club.



1997 ◽  
Vol 18 (15) ◽  
pp. 2755-2758 ◽  
Author(s):  
Christine Hoogland ◽  
Vincent Baujard ◽  
Jean-Charles Sanchez ◽  
Denis F. Hochstrasser ◽  
Ron D. Appel


2022 ◽  
Vol 13 (1) ◽  
pp. 1-20
Author(s):  
Shui-Hua Wang ◽  
Xin Zhang ◽  
Yu-Dong Zhang

( Aim ) COVID-19 has caused more than 2.28 million deaths till 4/Feb/2021 while it is still spreading across the world. This study proposed a novel artificial intelligence model to diagnose COVID-19 based on chest CT images. ( Methods ) First, the two-dimensional fractional Fourier entropy was used to extract features. Second, a custom deep stacked sparse autoencoder (DSSAE) model was created to serve as the classifier. Third, an improved multiple-way data augmentation was proposed to resist overfitting. ( Results ) Our DSSAE model obtains a micro-averaged F1 score of 92.32% in handling a four-class problem (COVID-19, community-acquired pneumonia, secondary pulmonary tuberculosis, and healthy control). ( Conclusion ) Our method outperforms 10 state-of-the-art approaches.



Author(s):  
Shigeru Ikuta ◽  
Masamichi Watanuki ◽  
Shinya Abe

Grid Onput is a set of novel two-dimensional codes comprising extremely small dots. The present authors recently developed software to overlap the dot codes on the user's designed sheet, to create a content to replay audios, to create a standalone application to replay multimedia, and to create an application to replay multimedia on iPad. Simply touching the dot codes with a speaking-pen and/or a dot-code reader enables users to directly access the corresponding digital information; a maximum of four mediums can be easily linked to each dot code icon. In collaboration with schoolteachers all over the world, one of the authors, Shigeru Ikuta, has been creating a variety of original self-made content and conducting various activities at both general and special needs schools. This chapter outlines the recent development of the state-of-the-art Grid Onput dot code technology and presents basic information regarding the creation of original teaching materials using newly developed software and the use at both general and special needs schools.



How to Land ◽  
2018 ◽  
pp. 107-138
Author(s):  
Ann Cooper Albright

This chapter weaves an in-depth discussion of the physical function of releasing our bodies into the support of gravity with an analysis of how that experience can serve as an important stability in our daily lives. It begins by reviewing the crucial distinction between collapsing and yielding in order to demonstrate how the same force that draws us to the ground can also sponsor our action in the world, helping us find a sense of resistance and agency. In addition, gravity can provide a useful counterbalance to the ubiquitous presence of two-dimensional screens in our lives. By allowing us to experience weight, gravity is key to our sense of grounding, linking inhalation with exhalation, sky to earth, as well as the sympathetic and parasympathetic aspects of our autonomic nervous system.



Author(s):  
R. Murugan

The retinal parts segmentation has been recognized as a key component in both ophthalmological and cardiovascular sickness analysis. The parts of retinal pictures, vessels, optic disc, and macula segmentations, will add to the indicative outcome. In any case, the manual segmentation of retinal parts is tedious and dreary work, and it additionally requires proficient aptitudes. This chapter proposes a supervised method to segment blood vessel utilizing deep learning methods. All the more explicitly, the proposed part has connected the completely convolutional network, which is normally used to perform semantic segmentation undertaking with exchange learning. The convolutional neural system has turned out to be an amazing asset for a few computer vision assignments. As of late, restorative picture investigation bunches over the world are rapidly entering this field and applying convolutional neural systems and other deep learning philosophies to a wide assortment of uses, and uncommon outcomes are rising constantly.



1996 ◽  
Vol 17 (8) ◽  
pp. 1386-1392 ◽  
Author(s):  
Klaus-Peter Pleißner ◽  
Stefan Sander ◽  
Helmut Oswald ◽  
Vera Regitz-Zagrosek ◽  
Eckart Fleck


Sign in / Sign up

Export Citation Format

Share Document