scene content
Recently Published Documents

Total documents: 93 (five years: 20)
H-index: 12 (five years: 2)
2022 · Vol 54 (9) · pp. 1-36
Author(s): Xiongkuo Min, Ke Gu, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang, ...

Screen content, which is often computer-generated, has many characteristics distinctly different from conventional camera-captured natural scene content. These differences pose major challenges for content quality assessment, which plays a critical role in ensuring and improving the final user-perceived quality of experience (QoE) in various screen content communication and networking systems. Quality assessment of screen content has attracted much attention recently, primarily because such content has grown explosively with the prevalence of cloud and remote computing applications, and because conventional quality assessment methods cannot handle it effectively. As the most technology-oriented part of QoE modeling, image/video content/media quality assessment has drawn wide attention from researchers, and a large amount of work has been carried out to tackle the problem of screen content quality assessment. This article provides a systematic and timely review of this emerging research field, including (1) background of natural scene vs. screen content quality assessment; (2) characteristics of natural scene vs. screen content; (3) an overview of screen content quality assessment methodologies and measures; (4) relevant benchmarks and a comprehensive evaluation of the state of the art; (5) discussion of generalizations from screen content quality assessment to QoE assessment, and of other techniques beyond QoE assessment; and (6) unresolved challenges and promising future research directions. Throughout this article, we focus on the differences and similarities between screen content and conventional natural scene content. We expect this review to provide readers with an overview of the background, history, recent progress, and future of the emerging screen content quality assessment research field.


Author(s): Weijie Yang, Yueting Hui

Image scene analysis examines image scene content through semantic segmentation, which identifies the categories and positions of different objects in an image. However, the loss of spatial detail information often degrades accuracy, producing rough edges in the output of a fully convolutional network (FCN), inconsistent class labels within target regions, and missing small targets. To address these problems, this paper enlarges the receptive field, performs multi-scale fusion, and reweights different sensitive channels, so as to improve feature discrimination and preserve or restore spatial detail information. The FCN is used as the base model for semantic segmentation, and ASPP, data augmentation, SENet, a decoder, and global pooling are added to the baseline to optimize the model structure and improve the segmentation results, yielding more accurate scene analysis.
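The channel-reweighting step borrowed from SENet can be illustrated with a minimal squeeze-and-excitation sketch. This is an illustrative NumPy version, not the paper's implementation: in practice the `w1`/`w2` weights are learned during training, and the surrounding FCN, ASPP, and decoder stages are omitted here.

```python
import numpy as np

def squeeze_excite(features, w1, b1, w2, b2):
    """Reweight the channels of a (C, H, W) feature map, SENet-style.

    Squeeze: global average pool each channel to one statistic.
    Excite: a two-layer bottleneck (ReLU, then sigmoid) turns those
    statistics into per-channel gates in (0, 1).
    Scale: multiply each channel by its gate, so informative channels
    are emphasized and the rest are suppressed.
    """
    squeezed = features.mean(axis=(1, 2))               # (C,) channel statistics
    hidden = np.maximum(0.0, w1 @ squeezed + b1)        # bottleneck + ReLU
    gates = 1.0 / (1.0 + np.exp(-(w2 @ hidden + b2)))   # sigmoid gates in (0, 1)
    return features * gates[:, None, None]              # rescale each channel
```

Because the gates are bounded in (0, 1), the block can only attenuate channels relative to one another; the network learns which channels to keep near 1.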


2021
Author(s): Nicola C. Anderson, Oliver Jacobs, Walter F. Bischof, Alan Kingstone

It has long been thought that visual perception is represented in sensorimotor processes that unfold over time. One prominent theory predicts that our memory for a scene consists of both the scene content and the motor commands (i.e., eye movements) used to explore that scene. This Scanpath Theory (Noton & Stark, Science 171 (1971) 308-311) has long been contested, with many studies providing evidence both for and against it. That past work, however, has failed to account for the fact that visual perception is embodied within an active system of effectors: people routinely move both their eyes and head to explore visible space. In the present work we tested Scanpath Theory while observers were free to move within a 360-degree VR environment. Their task was to encode and later recognise panoramic scenes within this fully immersive world. During both encoding and recognition, we recorded their eye and head movements using a VR headset equipped with eye and head tracking. Our results reveal that eye and head movement patterns are diagnostic of memory performance, and that scene recognition improves when certain movements that occurred during encoding are repeated. Finally, including head movement measures enhances performance prediction, strengthening the evidence for Scanpath Theory and reinforcing the fact that the head moves in service of the eyes in allocating attention.


2021 · Vol 11 (1)
Author(s): Kevin Allan, Nir Oren, Jacqui Hutchison, Douglas Martin

Abstract: If artificial intelligence (AI) is to help solve individual, societal and global problems, humans should neither underestimate nor overestimate its trustworthiness. Situated in between these two extremes is an ideal 'Goldilocks' zone of credibility. But what will keep trust in this zone? We hypothesise that this role ultimately falls to the social cognition mechanisms which adaptively regulate conformity between humans. This novel hypothesis predicts that human-like functional biases in conformity should occur during interactions with AI. We examined multiple tests of this prediction using a collaborative remembering paradigm, in which participants viewed household scenes for 30 s vs. 2 min, then saw 2-alternative forced-choice decisions about scene content originating from either AI or human sources. We manipulated the credibility of different sources (Experiment 1) and, from a single source, the estimated likelihood (Experiment 2) and objective accuracy (Experiment 3) of specific decisions. As predicted, each manipulation produced functional biases for AI sources mirroring those found for human sources. Participants conformed more to higher-credibility sources, and to higher-likelihood or more objectively accurate decisions, becoming increasingly sensitive to source accuracy when their own capability was reduced. These findings support the hypothesised role of social cognition in regulating AI's influence, raising important implications and new directions for research on human–AI interaction.


2021 · Vol 145 · pp. 8-15
Author(s): Fabio Bellavia, Marco Fanfani, Carlo Colombo, Alessandro Piva

2021 · pp. 1-13
Author(s): Elissa M. Aminoff, Michael J. Tarr

Abstract: Rapid visual perception is often viewed as a bottom-up process. Category-preferred neural regions are often characterized as automatic, default processing mechanisms for visual inputs of their categorical preference. To explore the sensitivity of such regions to top-down information, we examined three scene-preferring brain regions, the occipital place area (OPA), the parahippocampal place area (PPA), and the retrosplenial complex (RSC), and tested whether the processing of outdoor scenes is influenced by the functional contexts in which they are seen. Context was manipulated by presenting real-world landscape images as if viewed through a window or within a picture frame, manipulations that do not affect scene content but do affect one's functional knowledge regarding the scene. This manipulation influenced neural scene processing (as measured by fMRI): the OPA and the PPA exhibited greater neural activity when participants viewed images as if through a window than within a picture frame, whereas the RSC did not show this difference. In a separate behavioral experiment, functional context affected scene memory in predictable directions (boundary extension). Our interpretation is that the window context denotes three-dimensionality, rendering the perceptual experience of viewing landscapes more realistic, whereas the frame context denotes a 2-D image. As such, the more spatially biased scene representations in the OPA and the PPA are influenced by differences in top-down perceptual expectations generated from context, while the more semantically biased scene representations in the RSC are likely to be less affected by top-down signals that carry information about the physical layout of a scene.


Author(s): Oliver van Zwanenberg, Sophie Triantaphillidou, Alexandra Psarrou, Robin B. Jenkin

The Natural Scene derived Spatial Frequency Response (NS-SFR) framework automatically extracts suitable step edges from natural pictorial scenes and processes these edges via the edge-based ISO12233 (e-SFR) algorithm. Previously, a novel methodology was presented to estimate the standard e-SFR from NS-SFR data. This paper implements that method using diverse natural scene image datasets from three characterized camera systems, both linear and non-linear. Quantitative analysis was carried out on the system e-SFR estimates to validate the accuracy of the method. To investigate how scene content and dataset size affect system e-SFR estimates, analysis was conducted on the entire datasets as well as on subsets of various sizes and scene group types. Results demonstrate that system e-SFR estimates correlate strongly with results from test chart inputs, with accuracy comparable to that of the ISO12233. Further work toward improving and fine-tuning the proposed methodology for practical implementation is discussed.
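The edge-based SFR computation underlying the e-SFR algorithm can be sketched in a few lines. This is an illustrative simplification, not the NS-SFR implementation: the full ISO12233 procedure also estimates the edge angle and supersamples the edge profile across rows to defeat aliasing, steps omitted here.

```python
import numpy as np

def sfr_from_edge(esf):
    """Estimate a spatial frequency response from a 1-D edge profile.

    Differentiate the edge spread function (ESF) to obtain the line
    spread function (LSF), apply a window to suppress truncation
    leakage, then take the FFT magnitude normalized to unity at DC.
    """
    lsf = np.diff(esf)                    # ESF -> LSF
    lsf = lsf * np.hamming(lsf.size)      # taper the ends of the LSF
    spectrum = np.abs(np.fft.rfft(lsf))   # one-sided magnitude spectrum
    return spectrum / spectrum[0]         # normalize so SFR(0) = 1
```

For a perfect step edge the LSF is a single impulse, so the resulting SFR is flat at 1.0 across all frequencies; a blurred edge widens the LSF and rolls the SFR off toward high frequencies.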


2020
Author(s): Yaelan Jung, Dirk B. Walther

Abstract: Natural scenes deliver rich sensory information about the world. Decades of research have shown that the scene-selective network in the visual cortex represents various aspects of scenes. It is, however, unknown how such complex scene information is processed beyond the visual cortex, for instance in the prefrontal cortex. It is also unknown how task context impacts scene perception, modulating which scene content is represented in the brain. In this study, we investigate these questions using scene images from four natural scene categories that also depict two global scene properties: temperature (warm or cold) and sound level (noisy or quiet). A group of healthy human subjects of both sexes participated in an fMRI study in which they viewed the scene images under two task conditions: temperature judgment and sound-level judgment. We analyzed how the different scene attributes (scene categories, temperature, and sound-level information) are represented across the brain under these task conditions. Our findings show that global scene properties are represented in the brain, especially in the prefrontal cortex, only when they are task-relevant. Scene categories, however, are represented in both the parahippocampal place area and the prefrontal cortex regardless of task context. These findings suggest that the prefrontal cortex selectively represents scene content according to task demands, but that this task selectivity depends on the type of scene content: task modulates neural representations of global scene properties but not of scene categories.

