visual feature Latest Research Papers

The current era is an information age, and society is turning to the information age. The image processing technology is also widely used in various fields, and the technology of sports action recognition based on image processing technology can also be said to be appropriate. This article uses a spatial visual feature analysis algorithm to implement it. To implement this algorithm, a series of work such as image collection, feature extraction, and action recognition must be completed first and then implemented through texture functions and other related functions. This algorithm can be used to complete the image-based sports action recognition technology at the minimum time cost. This algorithm can help sportsmen better complete training and standardize movements to a certain extent. As for the development of China’s current sports industry structure, it is also steadily improving. The people’s love for sports is getting stronger and stronger, which also makes the development of China’s sports industry still benefit a lot.

Download Full-text

Colonoscopy image classification using self-supervised visual feature learning

Journal of Military Science and Technology ◽

10.54939/1859-1043.j.mst.csce5.2021.3-13 ◽

2021 ◽

pp. 3-13

Author(s):

Nguyen Chi Thanh

Keyword(s):

Image Classification ◽

Detection System ◽

Feature Learning ◽

Training Dataset ◽

Visual Feature ◽

Polyp Detection ◽

The Public ◽

Automatic Feature Extraction ◽

Image Dataset ◽

Novel Method

Colonoscopy image classification is an image classification task that predicts whether colonoscopy images contain polyps or not. It is an important task input for an automatic polyp detection system. Recently, deep neural networks have been widely used for colonoscopy image classification due to the automatic feature extraction with high accuracy. However, training these networks requires a large amount of manually annotated data, which is expensive to acquire and limited by the available resources of endoscopy specialists. We propose a novel method for training colonoscopy image classification networks by using self-supervised visual feature learning to overcome this challenge. We adapt image denoising as a pretext task for self-supervised visual feature learning from unlabeled colonoscopy image dataset, where noise is added to the image for input, and the original image serves as the label. We use an unlabeled colonoscopy image dataset containing 8,500 images collected from the PACS system of Hospital 103 to train the pretext network. The feature exactor of the pretext network trained in a self-supervised way is used for colonoscopy image classification. A small labeled dataset from the public colonoscopy image dataset Kvasir is used to fine-tune the classifier. Our experiments demonstrate that the proposed self-supervised learning method can achieve a high colonoscopy image classification accuracy better than the classifier trained from scratch, especially at a small training dataset. When a dataset with only annotated 200 images is used for training classifiers, the proposed method improves accuracy from 72,16% to 93,15% compared to the baseline classifier.

Download Full-text

Unsupervised Visual Feature Learning Based on Similarity Guidance

Neurocomputing ◽

10.1016/j.neucom.2021.11.102 ◽

2021 ◽

Author(s):

Xiaoqiang Chen ◽

Zhihao Jin ◽

Qicong Wang ◽

Wenming Yang ◽

Qingmin Liao ◽

...

Keyword(s):

Feature Learning ◽

Visual Feature

Download Full-text

Image Analytics: A consolidation of visual feature extraction methods

Journal of Management Analytics ◽

10.1080/23270012.2021.1998801 ◽

2021 ◽

pp. 1-29

Author(s):

Xiaohui Liu ◽

Fei Liu ◽

Yijing Li ◽

Huizhang Shen ◽

Eric T.K. Lim ◽

...

Keyword(s):

Feature Extraction ◽

Extraction Methods ◽

Visual Feature ◽

Visual Feature Extraction

Download Full-text

Radar-Based Localization Using Visual Feature Matching

10.33012/2021.17918 ◽

2021 ◽

Author(s):

Mohamed Elkholy ◽

Mohamed Elsheikh ◽

Naser El-Sheimy

Keyword(s):

Feature Matching ◽

Visual Feature

Download Full-text

The pupil responds spontaneously to perceived numerosity

Nature Communications ◽

10.1038/s41467-021-26261-4 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Elisa Castaldi ◽

Antonella Pomè ◽

Guido Marco Cicchini ◽

David Burr ◽

Paola Binda

Keyword(s):

Pupil Size ◽

Light Response ◽

The Other ◽

Visual Feature ◽

Main Determinant ◽

Pupillary Light

AbstractAlthough luminance is the main determinant of pupil size, the amplitude of the pupillary light response is also modulated by stimulus appearance and attention. Here we ask whether perceived numerosity modulates the pupillary light response. Participants passively observed arrays of black or white dots of matched physical luminance but different physical or illusory numerosity. In half the patterns, pairs of dots were connected by lines to create dumbbell-like shapes, inducing an illusory underestimation of perceived numerosity; in the other half, connectors were either displaced or removed. Constriction to white arrays and dilation to black were stronger for patterns with higher perceived numerosity, either physical or illusory, with the strength of the pupillary light response scaling with the perceived numerosity of the arrays. Our results show that even without an explicit task, numerosity modulates a simple automatic reflex, suggesting that numerosity is a spontaneously encoded visual feature.

Download Full-text

A Deep Multimodal Model for Predicting Affective Responses Evoked by Movies Based on Shot Segmentation

Security and Communication Networks ◽

10.1155/2021/7650483 ◽

2021 ◽

Vol 2021 ◽

pp. 1-12

Author(s):

Chunxiao Wang ◽

Jingjing Zhang ◽

Wei Jiang ◽

Shuang Wang

Keyword(s):

Short Term Memory ◽

Feature Fusion ◽

Pearson Correlation ◽

Feature Representation ◽

Visual Features ◽

Visual Feature ◽

Temporal Attention ◽

Video Content Analysis ◽

Wide Range ◽

Experienced Emotion

Predicting the emotions evoked in a viewer watching movies is an important research element in affective video content analysis over a wide range of applications. Generally, the emotion of the audience is evoked by the combined effect of the audio-visual messages of the movies. Current research has mainly used rough middle- and high-level audio and visual features to predict experienced emotions, but combining semantic information to refine features to improve emotion prediction results is still not well studied. Therefore, on the premise of considering the time structure and semantic units of a movie, this paper proposes a shot-based audio-visual feature representation method and a long short-term memory (LSTM) model incorporating a temporal attention mechanism for experienced emotion prediction. First, the shot-based audio-visual feature representation defines a method for extracting and combining audio and visual features of each shot clip, and the advanced pretraining models in the related audio-visual tasks are used to extract the audio and visual features with different semantic levels. Then, four components are included in the prediction model: a nonlinear multimodal feature fusion layer, a temporal feature capture layer, a temporal attention layer, and a sentiment prediction layer. This paper focuses on experienced emotion prediction and evaluates the proposed method on the extended COGNIMUSE dataset. The method performs significantly better than the state-of-the-art while significantly reducing the number of calculations, with increases in the Pearson correlation coefficient (PCC) from 0.46 to 0.62 for arousal and from 0.18 to 0.34 for valence in experienced emotion.

Download Full-text

Measuring the saliency of an invisible visual feature and its interaction with visible features

Journal of Vision ◽

10.1167/jov.21.9.2930 ◽

2021 ◽

Vol 21 (9) ◽

pp. 2930

Author(s):

Jinyou Zou ◽

Li Zhaoping

Keyword(s):

Visual Feature

Download Full-text

Alignment Method of Combined Perception for Peg-in-Hole Assembly with Deep Reinforcement Learning

Journal of Sensors ◽

10.1155/2021/5073689 ◽

2021 ◽

Vol 2021 ◽

pp. 1-12

Author(s):

Yongzhi Wang ◽

Lei Zhao ◽

Qian Zhang ◽

Ran Zhou ◽

Liping Wu ◽

...

Keyword(s):

Reinforcement Learning ◽

Visual Perception ◽

Simulation Training ◽

Tactile Perception ◽

Visual Feature ◽

Alignment Method ◽

Torque Sensor ◽

Robot System ◽

Contact State ◽

Simulation Results

The method of tactile perception can accurately reflect the contact state by collecting force and torque information, but it is not sensitive to the changes in position and posture between assembly objects. The method of visual perception is very sensitive to changes in pose and posture between assembled objects, but they cannot accurately reflect the contact state, especially since the objects are occluded from each other. The robot will perceive the environment more accurately if visual and tactile perception can be combined. Therefore, this paper proposes the alignment method of combined perception for the peg-in-hole assembly with self-supervised deep reinforcement learning. The agent first observes the environment through visual sensors and then predicts the action of the alignment adjustment based on the visual feature of the contact state. Subsequently, the agent judges the contact state based on the force and torque information collected by the force/torque sensor. And the action of the alignment adjustment is selected according to the contact state and used as a visual prediction label. Whereafter, the network of visual perception performs backpropagation to correct the network weights according to the visual prediction label. Finally, the agent will have learned the alignment skill of combined perception with the increase of iterative training. The robot system is built based on CoppeliaSim for simulation training and testing. The simulation results show that the method of combined perception has higher assembly efficiency than single perception.

Download Full-text

visual feature
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Visual feature synthesis with semantic reconstructor for traditional and generalized zero‐shot object classification

Sports Action Recognition Based on Image Processing Technology and Analysis of the Development of Sports Industry Pattern

Colonoscopy image classification using self-supervised visual feature learning

Unsupervised Visual Feature Learning Based on Similarity Guidance

Image Analytics: A consolidation of visual feature extraction methods

Radar-Based Localization Using Visual Feature Matching

The pupil responds spontaneously to perceived numerosity

A Deep Multimodal Model for Predicting Affective Responses Evoked by Movies Based on Shot Segmentation

Measuring the saliency of an invisible visual feature and its interaction with visible features

Alignment Method of Combined Perception for Peg-in-Hole Assembly with Deep Reinforcement Learning

Export Citation Format

visual featureRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Visual feature synthesis with semantic reconstructor for traditional and generalized zero‐shot object classification

Sports Action Recognition Based on Image Processing Technology and Analysis of the Development of Sports Industry Pattern

Colonoscopy image classification using self-supervised visual feature learning

Unsupervised Visual Feature Learning Based on Similarity Guidance

Image Analytics: A consolidation of visual feature extraction methods

Radar-Based Localization Using Visual Feature Matching

The pupil responds spontaneously to perceived numerosity

A Deep Multimodal Model for Predicting Affective Responses Evoked by Movies Based on Shot Segmentation

Measuring the saliency of an invisible visual feature and its interaction with visible features

Alignment Method of Combined Perception for Peg-in-Hole Assembly with Deep Reinforcement Learning

visual feature
Recently Published Documents