Sex Classification via 2D-Skeletonization

Sex classification is a challenging open problem in computer vision. It is useful from statistics up to people recognition on surveillance video. So far, the best performance can be achieved by using 3D cameras, but this approach requires the use of some especial hardware. Other 2D approaches achieve good results on normal situations but fail when the person wears loose clothing and carries bags or the camera angle changes as they rely on calculating borders, silhouettes, or the energy of the person in the image. This work aims to provide a novel sex classification methodology based on the creation of a virtual skeleton for each individual from 2D images and video; then, the distances between some points of the skeleton are measured and work as input of a sex classifier. This improves the results since clothing, bags, and the camera angle affect little the skeletonization process.

Download Full-text

Automated detection of grade-crossing-trespassing near misses based on computer vision analysis of surveillance video data

Safety Science ◽

10.1016/j.ssci.2017.11.023 ◽

2018 ◽

Vol 110 ◽

pp. 276-285 ◽

Cited By ~ 4

Author(s):

Zhipeng Zhang ◽

Chintan Trivedi ◽

Xiang Liu

Keyword(s):

Computer Vision ◽

Automated Detection ◽

Video Data ◽

Surveillance Video ◽

Near Misses ◽

Grade Crossing

Download Full-text

Detection of Customer Interested Garments in Surveillance Video using Computer Vision

2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT) ◽

10.1109/icccnt49239.2020.9225571 ◽

2020 ◽

Author(s):

Earnest Paul Ijjina ◽

Aniruddha Srinivas Joshi ◽

Goutham Kanahasabai

Keyword(s):

Computer Vision ◽

Surveillance Video

Download Full-text

An Approach to the Creation and Presentation of Reference Gesture Datasets, for the Preservation of Traditional Crafts

Applied Sciences ◽

10.3390/app10207325 ◽

2020 ◽

Vol 10 (20) ◽

pp. 7325

Author(s):

Nikolaos Partarakis ◽

Xenophon Zabulis ◽

Antonis Chatziantoniou ◽

Nikolaos Patsiouras ◽

Ilia Adami

Keyword(s):

Computer Vision ◽

Motion Capture ◽

Human Activities ◽

Wide Spectrum ◽

Semantic Annotation ◽

Human Motion ◽

Digital Data ◽

3D Motion ◽

Pertinent Data ◽

The Creation

A wide spectrum of digital data are becoming available to researchers and industries interested in the recording, documentation, recognition, and reproduction of human activities. In this work, we propose an approach for understanding and articulating human motion recordings into multimodal datasets and VR demonstrations of actions and activities relevant to traditional crafts. To implement the proposed approach, we introduce Animation Studio (AnimIO) that enables visualisation, editing, and semantic annotation of pertinent data. AnimIO is compatible with recordings acquired by Motion Capture (MoCap) and Computer Vision. Using AnimIO, the operator can isolate segments from multiple synchronous recordings and export them in multimodal animation files. AnimIO can be used to isolate motion segments that refer to individual craft actions, as described by practitioners. The proposed approach has been iteratively designed for use by non-experts in the domain of 3D motion digitisation.

Download Full-text

Hotspot: Making computer vision more effective for human video surveillance

Information Visualization ◽

10.1177/1473871616630015 ◽

2016 ◽

Vol 15 (4) ◽

pp. 273-285 ◽

Cited By ~ 1

Author(s):

Cuong Nguyen ◽

Wu-chi Feng ◽

Feng Liu

Keyword(s):

Computer Vision ◽

Video Surveillance ◽

False Alarms ◽

Surveillance Video ◽

Visualization System ◽

Surveillance Videos ◽

Human Capability ◽

Vision Detection ◽

Video Visualization ◽

Attention Switch

Studies have shown that the human capability of monitoring multiple surveillance videos is limited. Computer vision techniques have been developed to detect abnormal events to support human video surveillance; however, their results are often unreliable, thus distracting surveillance operators and making them miss important events. This article presents Hotspot as a surveillance video visualization system that can effectively leverage noisy computer vision techniques to support human video surveillance. Hotspot consists of two views: a designated focus view to summarize videos with detected events and a video-bank view surrounding the focus view to display source surveillance videos. The focus view allows an operator to quickly dismiss false alarms and focus on true alarms. The video-bank view allows for extended human video analysis after an important event is detected. Hotspot further provides visual links to assist quick attention switch from the focus view to the video-bank view. Our experiments show that Hotspot can effectively integrate noisy, automatic computer vision detection results and better support human video surveillance tasks than the baseline video surveillance with no or only basic computer vision support.

Download Full-text

Topic-based Video Analysis

ACM Computing Surveys ◽

10.1145/3459089 ◽

2021 ◽

Vol 54 (6) ◽

pp. 1-34

Author(s):

Ratnabali Pal ◽

Arif Ahmed Sekh ◽

Debi Prosad Dogra ◽

Samarjit Kar ◽

Partha Pratim Roy ◽

...

Keyword(s):

Computer Vision ◽

Video Analysis ◽

Visual Surveillance ◽

Video Data ◽

Camera Motion ◽

Topic Modelling ◽

Surveillance Video ◽

Analysis Computer ◽

Manual Processing ◽

Spatio Temporal

Manual processing of a large volume of video data captured through closed-circuit television is challenging due to various reasons. First, manual analysis is highly time-consuming. Moreover, as surveillance videos are recorded in dynamic conditions such as in the presence of camera motion, varying illumination, or occlusion, conventional supervised learning may not work always. Thus, computer vision-based automatic surveillance scene analysis is carried out in unsupervised ways. Topic modelling is one of the emerging fields used in unsupervised information processing. Topic modelling is used in text analysis, computer vision applications, and other areas involving spatio-temporal data. In this article, we discuss the scope, variations, and applications of topic modelling, particularly focusing on surveillance video analysis. We have provided a methodological survey on existing topic models, their features, underlying representations, characterization, and applications in visual surveillance’s perspective. Important research papers related to topic modelling in visual surveillance have been summarized and critically analyzed in this article.

Download Full-text

A 3D Dynamic Face Recognition Method Based on Computer Vision

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.556-562.5006 ◽

2014 ◽

Vol 556-562 ◽

pp. 5006-5008 ◽

Cited By ~ 1

Author(s):

Bo Xia Zeng ◽

Wen Feng Li

Keyword(s):

Computer Vision ◽

Face Recognition ◽

Traditional Method ◽

Recognition Algorithm ◽

Facial Features ◽

Nonlinear Mapping ◽

Geometric Features ◽

Feature Points ◽

Recognition Method ◽

2D Images

The non-rigid 3D characters recovery technology for 2D images array is affected by background diversity, motion complexity, data losing and noise of feature points, so the recognition and recovery accuracy of facial features deformation is low. Due to the high error in traditional method, the paper puts forward a 3D facial recognition algorithm based on random images array, which converts the 2D features to 3D by nonlinear mapping, and completes the recognition on foundation of 3D geometric features distance. The experimental results show that the method effectively reduces error and improves recognition effects.

Download Full-text

Deep neural networks for analysis of fisheries surveillance video and automated monitoring of fish discards

ICES Journal of Marine Science ◽

10.1093/icesjms/fsz149 ◽

2019 ◽

Vol 77 (4) ◽

pp. 1340-1353 ◽

Cited By ~ 1

Author(s):

Geoff French ◽

Michal Mackiewicz ◽

Mark Fisher ◽

Helen Holah ◽

Rachel Kilburn ◽

...

Keyword(s):

Neural Network ◽

Computer Vision ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Vision System ◽

Surveillance Video ◽

Computer Vision System ◽

Species Classification ◽

Fish Catch ◽

Computer Vision Systems

Abstract We report on the development of a computer vision system that analyses video from CCTV systems installed on fishing trawlers for the purpose of monitoring and quantifying discarded fish catch. Our system is designed to operate in spite of the challenging computer vision problem posed by conditions on-board fishing trawlers. We describe the approaches developed for isolating and segmenting individual fish and for species classification. We present an analysis of the variability of manual species identification performed by expert human observers and contrast the performance of our species classifier against this benchmark. We also quantify the effect of the domain gap on the performance of modern deep neural network-based computer vision systems.

Download Full-text

A novel multi-stream method for violent interaction detection using deep learning

Measurement and Control ◽

10.1177/0020294020902788 ◽

2020 ◽

Vol 53 (5-6) ◽

pp. 796-806

Author(s):

Hongchang Li ◽

Jing Wang ◽

Jianjun Han ◽

Jinmin Zhang ◽

Yushan Yang ◽

...

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Optical Flow ◽

High Probability ◽

Recognition Performance ◽

Surveillance Video ◽

Interaction Detection ◽

Temporal Features ◽

Violent Action ◽

Learning In Computer Vision

Violent interaction detection is a hot topic in computer vision. However, the recent research works on violent interaction detection mainly focus on the traditional hand-craft features, and does not make full use of the research results of deep learning in computer vision. In this paper, we propose a new robust violent interaction detection framework based on multi-stream deep learning in surveillance scene. The proposed approach enhances the recognition performance of violent action in video by fusing three different streams: attention-based spatial RGB stream, temporal stream, and local spatial stream. The attention-based spatial RGB stream learns the spatial attention regions of persons that have high probability to be action region through soft-attention mechanism. The temporal stream employs optical flow as input to extract temporal features. The local spatial stream learns spatial local features using block images as input. Experimental results demonstrate the effectiveness and reliability of the proposed method on three violent interactive datasets: hockey fights, movies, violent interaction. We also verify the proposed method on our own elevator surveillance video dataset and the performance of the proposed method is satisfied.

Download Full-text

Deteksi Area Parkir Mobil Berbasis Marker Menggunakan Moment Invariants dan K-NN

Jurnal Teknik Informatika dan Sistem Informasi ◽

10.28932/jutisi.v5i1.1586 ◽

2019 ◽

Vol 5 (1) ◽

Author(s):

Kristian Adi Nugraha

Keyword(s):

Computer Vision ◽

Moment Invariants ◽

Moment Invariant ◽

Invariant Method ◽

Camera Angle ◽

Spot Detection ◽

Parking Spot ◽

Parking System ◽

Camera Position

Smart car parking system has become very important these days because it is very helpful and saves a lot of time to find a parking spot. However, the installation of that system is too expensive, then not everyone can afford it. Affordable solution for car parking spot detection can be implemented using installed camera or CCTV combined with computer vision method. New problem appears if the camera position is limited like indoor building causing one marker shape can be captured in different shape due to the different camera angle. In this work, I do a research to detecting marker using moment invariant method. Even the shape of marker changed a little, moment invariant method can still recognize that shape with overall accuracy 91.94%.

Download Full-text

Development of an automated system for preparation of optical components for the production of optical transivers

Transaction of Scientific Papers of the Novosibirsk State Technical University ◽

10.17212/2307-6879-2021-1-42-52 ◽

2021 ◽

pp. 42-52

Author(s):

Aleksandr Kashtanov ◽

◽

Mikhail Pazhetnov ◽

Evgeny Koptev ◽

◽

...

Keyword(s):

Computer Vision ◽

Human Factor ◽

Robotic System ◽

Robotic Systems ◽

Automated System ◽

Vision Systems ◽

Optical Components ◽

Physical Sensors ◽

Computer Vision Systems ◽

The Creation

In the process of creating modern production, it is necessary to reduce the marriage associated with the human factor to zero, by reducing the number of operations carried out by people, leveling the mistakes that a person can make. So one of the solutions to this problem is the creation of robotic systems in which all operations are performed without any operator intervention. To control the execution of operations by a robotic system, in some cases, it is possible to use various sensors and sensors, but sometimes it turns out to be too expensive and difficult, therefore, the authors demonstrated the implementation of control over automation implemented using computer vision systems. Also, in the work, an analogy is drawn between computer vision and physical sensors, using the example of solving the problem that the authors faced.

Download Full-text