scholarly journals Sex Classification via 2D-Skeletonization

2020 ◽  
Vol 2020 ◽  
pp. 1-12
Author(s):  
Miguel Contreras-Murillo ◽  
Sergio G. de-los-Cobos-Silva ◽  
Pedro Lara-Velázquez ◽  
Eric A. Rincón-García ◽  
Román A. Mora-Gutiérrez ◽  
...  

Sex classification is a challenging open problem in computer vision. It is useful from statistics up to people recognition on surveillance video. So far, the best performance can be achieved by using 3D cameras, but this approach requires the use of some especial hardware. Other 2D approaches achieve good results on normal situations but fail when the person wears loose clothing and carries bags or the camera angle changes as they rely on calculating borders, silhouettes, or the energy of the person in the image. This work aims to provide a novel sex classification methodology based on the creation of a virtual skeleton for each individual from 2D images and video; then, the distances between some points of the skeleton are measured and work as input of a sex classifier. This improves the results since clothing, bags, and the camera angle affect little the skeletonization process.

2020 ◽  
Vol 10 (20) ◽  
pp. 7325
Author(s):  
Nikolaos Partarakis ◽  
Xenophon Zabulis ◽  
Antonis Chatziantoniou ◽  
Nikolaos Patsiouras ◽  
Ilia Adami

A wide spectrum of digital data are becoming available to researchers and industries interested in the recording, documentation, recognition, and reproduction of human activities. In this work, we propose an approach for understanding and articulating human motion recordings into multimodal datasets and VR demonstrations of actions and activities relevant to traditional crafts. To implement the proposed approach, we introduce Animation Studio (AnimIO) that enables visualisation, editing, and semantic annotation of pertinent data. AnimIO is compatible with recordings acquired by Motion Capture (MoCap) and Computer Vision. Using AnimIO, the operator can isolate segments from multiple synchronous recordings and export them in multimodal animation files. AnimIO can be used to isolate motion segments that refer to individual craft actions, as described by practitioners. The proposed approach has been iteratively designed for use by non-experts in the domain of 3D motion digitisation.


2016 ◽  
Vol 15 (4) ◽  
pp. 273-285 ◽  
Author(s):  
Cuong Nguyen ◽  
Wu-chi Feng ◽  
Feng Liu

Studies have shown that the human capability of monitoring multiple surveillance videos is limited. Computer vision techniques have been developed to detect abnormal events to support human video surveillance; however, their results are often unreliable, thus distracting surveillance operators and making them miss important events. This article presents Hotspot as a surveillance video visualization system that can effectively leverage noisy computer vision techniques to support human video surveillance. Hotspot consists of two views: a designated focus view to summarize videos with detected events and a video-bank view surrounding the focus view to display source surveillance videos. The focus view allows an operator to quickly dismiss false alarms and focus on true alarms. The video-bank view allows for extended human video analysis after an important event is detected. Hotspot further provides visual links to assist quick attention switch from the focus view to the video-bank view. Our experiments show that Hotspot can effectively integrate noisy, automatic computer vision detection results and better support human video surveillance tasks than the baseline video surveillance with no or only basic computer vision support.


2021 ◽  
Vol 54 (6) ◽  
pp. 1-34
Author(s):  
Ratnabali Pal ◽  
Arif Ahmed Sekh ◽  
Debi Prosad Dogra ◽  
Samarjit Kar ◽  
Partha Pratim Roy ◽  
...  

Manual processing of a large volume of video data captured through closed-circuit television is challenging due to various reasons. First, manual analysis is highly time-consuming. Moreover, as surveillance videos are recorded in dynamic conditions such as in the presence of camera motion, varying illumination, or occlusion, conventional supervised learning may not work always. Thus, computer vision-based automatic surveillance scene analysis is carried out in unsupervised ways. Topic modelling is one of the emerging fields used in unsupervised information processing. Topic modelling is used in text analysis, computer vision applications, and other areas involving spatio-temporal data. In this article, we discuss the scope, variations, and applications of topic modelling, particularly focusing on surveillance video analysis. We have provided a methodological survey on existing topic models, their features, underlying representations, characterization, and applications in visual surveillance’s perspective. Important research papers related to topic modelling in visual surveillance have been summarized and critically analyzed in this article.


2014 ◽  
Vol 556-562 ◽  
pp. 5006-5008 ◽  
Author(s):  
Bo Xia Zeng ◽  
Wen Feng Li

The non-rigid 3D characters recovery technology for 2D images array is affected by background diversity, motion complexity, data losing and noise of feature points, so the recognition and recovery accuracy of facial features deformation is low. Due to the high error in traditional method, the paper puts forward a 3D facial recognition algorithm based on random images array, which converts the 2D features to 3D by nonlinear mapping, and completes the recognition on foundation of 3D geometric features distance. The experimental results show that the method effectively reduces error and improves recognition effects.


2019 ◽  
Vol 77 (4) ◽  
pp. 1340-1353 ◽  
Author(s):  
Geoff French ◽  
Michal Mackiewicz ◽  
Mark Fisher ◽  
Helen Holah ◽  
Rachel Kilburn ◽  
...  

Abstract We report on the development of a computer vision system that analyses video from CCTV systems installed on fishing trawlers for the purpose of monitoring and quantifying discarded fish catch. Our system is designed to operate in spite of the challenging computer vision problem posed by conditions on-board fishing trawlers. We describe the approaches developed for isolating and segmenting individual fish and for species classification. We present an analysis of the variability of manual species identification performed by expert human observers and contrast the performance of our species classifier against this benchmark. We also quantify the effect of the domain gap on the performance of modern deep neural network-based computer vision systems.


2020 ◽  
Vol 53 (5-6) ◽  
pp. 796-806
Author(s):  
Hongchang Li ◽  
Jing Wang ◽  
Jianjun Han ◽  
Jinmin Zhang ◽  
Yushan Yang ◽  
...  

Violent interaction detection is a hot topic in computer vision. However, the recent research works on violent interaction detection mainly focus on the traditional hand-craft features, and does not make full use of the research results of deep learning in computer vision. In this paper, we propose a new robust violent interaction detection framework based on multi-stream deep learning in surveillance scene. The proposed approach enhances the recognition performance of violent action in video by fusing three different streams: attention-based spatial RGB stream, temporal stream, and local spatial stream. The attention-based spatial RGB stream learns the spatial attention regions of persons that have high probability to be action region through soft-attention mechanism. The temporal stream employs optical flow as input to extract temporal features. The local spatial stream learns spatial local features using block images as input. Experimental results demonstrate the effectiveness and reliability of the proposed method on three violent interactive datasets: hockey fights, movies, violent interaction. We also verify the proposed method on our own elevator surveillance video dataset and the performance of the proposed method is satisfied.


Author(s):  
Kristian Adi Nugraha

Smart car parking system has become very important these days because it is very helpful and saves a lot of time to find a parking spot. However, the installation of that system is too expensive, then not everyone can afford it. Affordable solution for car parking spot detection can be implemented using installed camera or CCTV combined with computer vision method. New problem appears if the camera position is limited like indoor building causing one marker shape can be captured in different shape due to the different camera angle. In this work, I do a research to detecting marker using moment invariant method. Even the shape of marker changed a little, moment invariant method can still recognize that shape with overall accuracy 91.94%.


Author(s):  
Aleksandr Kashtanov ◽  
◽  
Mikhail Pazhetnov ◽  
Evgeny Koptev ◽  
◽  
...  

In the process of creating modern production, it is necessary to reduce the marriage associated with the human factor to zero, by reducing the number of operations carried out by people, leveling the mistakes that a person can make. So one of the solutions to this problem is the creation of robotic systems in which all operations are performed without any operator intervention. To control the execution of operations by a robotic system, in some cases, it is possible to use various sensors and sensors, but sometimes it turns out to be too expensive and difficult, therefore, the authors demonstrated the implementation of control over automation implemented using computer vision systems. Also, in the work, an analogy is drawn between computer vision and physical sensors, using the example of solving the problem that the authors faced.


Sign in / Sign up

Export Citation Format

Share Document