Syntactic model-based human body 3D reconstruction and event classification via association based features mining and deep learning

The study of human posture analysis and gait event detection from various types of inputs is a key contribution to the human life log. With the help of this research and technologies humans can save costs in terms of time and utility resources. In this paper we present a robust approach to human posture analysis and gait event detection from complex video-based data. For this, initially posture information, landmark information are extracted, and human 2D skeleton mesh are extracted, using this information set we reconstruct the human 2D to 3D model. Contextual features, namely, degrees of freedom over detected body parts, joint angle information, periodic and non-periodic motion, and human motion direction flow, are extracted. For features mining, we applied the rule-based features mining technique and, for gait event detection and classification, the deep learning-based CNN technique is applied over the mpii-video pose, the COCO, and the pose track datasets. For the mpii-video pose dataset, we achieved a human landmark detection mean accuracy of 87.09% and a gait event recognition mean accuracy of 90.90%. For the COCO dataset, we achieved a human landmark detection mean accuracy of 87.36% and a gait event recognition mean accuracy of 89.09%. For the pose track dataset, we achieved a human landmark detection mean accuracy of 87.72% and a gait event recognition mean accuracy of 88.18%. The proposed system performance shows a significant improvement compared to existing state-of-the-art frameworks.

Download Full-text

Saliency detection in deep learning era: trends of development

Information and Control Systems ◽

10.31799/1684-8853-2019-3-10-36 ◽

2019 ◽

pp. 10-36 ◽

Cited By ~ 2

Author(s):

M. N. Favorskaya ◽

L. C. Jain

Keyword(s):

Deep Learning ◽

Object Detection ◽

Event Detection ◽

Visual Analysis ◽

Saliency Detection ◽

Salient Object Detection ◽

Public Image ◽

Detection Methods ◽

Salient Object ◽

Salient Event

Introduction:Saliency detection is a fundamental task of computer vision. Its ultimate aim is to localize the objects of interest that grab human visual attention with respect to the rest of the image. A great variety of saliency models based on different approaches was developed since 1990s. In recent years, the saliency detection has become one of actively studied topic in the theory of Convolutional Neural Network (CNN). Many original decisions using CNNs were proposed for salient object detection and, even, event detection.Purpose:A detailed survey of saliency detection methods in deep learning era allows to understand the current possibilities of CNN approach for visual analysis conducted by the human eyes’ tracking and digital image processing.Results:A survey reflects the recent advances in saliency detection using CNNs. Different models available in literature, such as static and dynamic 2D CNNs for salient object detection and 3D CNNs for salient event detection are discussed in the chronological order. It is worth noting that automatic salient event detection in durable videos became possible using the recently appeared 3D CNN combining with 2D CNN for salient audio detection. Also in this article, we have presented a short description of public image and video datasets with annotated salient objects or events, as well as the often used metrics for the results’ evaluation.Practical relevance:This survey is considered as a contribution in the study of rapidly developed deep learning methods with respect to the saliency detection in the images and videos.

Download Full-text

A Deep Learning-based Approach for Human Posture Classification

Proceedings of the 2020 2nd International Conference on Management Science and Industrial Engineering ◽

10.1145/3396743.3396763 ◽

2020 ◽

Author(s):

Jui-Sheng Hung ◽

Pin-Ling Liu ◽

Chien-Chi Chang

Keyword(s):

Deep Learning ◽

Human Posture ◽

Posture Classification

Download Full-text

Olympic Games Event Recognition via Transfer Learning with Photobombing Guided Data Augmentation

Journal of Imaging ◽

10.3390/jimaging7020012 ◽

2021 ◽

Vol 7 (2) ◽

pp. 12

Author(s):

Yousef I. Mohamad ◽

Samah S. Baraheem ◽

Tam V. Nguyen

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Data Augmentation ◽

Olympic Games ◽

Event Recognition ◽

Surveillance Systems ◽

Video Captioning ◽

Practical Applications ◽

Sport Events ◽

The Olympic Games

Automatic event recognition in sports photos is both an interesting and valuable research topic in the field of computer vision and deep learning. With the rapid increase and the explosive spread of data, which is being captured momentarily, the need for fast and precise access to the right information has become a challenging task with considerable importance for multiple practical applications, i.e., sports image and video search, sport data analysis, healthcare monitoring applications, monitoring and surveillance systems for indoor and outdoor activities, and video captioning. In this paper, we evaluate different deep learning models in recognizing and interpreting the sport events in the Olympic Games. To this end, we collect a dataset dubbed Olympic Games Event Image Dataset (OGED) including 10 different sport events scheduled for the Olympic Games Tokyo 2020. Then, the transfer learning is applied on three popular deep convolutional neural network architectures, namely, AlexNet, VGG-16 and ResNet-50 along with various data augmentation methods. Extensive experiments show that ResNet-50 with the proposed photobombing guided data augmentation achieves 90% in terms of accuracy.

Download Full-text

On unifying deep learning and edge computing for human motion analysis in exergames development

Neural Computing and Applications ◽

10.1007/s00521-021-06181-6 ◽

2021 ◽

Author(s):

Antonis Pardos ◽

Andreas Menychtas ◽

Ilias Maglogiannis

Keyword(s):

Deep Learning ◽

Motion Analysis ◽

Human Motion ◽

Edge Computing ◽

Human Motion Analysis

Download Full-text

Optical Fiber Distributed Vibration Sensing Using Grayscale Image and Multi-Class Deep Learning Framework for Multi-Event Recognition

IEEE Sensors Journal ◽

10.1109/jsen.2021.3089004 ◽

2021 ◽

pp. 1-1

Author(s):

Zhenshi Sun ◽

Kun Liu ◽

Junfeng Jiang ◽

Tianhua Xu ◽

Shuang Wang ◽

...

Keyword(s):

Deep Learning ◽

Optical Fiber ◽

Event Recognition ◽

Grayscale Image ◽

Learning Framework ◽

Vibration Sensing

Download Full-text

Deep learning for cephalometric landmark detection: systematic review and meta-analysis

Clinical Oral Investigations ◽

10.1007/s00784-021-03990-w ◽

2021 ◽

Author(s):

Falk Schwendicke ◽

Akhilanand Chaurasia ◽

Lubaina Arsiwala ◽

Jae-Hong Lee ◽

Karim Elhennawy ◽

...

Keyword(s):

Systematic Review ◽

Deep Learning ◽

Meta Analysis ◽

High Accuracy ◽

Risk Of Bias ◽

Automated Detection ◽

Reference Test ◽

Landmark Detection ◽

Future Studies ◽

Using Data

Abstract Objectives Deep learning (DL) has been increasingly employed for automated landmark detection, e.g., for cephalometric purposes. We performed a systematic review and meta-analysis to assess the accuracy and underlying evidence for DL for cephalometric landmark detection on 2-D and 3-D radiographs. Methods Diagnostic accuracy studies published in 2015-2020 in Medline/Embase/IEEE/arXiv and employing DL for cephalometric landmark detection were identified and extracted by two independent reviewers. Random-effects meta-analysis, subgroup, and meta-regression were performed, and study quality was assessed using QUADAS-2. The review was registered (PROSPERO no. 227498). Data From 321 identified records, 19 studies (published 2017–2020), all employing convolutional neural networks, mainly on 2-D lateral radiographs (n=15), using data from publicly available datasets (n=12) and testing the detection of a mean of 30 (SD: 25; range.: 7–93) landmarks, were included. The reference test was established by two experts (n=11), 1 expert (n=4), 3 experts (n=3), and a set of annotators (n=1). Risk of bias was high, and applicability concerns were detected for most studies, mainly regarding the data selection and reference test conduct. Landmark prediction error centered around a 2-mm error threshold (mean; 95% confidence interval: (–0.581; 95 CI: –1.264 to 0.102 mm)). The proportion of landmarks detected within this 2-mm threshold was 0.799 (0.770 to 0.824). Conclusions DL shows relatively high accuracy for detecting landmarks on cephalometric imagery. The overall body of evidence is consistent but suffers from high risk of bias. Demonstrating robustness and generalizability of DL for landmark detection is needed. Clinical significance Existing DL models show consistent and largely high accuracy for automated detection of cephalometric landmarks. The majority of studies so far focused on 2-D imagery; data on 3-D imagery are sparse, but promising. Future studies should focus on demonstrating generalizability, robustness, and clinical usefulness of DL for this objective.

Download Full-text

Anthropometric Landmark Detection in 3D Head Surfaces using a Deep Learning Approach

IEEE Journal of Biomedical and Health Informatics ◽

10.1109/jbhi.2020.3035888 ◽

2020 ◽

pp. 1-1

Author(s):

Helena R.Torres ◽

Pedro Morais ◽

Anne Fritze ◽

Bruno Oliveira ◽

Fernando Veloso ◽

...

Keyword(s):

Deep Learning ◽

Learning Approach ◽

Landmark Detection

Download Full-text

SHEDR: An End-to-End Deep Neural Event Detection and Recommendation Framework for Hyperlocal News Using Social Media

INFORMS Journal on Computing ◽

10.1287/ijoc.2021.1112 ◽

2021 ◽

Author(s):

Yuheng Hu ◽

Yili Hong

Keyword(s):

Neural Network ◽

Social Media ◽

Deep Learning ◽

Event Detection ◽

Large Scale ◽

Short Term Memory ◽

State Of The Art ◽

Neural Network Models ◽

Neural Event ◽

End To End

Residents often rely on newspapers and television to gather hyperlocal news for community awareness and engagement. More recently, social media have emerged as an increasingly important source of hyperlocal news. Thus far, the literature on using social media to create desirable societal benefits, such as civic awareness and engagement, is still in its infancy. One key challenge in this research stream is to timely and accurately distill information from noisy social media data streams to community members. In this work, we develop SHEDR (social media–based hyperlocal event detection and recommendation), an end-to-end neural event detection and recommendation framework with a particular use case for Twitter to facilitate residents’ information seeking of hyperlocal events. The key model innovation in SHEDR lies in the design of the hyperlocal event detector and the event recommender. First, we harness the power of two popular deep neural network models, the convolutional neural network (CNN) and long short-term memory (LSTM), in a novel joint CNN-LSTM model to characterize spatiotemporal dependencies for capturing unusualness in a region of interest, which is classified as a hyperlocal event. Next, we develop a neural pairwise ranking algorithm for recommending detected hyperlocal events to residents based on their interests. To alleviate the sparsity issue and improve personalization, our algorithm incorporates several types of contextual information covering topic, social, and geographical proximities. We perform comprehensive evaluations based on two large-scale data sets comprising geotagged tweets covering Seattle and Chicago. We demonstrate the effectiveness of our framework in comparison with several state-of-the-art approaches. We show that our hyperlocal event detection and recommendation models consistently and significantly outperform other approaches in terms of precision, recall, and F-1 scores. Summary of Contribution: In this paper, we focus on a novel and important, yet largely underexplored application of computing—how to improve civic engagement in local neighborhoods via local news sharing and consumption based on social media feeds. To address this question, we propose two new computational and data-driven methods: (1) a deep learning–based hyperlocal event detection algorithm that scans spatially and temporally to detect hyperlocal events from geotagged Twitter feeds; and (2) A personalized deep learning–based hyperlocal event recommender system that systematically integrates several contextual cues such as topical, geographical, and social proximity to recommend the detected hyperlocal events to potential users. We conduct a series of experiments to examine our proposed models. The outcomes demonstrate that our algorithms are significantly better than the state-of-the-art models and can provide users with more relevant information about the local neighborhoods that they live in, which in turn may boost their community engagement.

Download Full-text

A Non - Intrusive Load Identification Algorithm Combined with Event Detection

Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) ◽

10.2174/2352096514666210617125742 ◽

2021 ◽

Vol 14 ◽

Author(s):

Runhai Jiao ◽

Qihang Zhou ◽

Liangqiu Lyu ◽

Guangwei Yan

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Event Detection ◽

The State ◽

Event Recognition ◽

Identification Algorithm ◽

Load Identification ◽

Fusion Model ◽

Public Data ◽

Artificial Neural

Background: The traditional state-based non-intrusive load monitoring method mainly deploys the aggregate load as the characteristic to identify the states of every electrical appliance. Each identification is relatively independent, and there is no correlation between the identification results. Objective: This paper combines the event detection results with the state-based non-intrusive load identification algorithm to improve accuracy. Methods: Firstly, the load recognition model based on an artificial neural network is constructed, and the state-based recognition results are obtained. An event recognition and detection model is then built to identify electrical state transitions, that is, the current moment based on the event recognition results obtained from the previous moment. Finally, a reasonable decision method is constructed to determine the identification result of the electrical states. Result: Experimental results on the public data set REDD show that in the Long Short-Term Memory (LSTM) fusion model, the macro-F1 is increased by an average of 6%, and the macro-F1 of the Artificial Neural Network (ANN) fusion model is increased by an average of 5.3% compared with LSTM and ANN. Conclusion: The proposed model can effectively improve the accuracy of identification compared with the state-based load identification method.

Download Full-text

Developing a Twitter-based traffic event detection model using deep learning architectures

Expert Systems with Applications ◽

10.1016/j.eswa.2018.10.017 ◽

2019 ◽

Vol 118 ◽

pp. 425-439 ◽

Cited By ~ 20

Author(s):

Sina Dabiri ◽

Kevin Heaslip

Keyword(s):

Deep Learning ◽

Event Detection ◽

Detection Model ◽

Learning Architectures

Download Full-text