A Crowd Sensing Approach to Video Classification of Traffic Accident Hotspots

Author(s):  
Bernhard Gahr ◽  
Benjamin Ryder ◽  
André Dahlinger ◽  
Felix Wortmann


Author(s):  
Hehe Fan ◽  
Zhongwen Xu ◽  
Linchao Zhu ◽  
Chenggang Yan ◽  
Jianjun Ge ◽  
...  

We aim to significantly reduce the computational cost of classifying temporally untrimmed videos while retaining similar accuracy. Existing video classification methods sample frames at a predefined frequency over the entire video. In contrast, we propose an end-to-end deep reinforcement learning approach that enables an agent to classify videos by watching only a very small portion of the frames, much as humans do. We make two main contributions. First, information is not evenly distributed across video frames over time. An agent needs to watch more carefully when a clip is informative and skip frames that are redundant or irrelevant. The proposed approach enables the agent to adapt its sampling rate to the video content and skip most frames without loss of information. Second, the number of frames an agent should watch to reach a confident decision varies greatly from one video to another. We incorporate an adaptive stop network that measures a confidence score and generates a timely trigger to stop the agent from watching further, which improves efficiency without loss of accuracy. Our approach significantly reduces the computational cost on the large-scale YouTube-8M dataset, while the accuracy remains the same.
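As a rough illustration of the mechanism this abstract describes, the following PyTorch sketch shows an agent loop with a learned skip head (adaptive sampling rate) and a stop head (adaptive stop trigger). All module names, layer sizes, and thresholds are illustrative assumptions, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class AdaptiveWatcher(nn.Module):
    """Sketch of an agent that classifies a video while watching only a
    small portion of frames: at each step it updates a recurrent state,
    predicts class logits, how many frames to skip next, and a stop
    confidence. Sizes and names are illustrative assumptions."""
    def __init__(self, feat_dim=2048, hidden=512, n_classes=10, max_skip=25):
        super().__init__()
        self.rnn = nn.GRUCell(feat_dim, hidden)       # running video state
        self.classifier = nn.Linear(hidden, n_classes)
        self.skip_head = nn.Linear(hidden, max_skip)  # adaptive sampling rate
        self.stop_head = nn.Linear(hidden, 1)         # adaptive stop trigger

    def forward(self, frame_feats, stop_threshold=0.9):
        # frame_feats: (num_frames, feat_dim); assumes at least one frame
        h = frame_feats.new_zeros(1, self.rnn.hidden_size)
        t, watched = 0, 0
        while t < frame_feats.shape[0]:
            h = self.rnn(frame_feats[t:t + 1], h)
            watched += 1
            logits = self.classifier(h)
            # stop early once the confidence score is high enough
            if torch.sigmoid(self.stop_head(h)).item() > stop_threshold:
                break
            # skip ahead past frames the agent deems redundant
            t += 1 + self.skip_head(h).argmax().item()
        return logits, watched
```

In the paper's setting the skip and stop decisions are trained with reinforcement learning; the greedy inference loop above is only meant to show how adaptive skipping and early stopping reduce the number of frames watched.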


Sensors ◽  
2020 ◽  
Vol 20 (24) ◽  
pp. 7184
Author(s):  
Kunyoung Lee ◽  
Eui Chul Lee

Clinical studies have demonstrated that spontaneous and posed smiles differ spatiotemporally in facial muscle movements, such as laterally asymmetric movements that use different facial muscles. In this study, a model was developed that classifies videos of the two types of smiles using a 3D convolutional neural network (CNN) in a Siamese architecture, with a neutral expression as the reference input. The proposed model makes the following contributions. First, it solves the problem caused by differences in appearance between individuals, because it learns the spatiotemporal differences between an individual's neutral expression and their spontaneous and posed smiles. Second, using a neutral expression as an anchor improves model accuracy compared with the conventional method using genuine and impostor pairs. Third, using a neutral expression as an anchor image makes it possible to build a fully automated classification system for spontaneous and posed smiles. In addition, visualizations were designed for the Siamese architecture-based 3D CNN to analyze the accuracy improvement and to compare the proposed and conventional methods through feature analysis using principal component analysis (PCA).
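A minimal PyTorch sketch of the idea follows, assuming a shared 3D-CNN encoder applied to both the neutral-expression anchor clip and the smile clip, with the classifier operating on the difference of the two embeddings; the layer sizes are illustrative, not the paper's architecture.

```python
import torch
import torch.nn as nn

class Siamese3DCNN(nn.Module):
    """Sketch of a Siamese 3D CNN smile classifier: one encoder with
    shared weights embeds both clips, and the head classifies the
    difference, removing the person-specific appearance baseline."""
    def __init__(self, n_classes=2):
        super().__init__()
        self.encoder = nn.Sequential(          # shared weights (Siamese)
            nn.Conv3d(3, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool3d(2),
            nn.Conv3d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1), nn.Flatten())
        self.head = nn.Linear(32, n_classes)   # spontaneous vs. posed

    def forward(self, neutral_clip, smile_clip):
        # clips: (batch, channels, frames, height, width)
        anchor = self.encoder(neutral_clip)
        smile = self.encoder(smile_clip)
        return self.head(smile - anchor)
```

Subtracting the anchor embedding is one simple way to condition on the neutral expression; the paper may combine the two streams differently.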


Healthcare ◽  
2021 ◽  
Vol 9 (11) ◽  
pp. 1579
Author(s):  
Wansuk Choi ◽  
Seoyoon Heo

The purpose of this study was to classify upper limb tension test (ULTT) videos through transfer learning with pre-trained deep learning models and to compare the performance of the models. We conducted transfer learning by combining a pre-trained convolutional neural network (CNN) model into a Python-based deep learning pipeline. Videos were sourced from YouTube, and 103,116 frames converted from the video clips were analyzed. In the modeling implementation, the steps of importing the required modules, preprocessing the data for training, defining the model, compiling it, creating the model, and fitting it were applied in sequence; a sketch of this pipeline follows the abstract. The compared models were Xception, InceptionV3, DenseNet201, NASNetMobile, DenseNet121, VGG16, VGG19, and ResNet101, and fine-tuning was performed. They were trained in a high-performance computing environment, and validation accuracy and validation loss were measured as comparative indicators of performance. Relatively low validation loss and high validation accuracy were obtained from the Xception, InceptionV3, and DenseNet201 models, indicating excellent performance compared with the other models. In contrast, VGG16, VGG19, and ResNet101 yielded relatively high validation loss and low validation accuracy. The difference between validation accuracy and validation loss was narrow for the Xception, InceptionV3, and DenseNet201 models. This study suggests that training with transfer learning can classify ULTT videos, and that there are performance differences between models.
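The modeling sequence described above (import modules, preprocess, define, compile, create, fit, then fine-tune) maps naturally onto a Keras transfer-learning pipeline. A minimal sketch with Xception is below; the frame directories and hyperparameters are hypothetical.

```python
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications.xception import Xception, preprocess_input

# hypothetical folders of extracted video frames, one subfolder per class
train_ds = tf.keras.utils.image_dataset_from_directory(
    "frames/train", image_size=(299, 299), batch_size=32)
val_ds = tf.keras.utils.image_dataset_from_directory(
    "frames/val", image_size=(299, 299), batch_size=32)

# pre-trained base with the ImageNet classification top removed
base = Xception(weights="imagenet", include_top=False, pooling="avg",
                input_shape=(299, 299, 3))
base.trainable = False  # freeze pre-trained weights for the first stage

inputs = layers.Input(shape=(299, 299, 3))
x = preprocess_input(inputs)
x = base(x, training=False)
outputs = layers.Dense(len(train_ds.class_names), activation="softmax")(x)
model = models.Model(inputs, outputs)

model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(train_ds, validation_data=val_ds, epochs=5)

# fine-tuning: unfreeze the base and retrain with a small learning rate
base.trainable = True
model.compile(optimizer=tf.keras.optimizers.Adam(1e-5),
              loss="sparse_categorical_crossentropy", metrics=["accuracy"])
model.fit(train_ds, validation_data=val_ds, epochs=3)
```

Swapping `Xception` for `InceptionV3`, `DenseNet201`, and so on (each with its own input size and `preprocess_input`) reproduces the comparison setup described in the study.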


2021 ◽  
Vol 18 (1) ◽  
pp. 102-120
Author(s):  
Aprilia Lutviana Dewi ◽  
Budyanra Budyanra

Traffic accidents among students are one of the problems experienced in the Greater Jakarta area. The World Health Organization (WHO) has stated that younger drivers, including students, are the group most vulnerable to traffic accidents. According to Badan Pusat Statistik (BPS), an estimated 301,120 Jabodetabek commuters experienced a traffic accident in 2019. Moreover, 13 to 14 of every 100 commuters who experienced a traffic accident were student commuters, that is, commuters whose main activity is attending school. Therefore, this study was conducted to determine the factors that affect the accident status of Jabodetabek student commuters in 2019 and their odds ratios, using the 2019 Jabodetabek Commuter Survey data. The analytical method is binary logistic regression with parameters estimated by penalized maximum likelihood estimation (PMLE). The results showed that age, gender, last education, mode of transportation, classification of the area of residence, distance traveled, and area of activity had a significant influence on the accident status of Jabodetabek student commuters. Furthermore, student commuters who live in rural areas have the highest tendency to experience a traffic accident.
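The PMLE step here plausibly refers to Firth's penalized likelihood, a common remedy when logistic regression faces rare events or separation; assuming that, the sketch below implements the standard Firth-modified Newton iteration in NumPy, with hypothetical toy data.

```python
import numpy as np

def firth_logit(X, y, n_iter=100, tol=1e-8):
    """Firth-penalized MLE for binary logistic regression.
    X: (n, p) design matrix with an intercept column; y: (n,) in {0, 1}."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = 1.0 / (1.0 + np.exp(-(X @ beta)))     # fitted probabilities
        W = mu * (1.0 - mu)                        # IRLS weights
        info_inv = np.linalg.inv(X.T @ (W[:, None] * X))
        # leverages: diag of W^1/2 X (X'WX)^-1 X' W^1/2
        h = np.einsum('ij,jk,ik->i', X, info_inv, X) * W
        # Firth correction adds h * (1/2 - mu) to the score residuals
        step = info_inv @ (X.T @ (y - mu + h * (0.5 - mu)))
        beta += step
        if np.max(np.abs(step)) < tol:
            break
    return beta

# toy usage with one hypothetical binary predictor (e.g., rural residence)
rng = np.random.default_rng(0)
x = rng.integers(0, 2, size=500)
y = rng.binomial(1, np.where(x == 1, 0.25, 0.14))
beta = firth_logit(np.column_stack([np.ones(500), x]), y)
print("estimated odds ratio for rural residence:", np.exp(beta[1]))
```

Exponentiating a fitted coefficient gives the odds ratio of the kind reported in the study.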

