video feature
Recently Published Documents

TOTAL DOCUMENTS: 69 (five years: 29)
H-INDEX: 7 (five years: 1)

Author(s): Panayiota Mini

This article examines the first Greek film to give a central role to the concept of the mermaid: Georges Skalenakis’ 1987 direct-to-video feature Gorgona (‘Mermaid’). Although actually concerning an all-human female, Gorgona attaches to her many traits of both the internationally common half-fish/half-woman creature (known in Greek as γοργόνα/gorgona) and the mermaid sister (also known as γοργόνα) in the legend of Alexander the Great. The article identifies the video-film’s allusions to these fishtailed figures and argues that the film produced an updated mermaid image that responded to other national and foreign audiovisual conceptions of the mermaid of the 1980s and enriched the star persona of its female lead, Eleni Filini, with a mythic quality and national symbolism.


2021, Vol 2021, pp. 1-8
Author(s): Zhang Min-qing, Li Wen-ping

There are many different types of sports and training videos, and categorizing them can be difficult. This research therefore introduces an automatic video content classification system that makes managing large amounts of video data easier. It presents a video feature extraction approach built on a support vector machine (SVM) classification algorithm and a combination of dual-mode video and audio features. The system automates the classification of cartoon, advertisement, music, news, and sports videos, as well as the detection of terrorist and violent scenes in films. First, after analyzing the shortcomings of existing video classification algorithms, a new feature expression scheme, a subcombination of MPEG-7 visual descriptors, is proposed; it is derived by analyzing the visual differences among the five video categories. Nine descriptors are extracted from the four characteristics of color, texture, shape, and motion and combined into a new overall visual feature that performs well. The results suggest that the algorithm improves video segmentation by highlighting differences in feature selection between different categories of video. Second, the SVM's multiclass video classification performance is improved by an enhanced secondary prediction method. Finally, a comparison experiment with current related algorithms shows that the proposed method achieves higher classification accuracy across the five video types and better recognition of terrorist and violent incidents.


2021, Vol 2021, pp. 1-7
Author(s): Xiangli Xia, Yang Gao

Video abnormal event detection is a challenging problem in the field of pattern recognition. Existing methods usually design the two steps of video feature extraction and anomaly detection model construction independently, which prevents them from reaching an optimal result. As a remedy, a method based on a one-class neural network (ONN) is designed for video anomaly detection. The proposed method combines the layer-by-layer data representation capability of the autoencoder with the good classification capability of the ONN. The hidden-layer features are constructed for the specific task of anomaly detection, thereby obtaining a hyperplane that separates all normal samples from abnormal ones. Experimental results show that the proposed method achieves 94.9% and 94.5% frame-level AUC on the Ped1 and Ped2 subsets of the UCSD dataset, respectively. In addition, it achieves 80 correct event detections on the Subway dataset. These results confirm the wide applicability and good performance of the proposed method in industrial and urban environments.


2021, Vol 108 (Supplement_6)
Author(s): J Colemeadow, S K Pandian

Abstract. Aim: To establish the level of both patient and clinician satisfaction with the newly implemented telephone clinics and to determine ways in which the service can be improved. The telephone clinics were implemented during the COVID-19 pandemic within the urology department of a busy university hospital. Method: An online and paper questionnaire was distributed to patients who received a telephone consultation between April and August 2020. A similar online questionnaire was distributed to urology staff undertaking telephone consultations. Results: 44 patient responses and 8 clinician responses were received. 72% of patients were satisfied or very satisfied with the telephone clinic service provided, and the same proportion received their appointment on schedule. 98% of patients could hear and understand the information relayed to them, and 78% would opt for a telephone clinic in the future. Only 21% of patients would have preferred the addition of a video feature to their telephone consultation. 63% of clinicians felt that telephone consultations were a suitable alternative, and 89% reported that their consultations ran to time. Regarding improvements, 89% of clinicians felt that a language interpretation service should be readily available and that headsets may facilitate ease of consultation. Conclusions: Telephone consultations are effective and appropriate during the restricted services resulting from the COVID-19 pandemic, and patients have received them well. The majority of clinicians felt that telephone clinics were a suitable alternative; however, several improvements to the service have been suggested.


2021, pp. 002224372110420
Author(s): Mi Zhou, Pedro Ferreira, Michael D. Smith, George H. Chen

Video is one of the fastest-growing online services offered to consumers. The rapid growth of online video consumption brings new opportunities for marketing executives and researchers to analyze consumer behavior. However, video also introduces new challenges: analyzing unstructured video data presents formidable methodological difficulties that limit the current use of multimedia data to generate marketing insights. To address this challenge, the authors propose a novel video feature framework based on machine learning and computer vision techniques, which helps marketers predict and understand the consumption of online video from a content-based perspective. The authors apply this framework to two unique datasets: one provided by Masterclass.com, consisting of 771 online videos and more than 2.6 million viewing records from 225,580 consumers, and another from Crash Course, consisting of 1,127 videos covering more traditional educational disciplines. The analyses show that the proposed framework can be used to accurately predict both individual-level consumer behavior and aggregate video popularity in these two very different contexts. The authors discuss how their findings and methods can be used to advance management and marketing research with unstructured video data in other contexts, such as video marketing and entertainment analytics.


Sensors, 2021, Vol 21 (9), pp. 3094
Author(s): Hanqing Chen, Chunyan Hu, Feifei Lee, Chaowei Lin, Wei Yao, ...

Recently, with the popularization of camera tools such as mobile phones and the rise of various short-video platforms, vast numbers of videos are being uploaded to the Internet at all times, so a video retrieval system with fast retrieval speed and high precision is very necessary. Content-based video retrieval (CBVR) has therefore aroused the interest of many researchers. A typical CBVR system contains two essential parts: video feature extraction and similarity comparison. Feature extraction from video is very challenging; previous video retrieval methods are mostly based on extracting features from single video frames, which loses the temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval because of their retrieval efficiency, but most of them are currently applied only to image retrieval. To solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatio-temporal features of videos and then trains a set of hash functions by supervised hashing to transfer the video features into binary space and obtain compact binary codes for the videos. The network is trained with a triplet loss. We conduct extensive experiments on three public video datasets, UCF-101, JHMDB, and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP on the UCF-101 dataset improves by 9.3%, and even the smallest improvement, on the JHMDB dataset, is 0.3%. We also demonstrate the stability of the algorithm on the HMDB-51 dataset.


2021, Vol 11 (1)
Author(s): Peter Washington, Qandeel Tariq, Emilie Leblanc, Brianna Chrisman, Kaitlyn Dunlap, ...

Abstract. Standard medical diagnosis of mental health conditions requires licensed experts who are increasingly outnumbered by those at risk, limiting reach. We test the hypothesis that a trustworthy crowd of non-experts can efficiently annotate the behavioral features needed for accurate machine learning detection of the common childhood developmental disorder Autism Spectrum Disorder (ASD) in children under 8 years old. We implement a novel process for identifying and certifying a trustworthy distributed workforce for video feature extraction, selecting a workforce of 102 workers from a pool of 1,107. Two previously validated ASD logistic regression classifiers, evaluated against parent-reported diagnoses, were used to assess the accuracy of the trusted crowd's ratings of unstructured home videos. A representative, balanced sample of videos (N = 50) was evaluated with and without face-box and pitch-shift privacy alterations, with AUROC and AUPRC scores > 0.98. With both privacy-preserving modifications, sensitivity is preserved (96.0%) while specificity (80.0%) and accuracy (88.0%) are maintained at levels comparable to prior classification methods without alterations. We find that machine learning classification from features extracted by a certified non-expert crowd achieves high performance for ASD detection from natural home videos of the child at risk and maintains high sensitivity when privacy-preserving mechanisms are applied. These results suggest that privacy-safeguarded crowdsourced analysis of short home videos can help enable rapid and mobile machine-learning detection of developmental delays in children.


Author(s): Daniel Danso Essel, Ben-Bright Benuwa, Benjamin Ghansah

Sparse Representation (SR) and Dictionary Learning (DL) based classifiers have shown promising results in classification tasks, with impressive recognition rates on image data. In Video Semantic Analysis (VSA), however, the local structure of video data contains significant discriminative information required for classification. To the best of our knowledge, this has not been fully explored by recent DL-based approaches. Moreover, similar sparse codes are not obtained for video features belonging to the same video category. Based on the foregoing, this paper proposes a novel learning algorithm, Sparsity-based Locality-Sensitive Discriminative Dictionary Learning (SLSDDL), for VSA. In the proposed algorithm, a category-level discriminant loss function based on the sparse coding coefficients is introduced into the structure of the Locality-Sensitive Dictionary Learning (LSDL) algorithm. Finally, the sparse coefficients of a test video feature sample are solved by the optimized SLSDDL method, and the video semantic classification result is obtained by minimizing the error between the original and reconstructed samples. Experimental results show that the proposed SLSDDL significantly improves the performance of video semantic detection compared with state-of-the-art approaches. The proposed approach is also robust to diverse video environments, demonstrating its generality.


2021
Author(s): Yunyun Sun, Peng Li, Yutong Liu, Zhaohui Jiang

Abstract. Numerous limitations of shot-based and content-based key frame extraction approaches have encouraged the development of cluster-based methods. This work proposes OTMW (Optimal Threshold and Maximum Weight), a novel cluster-based key frame extraction method. The video feature dataset is constructed by computing the color, texture, and information-complexity features of frame images. An optimization function, constrained by fidelity and ratio measure parameters, is developed to compute the optimal clustering threshold. We conduct an empirical study of the proposed method on multi-type video key frame extraction tasks and compare it with popular cluster-based methods including Mean-shift, DBSCAN, GMM, and K-means. OTMW achieves an average fidelity of 96.12 and an average ratio of 97.13. Experimental results demonstrate that OTMW delivers higher fidelity and ratio performance while remaining competitive with the other cluster-based methods. Overall, the proposed method can accurately extract key frames from multi-type videos.

