scholarly journals A Continuous Semantic Embedding Method for Video Compact Representation

Electronics ◽  
2021 ◽  
Vol 10 (24) ◽  
pp. 3106
Author(s):  
Tingting Han ◽  
Yuankai Qi ◽  
Suguo Zhu

Video compact representation aims to obtain a representation that could reflect the kernel mode of video content and concisely describe the video. As most information in complex videos is either noisy or redundant, some researchers have instead focused on long-term video semantics. Recent video compact representation methods heavily rely on the segmentation accuracy of video semantics. In this paper, we propose a novel framework to address these challenges. Specifically, we designed a novel continuous video semantic embedding model to learn the actual distribution of video words. First, an embedding model based on the continuous bag of words method is proposed to learn the video embeddings, integrated with a well-designed discriminative negative sampling approach, which helps emphasize the convincing clips in the embedding while weakening the influence of the confusing ones. Second, an aggregated distribution pooling method is proposed to capture the semantic distribution of kernel modes in videos. Finally, our well-trained model can generate compact video representations by direct inference, which provides our model with a better generalization ability compared with those of previous methods. We performed extensive experiments on event detection and the mining of representative event parts. Experiments on TRECVID MED11 and CCV datasets demonstrated the effectiveness of our method. Our method could capture the semantic distribution of kernel modes in videos and shows powerful potential to discover and better describe complex video patterns.

2020 ◽  
Vol 4 (Supplement_1) ◽  
pp. 533-533
Author(s):  
Linda Edelman ◽  
Troy Andersen ◽  
Cherie Brunker ◽  
Nicholas Cox ◽  
Jorie Butler ◽  
...  

Abstract Opioids are often the first-line chronic pain management strategy for long-term care (LTC) residents who are also at increased risk for opioid-related adverse events. Therefore, there is a need to train LTC providers and staff about appropriate opioid use and alternative treatment strategies. Our interdisciplinary team worked with LTC partners to identify staff educational needs around opioid stewardship. Based on this need’s assessment, we developed eight modules about opioid use and risks for older adults, including those with dementia, recommendations for de-prescribing including other pharmacological and non-pharmacological alternatives, SBIRT, and motivational interviewing to determine “what matters”. Each 20-minute module contains didactic and video content that is appropriate for group staff training or individuals and provides rural LTC facilities access to needed training in their home communities. Within the first month of launching online, the program received over 1100 hits and LTC partners are incorporating modules into clinical staff training schedules.


Author(s):  
Min Chen

The fast proliferation of video data archives has increased the need for automatic video content analysis and semantic video retrieval. Since temporal information is critical in conveying video content, in this chapter, an effective temporal-based event detection framework is proposed to support high-level video indexing and retrieval. The core is a temporal association mining process that systematically captures characteristic temporal patterns to help identify and define interesting events. This framework effectively tackles the challenges caused by loose video structure and class imbalance issues. One of the unique characteristics of this framework is that it offers strong generality and extensibility with the capability of exploring representative event patterns with little human interference. The temporal information and event detection results can then be input into our proposed distributed video retrieval system to support the high-level semantic querying, selective video browsing and event-based video retrieval.


Author(s):  
Min Chen

The fast proliferation of video data archives has increased the need for automatic video content analysis and semantic video retrieval. Since temporal information is critical in conveying video content, in this chapter, an effective temporal-based event detection framework is proposed to support high-level video indexing and retrieval. The core is a temporal association mining process that systematically captures characteristic temporal patterns to help identify and define interesting events. This framework effectively tackles the challenges caused by loose video structure and class imbalance issues. One of the unique characteristics of this framework is that it offers strong generality and extensibility with the capability of exploring representative event patterns with little human interference. The temporal information and event detection results can then be input into our proposed distributed video retrieval system to support the high-level semantic querying, selective video browsing and event-based video retrieval.


2020 ◽  
Author(s):  
Anastasia Sokolova ◽  
Yuri Uljanitski ◽  
Airat R. Kayumov ◽  
Mikhail I Bogachev

ABSTRACTDespite recent success in advanced signal analysis technologies, simple and universal methods are still of interest in a variety of applications. Wearable devices including biomedical monitoring and diagnostic systems suitable for long-term operation are prominent examples, where simple online signal analysis and early event detection algorithms are required. Here we suggest a simple and universal approach to the online detection of events represented by abrupt bursts in long-term observational data series. We show that simple gradient-based transformations obtained as a product of the signal and its derivative lead to the improved accuracy of the online detection of any significant bursts in the observational data series irrespective of their particular shapes. We provide explicit analytical expressions characterizing the performance of the suggested approach in comparison with the conventional solutions optimized for particular theoretical scenarios and widely utilized in various signal analysis applications. Moreover, we estimate the accuracy of the gradient-based approach in the exact positioning of single ECG cycles, where it outperforms the conventional Pan-Tompkins algorithm in its original formulation, while exhibiting comparable detection efficacy. Finally, we show that our approach is also applicable to the comparative analysis of lanes in electrophoretic gel images widely used in life sciences and molecular diagnostics like restriction fragment length polymorphism (RFLP) and variable number tandem repeats (VNTR) methods.


Author(s):  
Samuel Negredo

Newspaper websites and online only news operations deliver an increasingly varied and comprehensive offer of original audiovisual content. In Spain, they cover current affairs and niche interests, complementing the video reports supplied by news agencies. The spoken word is a primary mode of expression, in the form of dialogues (interviews and debates) and speeches (comments and analyses), but more complex and visually appealing formats have been developed. There is a challenge to organise these packages and programmes in order to facilitate access and retrieval, which may help to improve user experience, and to maximise long-term consumption and value.


2007 ◽  
Vol 2007 (1) ◽  
pp. 014615 ◽  
Author(s):  
Bart Lehane ◽  
NoelE O'Connor ◽  
Hyowon Lee ◽  
AlanF Smeaton

Geophysics ◽  
2019 ◽  
Vol 84 (4) ◽  
pp. KS143-KS153
Author(s):  
Jubran Akram ◽  
Daniel Peter ◽  
David Eaton

Event detection is an essential component of microseismic data analysis. This process is typically carried out using a short- and long-term-average-ratio (STA/LTA) method, which is simple and computationally efficient but often yields inconsistent results for noisy data sets. We have aimed to optimize the performance of the STA/LTA method by testing different input forms of 3C waveform data and different characteristic functions (CFs), including a proposed [Formula: see text]-mean CF. These tests are evaluated using receiver operating characteristic (ROC) analysis and are compared based on synthetic and field data examples. Our analysis indicates that the STA/LTA method using a [Formula: see text]-mean CF improves the detection sensitivity and yields more robust event detection on noisy data sets than some previous approaches. In addition, microseismic events are detected efficiently on field data examples using the same detection threshold obtained from the ROC analysis on synthetic data examples. We recommend the use of the Youden index based on ROC analysis using a training subset, extracted from the continuous data, to further improve the detection threshold for field microseismic data.


Author(s):  
Alexander Artikis ◽  
Marek Sergot ◽  
Georgios Paliouras

The authors have been developing a system for recognising human activities given a symbolic representation of video content. The input of the system is a stream of time-stamped short-term activities detected on video frames. The output of the system is a set of recognised long-term activities, which are pre-defined spatio-temporal combinations of short-term activities. The constraints on the short-term activities that, if satisfied, lead to the recognition of a long-term activity, are expressed using a dialect of the Event Calculus. The authors illustrate the expressiveness of the dialect by showing the representation of several typical complex activities. Furthermore, they present a detailed evaluation of the system through experimentation on a benchmark dataset of surveillance videos.


Sign in / Sign up

Export Citation Format

Share Document