A new 3D convolutional neural network (3D-CNN) framework for multimedia event detection

Author(s):  
Kaavya Kanagaraj ◽  
G. G. Lakshmi Priya
2019 ◽  
Vol 15 (6) ◽  
pp. 155014771985649 ◽  
Author(s):  
Van Quan Nguyen ◽  
Tien Nguyen Anh ◽  
Hyung-Jeong Yang

We proposed an approach for temporal event detection using deep learning and multi-embedding on a set of text data from social media. First, a convolutional neural network augmented with multiple word-embedding architectures is used as a text classifier for the pre-processing of the input textual data. Second, an event detection model using a recurrent neural network is employed to learn time series data features by extracting temporal information. Recently, convolutional neural networks have been used in natural language processing problems and have obtained excellent results as performing on available embedding vector. In this article, word-embedding features at the embedding layer are combined and fed to convolutional neural network. The proposed method shows no size limitation, supplementation of more embeddings than standard multichannel based approaches, and obtained similar performance (accuracy score) on some benchmark data sets, especially in an imbalanced data set. For event detection, a long short-term memory network is used as a predictor that learns higher level temporal features so as to predict future values. An error distribution estimation model is built to calculate the anomaly score of observation. Events are detected using a window-based method on the anomaly scores.


2019 ◽  
Author(s):  
Jinhyeong Bae ◽  
Jane Stocks ◽  
Ashley Heywood ◽  
Youngmoon Jung ◽  
Lisanne Jenkins ◽  
...  

AbstractDementia of Alzheimer’s Type (DAT) is associated with a devastating and irreversible cognitive decline. As a pharmacological intervention has not yet been developed to reverse disease progression, preventive medicine will play a crucial role for patient care and treatment planning. However, predicting which patients will progress to DAT is difficult as patients with Mild Cognitive Impairment (MCI) could either convert to DAT (MCI-C) or not (MCI-NC). In this paper, we develop a deep learning model to address the heterogeneous nature of DAT development. Structural magnetic resonance imaging was utilized as a single biomarker, and a three-dimensional convolutional neural network (3D-CNN) was developed. The 3D-CNN was trained using transfer learning from the classification of Normal Control and DAT scans at the source task. This was applied to the target task of classifying MCI-C and MCI-NC scans. The model results in 82.4% classification accuracy, which outperforms current models in the field. Furthermore, by implementing an occlusion map approach, we visualize key brain regions that significantly contribute to the prediction of MCI-C and MCI-NC. Results show the hippocampus, amygdala, cerebellum, and pons regions as significant to prediction, which are consistent with current understanding of disease. Finally, the model’s prediction value is significantly correlated with rates of change in clinical assessment scores, indicating the model is able to predict an individual patient’s future cognitive decline. This information, in conjunction with the identified anatomical features, will aid in building a personalized therapeutic strategy for individuals with MCI. This model could also be useful for selection of participants for clinical trials.


Nanomaterials ◽  
2020 ◽  
Vol 10 (6) ◽  
pp. 1161 ◽  
Author(s):  
Kai-Chun Li ◽  
Ming-Yen Lu ◽  
Hong Thai Nguyen ◽  
Shih-Wei Feng ◽  
Sofya B. Artemkina ◽  
...  

Increasing attention has been paid to two-dimensional (2D) materials because of their superior performance and wafer-level synthesis methods. However, the large-area characterization, precision, intelligent automation, and high-efficiency detection of nanostructures for 2D materials have not yet reached an industrial level. Therefore, we use big data analysis and deep learning methods to develop a set of visible-light hyperspectral imaging technologies successfully for the automatic identification of few-layers MoS2. For the classification algorithm, we propose deep neural network, one-dimensional (1D) convolutional neural network, and three-dimensional (3D) convolutional neural network (3D-CNN) models to explore the correlation between the accuracy of model recognition and the optical characteristics of few-layers MoS2. The experimental results show that the 3D-CNN has better generalization capability than other classification models, and this model is applicable to the feature input of the spatial and spectral domains. Such a difference consists in previous versions of the present study without specific substrate, and images of different dynamic ranges on a section of the sample may be administered via the automatic shutter aperture. Therefore, adjusting the imaging quality under the same color contrast conditions is unnecessary, and the process of the conventional image is not used to achieve the maximum field of view recognition range of ~1.92 mm2. The image resolution can reach ~100 nm and the detection time is 3 min per one image.


2019 ◽  
Vol 2019 ◽  
pp. 1-17 ◽  
Author(s):  
Zhuofu Deng ◽  
Qingzhe Guo ◽  
Zhiliang Zhu

Segmentation of liver tumors plays an important role in the choice of therapeutic strategies for liver disease and treatment monitoring. In this paper, we generalize the process of a level set with a novel algorithm of dynamic regulation to energy functional parameters. The presented method is fully automatic once the tumor has been detected. First, a 3D convolutional neural network with dense layers for classification is used to estimate current contour location relative to the tumor boundary. Second, the output 3D CNN probabilities can dynamically regulate parameters of the level set functional over the process of segmentation. Finally, for full automation, appropriate initializations and local window size are generated based on the current contour position probabilities. We demonstrate the proposed method on the dataset of MICCAI 2017 LiTS Challenge and 3DIRCADb that include low contrast and heterogeneous tumors as well as noisy images. To illustrate the strength of our method, we evaluated it against the state-of-the-art methods. Compared with the level set framework with fixed parameters, our method performed better significantly with an average DICE improvement of 0.15. We also analyzed a challenging dataset 3DIRCADb of tumors and obtained a competitive DICE of 0.85±0.06 with the proposed method.


Author(s):  
Kinjal V. Joshi ◽  
Narendra M. Patel

Automatic abnormal event detection in a surveillance scene is very significant because of more consciousness about public safety. Because of usefulness and complexity, currently, it is an open research area. In this manuscript, the authors have proposed a novel convolutional neural network (CNN) model to detect an abnormal event in a surveillance scene. In this work, CNN is used in two ways. Firstly, it is used for both feature extraction and classification. In a second way, CNN is used for feature extraction, and support vector machine (SVM) is used for classification. Without any pre-processing, the proposed model gives better results compared to state-of-the-art methods. Experiments are carried out on four different publicly available benchmark datasets and one combined dataset, which contains all images of four datasets. The performance is measured by accuracy and area under the ROC (receiver operating characteristic) curve (AUC). The experimental results determine the efficacy of the proposed model.


Sign in / Sign up

Export Citation Format

Share Document