Multimodal social media video classification with deep neural networks

Author(s):  
Tomasz Trzcinski
2020, Vol. 79 (35–36), pp. 26197–26223
Author(s):  
Jorge Pereira ◽  
João Monteiro ◽  
Joel Silva ◽  
Jacinto Estima ◽  
Bruno Martins

Author(s):  
S Thivaharan ◽  
G Srivatsun

Modern communication devices generate enormous amounts of data, reaching petabytes, and the rate of generation is increasing at an unprecedented pace. Although modern technology supports storage at massive scale, the industry is reluctant to retain data with the following characteristics: redundancy, unformatted records with outdated information, data that misleads prediction, and data with no impact on class prediction. Among these sources, social media plays a significant role: compared with other data generators, the rate at which social media produces data is considerably higher. Industry and governments alike are worried about the circulation of malicious or objectionable content, as such content is highly susceptible to exploitation by criminals. It is therefore high time to develop a model that classifies social media content as fair or unfair, with high accuracy in predicting the class of the content. In this article, TensorFlow-based deep neural networks are deployed with a fixed epoch count of 15, attaining 25% higher accuracy than existing models. Activation functions such as ReLU and sigmoid, as provided by the TensorFlow platform, support the improved prediction accuracy.
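The abstract fixes only the epoch count (15) and the ReLU/sigmoid activations; the layer sizes and overall architecture below are assumptions, sketched with `tf.keras` as one plausible shape of such a binary fair/unfair content classifier:

```python
import numpy as np
import tensorflow as tf

def build_content_classifier(vocab_size=10000, embed_dim=64):
    # Hypothetical architecture: the abstract does not specify layer sizes,
    # only the activation functions and the fixed epoch count.
    model = tf.keras.Sequential([
        tf.keras.layers.Embedding(vocab_size, embed_dim),
        tf.keras.layers.GlobalAveragePooling1D(),
        tf.keras.layers.Dense(32, activation="relu"),    # ReLU hidden layer
        tf.keras.layers.Dense(1, activation="sigmoid"),  # sigmoid output: fair vs. unfair
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model

model = build_content_classifier()
# model.fit(x_train, y_train, epochs=15)  # fixed epoch count from the abstract
```

The sigmoid output squashes the final logit into (0, 1), so the prediction can be read directly as the probability of the "unfair" class.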


Author(s):  
Abdullah Talha Kabakus

As a natural consequence of the many advantages they offer, social media platforms have become part of daily life. Recent studies emphasize the need for an automated way of detecting offensive posts on social media, since such 'toxic' posts have become pervasive. To this end, a novel toxic post detection approach based on deep neural networks is proposed in this study. Given that several word embedding methods exist, we shed light on which method produces better results when employed with the five most common types of deep neural networks. The word vectors for the given comments were obtained through four different methods, including the embedding layer of the deep neural networks themselves. In total, twenty benchmark models were proposed and both trained and evaluated on a gold standard dataset of tweets. According to the experimental results, the best score was obtained by the proposed model without pre-trained word vectors, which outperformed state-of-the-art works and implies the effective embedding ability of learned embedding layers. Another key finding from the conducted experiments is that the models that constructed word embeddings through their embedding layers obtained higher scores and converged much faster than the models that utilized pre-trained word vectors.
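The core comparison in this study, a learned embedding layer versus frozen pre-trained word vectors, can be sketched in `tf.keras`. The BiLSTM backbone and all sizes are assumptions (the specific model names were lost from the abstract); only the embedding-layer contrast comes from the text:

```python
import numpy as np
import tensorflow as tf

def build_toxicity_model(vocab_size=20000, embed_dim=100, pretrained=None):
    # `pretrained` is an optional (vocab_size, embed_dim) matrix of
    # pre-trained word vectors; None lets the embedding layer learn
    # vectors jointly with the classifier, the setting that performed
    # best in the study's experiments.
    embedding = tf.keras.layers.Embedding(vocab_size, embed_dim,
                                          trainable=pretrained is None)
    model = tf.keras.Sequential([
        embedding,
        tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(32)),  # hypothetical backbone
        tf.keras.layers.Dense(1, activation="sigmoid"),           # toxic vs. non-toxic
    ])
    model.build(input_shape=(None, None))
    if pretrained is not None:
        embedding.set_weights([pretrained])  # freeze the supplied vectors
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```

Training both variants on the same tweets and comparing validation scores reproduces the study's experimental contrast between learned and pre-trained embeddings.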


Author(s):  
Rukiye Savran Kızıltepe ◽  
John Q. Gan ◽  
Juan José Escobar

Combining convolutional neural networks (CNNs) and recurrent neural networks (RNNs) produces a powerful architecture for video classification problems, as spatial–temporal information can be processed simultaneously and effectively. Using transfer learning, this paper presents a comparative study to investigate how temporal information can be utilized to improve the performance of video classification when CNNs and RNNs are combined in various architectures. To enhance the performance of the identified architecture for effective combination of CNN and RNN, a novel action template-based keyframe extraction method is proposed by identifying the informative region of each frame and selecting keyframes based on the similarity between those regions. Extensive experiments on KTH and UCF-101 datasets with ConvLSTM-based video classifiers have been conducted. Experimental results are evaluated using one-way analysis of variance, which reveals the effectiveness of the proposed keyframe extraction method in the sense that it can significantly improve video classification accuracy.
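The keyframe-selection idea (keep a frame only when its informative region has drifted from the last kept keyframe) can be sketched with NumPy. The central-crop "informative region" and the cosine-similarity threshold here are simplifying assumptions; the paper's action-template method is more elaborate:

```python
import numpy as np

def select_keyframes(frames, threshold=0.9):
    """Greedy keyframe selection over a (T, H, W) array of grayscale frames.

    A frame becomes a keyframe when the cosine similarity between its
    informative region and that of the last kept keyframe drops below
    `threshold`. The informative region is approximated by the central
    crop; the action-template method in the paper is more sophisticated.
    """
    def region(frame):
        h, w = frame.shape[:2]
        return frame[h // 4: 3 * h // 4, w // 4: 3 * w // 4].astype(float).ravel()

    keep = [0]                      # always keep the first frame
    ref = region(frames[0])
    for i in range(1, len(frames)):
        cur = region(frames[i])
        sim = cur @ ref / (np.linalg.norm(cur) * np.linalg.norm(ref) + 1e-8)
        if sim < threshold:         # region changed enough: new keyframe
            keep.append(i)
            ref = cur
    return keep
```

Feeding only the selected keyframes to a ConvLSTM classifier shortens the input sequence while retaining the frames that carry distinct spatial information.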

