Cyberbullying detection in social media text based on character‐level convolutional neural network with shortcuts

This article describes how spam detection in the social media text is becoming increasing important because of the exponential increase in the spam volume over the network. It is challenging, especially in case of text within the limited number of characters. Effective spam detection requires more number of efficient features to be learned. In the current article, the use of a deep learning technology known as a convolutional neural network (CNN) is proposed for spam detection with an added semantic layer on the top of it. The resultant model is known as a semantic convolutional neural network (SCNN). A semantic layer is composed of training the random word vectors with the help of Word2vec to get the semantically enriched word embedding. WordNet and ConceptNet are used to find the word similar to a given word, in case it is missing in the word2vec. The architecture is evaluated on two corpora: SMS Spam dataset (UCI repository) and Twitter dataset (Tweets scrapped from public live tweets). The authors' approach outperforms the-state-of-the-art results with 98.65% accuracy on SMS spam dataset and 94.40% accuracy on Twitter dataset.

Download Full-text

CNN based efficient image classification system for smartphone device

10.21203/rs.3.rs-428430/v1 ◽

2021 ◽

Author(s):

Mayank Mishra ◽

Tanupriya Choudhury ◽

Tanmay Sarkar

Keyword(s):

Neural Network ◽

Social Media ◽

Convolutional Neural Network ◽

Image Classification ◽

Classification System ◽

Text Messaging ◽

Motivational Messages ◽

Social Media Text ◽

Social Media Platforms ◽

Optimum Model

Abstract In our work, we look to classify images that make their way into our smartphone devices through various social-media text-messaging platforms. We aim at classifying images into three broad categories: document-based images, quote-based images, and photographs. People, especially students, share many document-based images that include snapshots of essential emails, handwritten notes, articles, etc. Quote based images, consisting of birthday wishes, motivational messages, festival greetings, etc., are among the highly shared images on social media platforms. A significant share of images constitutes photographs of people, including group photographs, selfies, portraits, etc. We train various convolutional neural network (CNN) based models on our self-made dataset and compare their results to find our task’s optimum model.

Download Full-text

Spam Detection on Social Media Using Semantic Convolutional Neural Network

Deep Learning and Neural Networks ◽

10.4018/978-1-7998-0414-7.ch039 ◽

2020 ◽

pp. 704-719

Author(s):

Gauri Jain ◽

Manisha Sharma ◽

Basant Agarwal

Keyword(s):

Neural Network ◽

Social Media ◽

Deep Learning ◽

Convolutional Neural Network ◽

Spam Detection ◽

Learning Technology ◽

The Social ◽

Social Media Text ◽

Current Article ◽

Semantic Layer

This article describes how spam detection in the social media text is becoming increasing important because of the exponential increase in the spam volume over the network. It is challenging, especially in case of text within the limited number of characters. Effective spam detection requires more number of efficient features to be learned. In the current article, the use of a deep learning technology known as a convolutional neural network (CNN) is proposed for spam detection with an added semantic layer on the top of it. The resultant model is known as a semantic convolutional neural network (SCNN). A semantic layer is composed of training the random word vectors with the help of Word2vec to get the semantically enriched word embedding. WordNet and ConceptNet are used to find the word similar to a given word, in case it is missing in the word2vec. The architecture is evaluated on two corpora: SMS Spam dataset (UCI repository) and Twitter dataset (Tweets scrapped from public live tweets). The authors' approach outperforms the-state-of-the-art results with 98.65% accuracy on SMS spam dataset and 94.40% accuracy on Twitter dataset.

Download Full-text

A convolutional neural network for traffic information sensing from social media text

2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC) ◽

10.1109/itsc.2017.8317650 ◽

2017 ◽

Cited By ~ 2

Author(s):

Yuanyuan Chen ◽

Yisheng Lv ◽

Xiao Wang ◽

Fei-Yue Wang

Keyword(s):

Neural Network ◽

Social Media ◽

Convolutional Neural Network ◽

Traffic Information ◽

Social Media Text

Download Full-text

Deep Neural Network Based Sentence Boundary Detection and End Marker Suggestion for Social Media Text

2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS) ◽

10.1109/icccis48478.2019.8974495 ◽

2019 ◽

Author(s):

Jagroop Kaur ◽

Jaswinder Singh

Keyword(s):

Neural Network ◽

Social Media ◽

Deep Neural Network ◽

Boundary Detection ◽

Social Media Text ◽

Sentence Boundary

Download Full-text

Evaluation of COVID-19 spread effect on the Commercial Instagram Posts (CIPs) using ANN: a case study on Holy Shrine, Mashhad, Iran

10.20944/preprints202102.0554.v1 ◽

2021 ◽

Author(s):

Mohammad Javad Shooshtari ◽

Hossein Etemadfard ◽

Rouzbeh Shad

Keyword(s):

Neural Network ◽

Neural Networks ◽

Social Media ◽

Convolutional Neural Network ◽

Business Owners ◽

Shopping Centers ◽

Multi Layer Perceptron ◽

Commercial Activities ◽

Enormous Amount

The widespread deployment of social media has helped researchers access an enormous amount of data in various domains, including the pandemic caused by the COVID-19 spread. This study presents a heuristic approach to classify Commercial Instagram Posts (CIPs) and explores how the businesses around the Holy Shrine – a sacred complex in Mashhad, Iran, surrounded by numerous shopping centers – were impacted by the pandemic. Two datasets of Instagram posts (one gathered data from March 14th to April 10th, 2020, when Holy Shrine and nearby shops were closed, and one extracted data from the same period in 2019), two word embedding models – aimed at vectorizing associated caption of each post, and two neural networks – multi-layer perceptron and convolutional neural network – were employed to classify CIPs in 2019. Among the scenarios defined for the 2019 CIPs classification, the results revealed that the combination of MLP and CBoW achieved the best performance, which was then used for the 2020 CIPs classification. It is found out that the fraction of CIPs to total Instagram posts has increased from 5.58% in 2019 to 8.08% in 2020, meaning that business owners were using Instagram to increase their sales and continue their commercial activities to compensate for the closure of their stores during the pandemic. Moreover, the portion of non-commercial Instagram posts (NCIPs) in total posts has decreased from 94.42% in 2019 to 91.92% in 2020, implying the fact that since the Holy Shrine was closed, Mashhad citizens and tourists could not visit it and take photos to post on their Instagram accounts.

Download Full-text

Neural User Response Generator: Fake News Detection with Collective User Intelligence

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/533 ◽

2018 ◽

Cited By ~ 24

Author(s):

Feng Qian ◽

Chengyue Gong ◽

Karishma Sharma ◽

Yan Liu

Keyword(s):

Neural Network ◽

Social Media ◽

Early Detection ◽

Convolutional Neural Network ◽

Semantic Information ◽

Experimental Results ◽

News Article ◽

Fake News ◽

Additional Information ◽

Word Level

Fake news on social media is a major challenge and studies have shown that fake news can propagate exponentially quickly in early stages. Therefore, we focus on early detection of fake news, and consider that only news article text is available at the time of detection, since additional information such as user responses and propagation patterns can be obtained only after the news spreads. However, we find historical user responses to previous articles are available and can be treated as soft semantic labels, that enrich the binary label of an article, by providing insights into why the article must be labeled as fake. We propose a novel Two-Level Convolutional Neural Network with User Response Generator (TCNN-URG) where TCNN captures semantic information from article text by representing it at the sentence and word level, and URG learns a generative model of user response to article text from historical user responses which it can use to generate responses to new articles in order to assist fake news detection. We conduct experiments on one available dataset and a larger dataset collected by ourselves. Experimental results show that TCNN-URG outperforms the baselines based on prior approaches that detect fake news from article text alone.

Download Full-text