Effective hate-speech detection in Twitter data using recurrent neural networks

Georgios K. Pitsilis; Heri Ramampiaro; Helge Langseth

doi:10.1007/s10489-018-1242-y

Effective hate-speech detection in Twitter data using recurrent neural networks

Applied Intelligence ◽

10.1007/s10489-018-1242-y ◽

2018 ◽

Vol 48 (12) ◽

pp. 4730-4742 ◽

Cited By ~ 24

Author(s):

Georgios K. Pitsilis ◽

Heri Ramampiaro ◽

Helge Langseth

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Hate Speech ◽

Speech Detection ◽

Twitter Data

Download Full-text

Hate speech detection in Twitter using hybrid embeddings and improved cuckoo search-based neural networks

International Journal of Intelligent Computing and Cybernetics ◽

10.1108/ijicc-06-2020-0061 ◽

2020 ◽

Vol 13 (4) ◽

pp. 485-525

Author(s):

Femi Emmanuel Ayo ◽

Olusegun Folorunso ◽

Friday Thomas Ibharalu ◽

Idowu Ademola Osinuga

Keyword(s):

Neural Network ◽

Neural Networks ◽

Feature Extraction ◽

Hate Speech ◽

Short Term Memory ◽

Cuckoo Search ◽

Research Attention ◽

Content Type ◽

Speech Detection ◽

Sentence Level

PurposeHate speech is an expression of intense hatred. Twitter has become a popular analytical tool for the prediction and monitoring of abusive behaviors. Hate speech detection with social media data has witnessed special research attention in recent studies, hence, the need to design a generic metadata architecture and efficient feature extraction technique to enhance hate speech detection.Design/methodology/approachThis study proposes a hybrid embeddings enhanced with a topic inference method and an improved cuckoo search neural network for hate speech detection in Twitter data. The proposed method uses a hybrid embeddings technique that includes Term Frequency-Inverse Document Frequency (TF-IDF) for word-level feature extraction and Long Short Term Memory (LSTM) which is a variant of recurrent neural networks architecture for sentence-level feature extraction. The extracted features from the hybrid embeddings then serve as input into the improved cuckoo search neural network for the prediction of a tweet as hate speech, offensive language or neither.FindingsThe proposed method showed better results when tested on the collected Twitter datasets compared to other related methods. In order to validate the performances of the proposed method, t-test and post hoc multiple comparisons were used to compare the significance and means of the proposed method with other related methods for hate speech detection. Furthermore, Paired Sample t-Test was also conducted to validate the performances of the proposed method with other related methods.Research limitations/implicationsFinally, the evaluation results showed that the proposed method outperforms other related methods with mean F1-score of 91.3.Originality/valueThe main novelty of this study is the use of an automatic topic spotting measure based on naïve Bayes model to improve features representation.

Download Full-text

INF-HatEval at SemEval-2019 Task 5: Convolutional Neural Networks for Hate Speech Detection Against Women and Immigrants on Twitter

10.18653/v1/s19-2074 ◽

2019 ◽

Author(s):

Alison Ribeiro ◽

Nádia Silva

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Hate Speech ◽

Speech Detection

Download Full-text

Hate Speech Detection on Multilingual Twitter Using Convolutional Neural Networks

Revue d intelligence artificielle ◽

10.18280/ria.340111 ◽

2020 ◽

Vol 34 (1) ◽

pp. 81-88

Author(s):

Aya Elouali ◽

Zakaria Elberrichi ◽

Nadia Elouali

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Hate Speech ◽

Speech Detection

Download Full-text

Ranking Convolutional Recurrent Neural Networks for Purchase Stage Identification on Imbalanced Twitter Data

10.18653/v1/e17-2094 ◽

2017 ◽

Author(s):

Heike Adel ◽

Francine Chen ◽

Yan-Ying Chen

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Twitter Data

Download Full-text

UTFPR at SemEval-2019 Task 5: Hate Speech Identification with Recurrent Neural Networks

10.18653/v1/s19-2093 ◽

2019 ◽

Author(s):

Gustavo Henrique Paetzold ◽

Marcos Zampieri ◽

Shervin Malmasi

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Hate Speech ◽

Speech Identification

Download Full-text

HanSEL: Italian Hate Speech detection through Ensemble Learning and Deep Neural Networks

EVALITA Evaluation of NLP and Speech Tools for Italian ◽

10.4000/books.aaccademia.4766 ◽

2018 ◽

pp. 224-229 ◽

Cited By ~ 1

Author(s):

Marco Polignano ◽

Pierpaolo Basile

Keyword(s):

Neural Networks ◽

Ensemble Learning ◽

Deep Neural Networks ◽

Hate Speech ◽

Speech Detection

Download Full-text

Hate Speech Detection in Roman Urdu

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3414524 ◽

2021 ◽

Vol 20 (1) ◽

pp. 1-19

Author(s):

Muhammad Moin Khan ◽

Khurram Shahzad ◽

Muhammad Kamran Malik

Keyword(s):

Deep Learning ◽

Hate Speech ◽

Simple Complex ◽

Speech Detection ◽

European Languages ◽

Twitter Data ◽

Learning Techniques ◽

The Social ◽

Learning Technique ◽

Asian Languages

Hate speech is a specific type of controversial content that is widely legislated as a crime that must be identified and blocked. However, due to the sheer volume and velocity of the Twitter data stream, hate speech detection cannot be performed manually. To address this issue, several studies have been conducted for hate speech detection in European languages, whereas little attention has been paid to low-resource South Asian languages, making the social media vulnerable for millions of users. In particular, to the best of our knowledge, no study has been conducted for hate speech detection in Roman Urdu text, which is widely used in the sub-continent. In this study, we have scrapped more than 90,000 tweets and manually parsed them to identify 5,000 Roman Urdu tweets. Subsequently, we have employed an iterative approach to develop guidelines and used them for generating the Hate Speech Roman Urdu 2020 corpus. The tweets in the this corpus are classified at three levels: Neutral-Hostile, Simple-Complex, and Offensive-Hate speech. As another contribution, we have used five supervised learning techniques, including a deep learning technique, to evaluate and compare their effectiveness for hate speech detection. The results show that Logistic Regression outperformed all other techniques, including deep learning techniques for the two levels of classification, by achieved an F1 score of 0.906 for distinguishing between Neutral-Hostile tweets, and 0.756 for distinguishing between Offensive-Hate speech tweets.

Download Full-text

Spike timing-dependent plasticity in sparse recurrent neural networks

IEICE Proceeding Series ◽

10.15248/proc.1.485 ◽

2014 ◽

Vol 1 ◽

pp. 485-488

Author(s):

Hideyuki Kato ◽

Tohru Ikeguchi

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Spike Timing ◽

Spike Timing Dependent Plasticity ◽

Dependent Plasticity

Download Full-text

Direct Adaptive Control of Process Systems Using Recurrent Neural Networks

1992 American Control Conference ◽

10.23919/acc.1992.4792020 ◽

1992 ◽

Author(s):

Sanjay Parthasarathy ◽

Alexander G. Parlos ◽

Amir F. Atiya

Keyword(s):

Neural Networks ◽

Adaptive Control ◽

Recurrent Neural Networks ◽

Process Systems ◽

Direct Adaptive Control

Download Full-text

L2 approximation properties of recurrent neural networks

1997 European Control Conference (ECC) ◽

10.23919/ecc.1997.7082360 ◽

1997 ◽

Cited By ~ 1

Author(s):

A. Ruiz ◽

D.H. Owens ◽

S. Townley

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Approximation Properties

Download Full-text