Systematic Literature Review Of Hate Speech Detection With Text Mining

Author(s):  
Rini Rini ◽  
Ema Utami ◽  
Anggit Dwi Hartanto
2020 ◽  
Vol 4 (4) ◽  
pp. 213
Author(s):  
Calvin Erico Rudy Salim ◽  
Derwin Suhartono

Hate speech is one of the most challenging problem internet is facing today. This systematic literature review examine hate speech detection problem and will be used to do an experimental approach on detecting hate speech and abusive language. This work also provide an overview of previous research, including methods, algorithms, and main features used. We use two research questions in this literature review which will be the foundation of the next experimental research. Correctly classifying a piece of text as an actual hate speech requires a lot of correctly labelled data. Most common challenges are different languages, out of vocabulary words, long range dependencies and many more. 


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 67698-67717 ◽  
Author(s):  
Amir Karami ◽  
Morgan Lundy ◽  
Frank Webb ◽  
Yogesh K. Dwivedi

2020 ◽  
Vol 120 (11) ◽  
pp. 2041-2065
Author(s):  
Ioanna Pavlidou ◽  
Savvas Papagiannidis ◽  
Eric Tsui

PurposeThis study is a systematic literature review of crowdsourcing that aims to present the research evidence so far regarding the extent to which it can contribute to organisational performance and produce innovations and provide insights on how organisations can operationalise it successfully.Design/methodology/approachThe systematic literature review revolved around a text mining methodology analysing 106 papers.FindingsThe themes identified are performance, innovation, operational aspects and motivations. The review revealed a few potential directions for future research in each of the themes considered.Practical implicationsThis study helps researchers to consider the recent themes on crowdsourcing and identify potential areas for research. At the same time, it provides practitioners with an understanding of the usefulness and process of crowdsourcing and insights on what the critical elements are in order to organise a successful crowdsourcing project.Originality/valueThis study employed quantitative content analysis in order to identify the main research themes with higher reliability and validity. It is also the first review on crowdsourcing that incorporates the relevant literature on crowdfunding as a value-creation tool.


2021 ◽  
Vol 7 (2) ◽  
pp. 226
Author(s):  
Angelina Pramana Thenata

Era sekarang jumlah berita dari berbagai media sosial yang tersebar dalam waktu singkat dan kebutuhan masyarakat untuk mengkonsumsi berita dalam berbagai referensi dapat mempengaruhi kehidupan masyarakat. Hal ini menyebabkan data yang tersebar dapat dikumpulkan dan dimanfaatkan oleh pemerintah, pengusaha, analisis, ataupun peneliti untuk mengidentifikasi tren, mengembangkan bisnis, memprediksi perilaku pelanggan dan lain sebagainya. Pengumpulan data berita dari media sosial tersebut dapat menggunakan text mining yang melibatkan algoritma yakni Naive Bayes, K-NN, dan SVM. Namun, penggunaan algoritma pada studi kasus yang tidak sesuai dapat memberikan hasil yang tidak optimal. Oleh karena itu, penelitian ini akan menganalisis algoritma text mining yang diimplementasikan pada media sosial berbahasa Indonesia dengan memakai metode systematic literature review. Metode ini dimulai dengan melakukan tahap planning yang menetapkan pertanyaan penelitian, kata pencarian, sumber literatur digital, dan standard literatur. Dilanjutkan dengan tahap conducting yang memilih dan mencocokan standard literatur, serta ekstraksi data. Kemudian tahap reporting yang melakukan analisis hasil ekstraksi data sehingga bisa menemumkan informasi dan pengetahuan. Tolak ukur yang menjadi acuan untuk perbandingan yakni pengujian confusion matrix berupa accuracy, precision, dan recall. Adapun hasil dari penelitian ini ditemukan algoritma Naive Bayes memberikan hasil yang stabil tapi kurang optimal jika diterapkan pada studi kasus media sosial berbahasa Indonesia. Sedangkan algortima K-NN dan SVM ditemukan memberikan hasil yang optimal jika diterapkan pada studi kasus media sosial berbahasa Indonesia yang dibuktikan dengan accuracy (50%-98.13%), precision (58.22%-98.48%), dan recall (21.05%-98%).  


Energies ◽  
2021 ◽  
Vol 14 (22) ◽  
pp. 7810
Author(s):  
Ahmed Abdelaziz ◽  
Vitor Santos ◽  
Miguel Sales Dias

The high level of energy consumption of buildings is significantly influencing occupant behavior changes towards improved energy efficiency. This paper introduces a systematic literature review with two objectives: to understand the more relevant factors affecting energy consumption of buildings and to find the best intelligent computing (IC) methods capable of classifying and predicting energy consumption of different types of buildings. Adopting the PRISMA method, the paper analyzed 822 manuscripts from 2013 to 2020 and focused on 106, based on title and abstract screening and on manuscripts with experiments. A text mining process and a bibliometric map tool (VOS viewer) were adopted to find the most used terms and their relationships, in the energy and IC domains. Our approach shows that the terms “consumption,” “residential,” and “electricity” are the more relevant terms in the energy domain, in terms of the ratio of important terms (TITs), whereas “cluster” is the more commonly used term in the IC domain. The paper also shows that there are strong relations between “Residential Energy Consumption” and “Electricity Consumption,” “Heating” and “Climate. Finally, we checked and analyzed 41 manuscripts in detail, summarized their major contributions, and identified several research gaps that provide hints for further research.


2022 ◽  
Vol 14 (1) ◽  
pp. 0-0

Automatic hate speech detection on social media is becoming an outstanding concern in modern countries. Indeed, hate speech towards people brings about violent acts and social chaos, hence law prohibits it, and it engenders moral and legal implications. It is crucial that we can precisely categorize the hate speech, and not a hate speech automatically, while this allows us to identify easily real people who represent a threat for our society, and who wrongly regard as hateful speakers. In this paper, we applied a complete text mining process and Naïve Bayes machine learning classification algorithm to two different data sets (tweets_Num1 and tweets_Num2) taken from Twitter, to better classify tweets. The results obtained demonstrate that our model performed well regarding different metrics based on the confusion matrix including the accuracy metric, which achieved 87. 23% on the first dataset, and 93. 06% on the second.


Sign in / Sign up

Export Citation Format

Share Document