An Improved Sentiment Analysis Approach to Detect Radical Content on Twitter

Social networks are used by terrorist groups and people who support them to propagate their ideas, ideologies, or doctrines and share their views on terrorism. To analyze tweets related to terrorism, several studies have been proposed in the literature. Some works rely on data mining algorithms; others use lexicon-based or machine learning sentiment analysis. Some recent works adopt other methods that combine multi-techniques. This paper proposes an improved approach for sentiment analysis of radical content related to terrorist activity on Twitter. Unlike other solutions, the proposed approach focuses on using a dictionary of weighted terms, the Word2vec method, and trigrams, with a classification based on fuzzy logic. The authors have conducted experiments with 600 manually annotated tweets and 200,000 automatically collected tweets in English and Arabic to evaluate this approach. The experimental results revealed that the new technique provides between 75% to 78% of precision for radicality detection and 61% to 64% to detect radicality degrees.

Download Full-text

Dr. Phish: Phishing Website Detector

E3S Web of Conferences ◽

10.1051/e3sconf/202129701032 ◽

2021 ◽

Vol 297 ◽

pp. 01032

Author(s):

Harish Kumar ◽

Anshal Prasad ◽

Ninad Rane ◽

Nilay Tamane ◽

Anjali Yeole

Keyword(s):

Machine Learning ◽

Data Mining ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Cyber Crime ◽

Data Mining Algorithms ◽

Learning Techniques ◽

Mining Algorithms ◽

Host Properties ◽

New Strategies

Phishing is a common attack on credulous people by making them disclose their unique information. It is a type of cyber-crime where false sites allure exploited people to give delicate data. This paper deals with methods for detecting phishing websites by analyzing various features of URLs by Machine learning techniques. This experimentation discusses the methods used for detection of phishing websites based on lexical features, host properties and page importance properties. We consider various data mining algorithms for evaluation of the features in order to get a better understanding of the structure of URLs that spread phishing. To protect end users from visiting these sites, we can try to identify the phishing URLs by analyzing their lexical and host-based features.A particular challenge in this domain is that criminals are constantly making new strategies to counter our defense measures. To succeed in this contest, we need Machine Learning algorithms that continually adapt to new examples and features of phishing URLs.

Download Full-text

Hybrid-Intelligent Mobile Indoor Location Using Wi-Fi Signals - Location Method Using Data Mining Algorithms and Type-2 Fuzzy Logic Systems

Proceedings of the 17th International Conference on Enterprise Information Systems ◽

10.5220/0005369806090615 ◽

2015 ◽

Author(s):

Manuel Castañón-Puga ◽

Abby Salazar-Corrales ◽

Carelia Gaxiola-Pacheco ◽

Guillermo Licea ◽

Miguel Flores-Parra ◽

...

Keyword(s):

Data Mining ◽

Fuzzy Logic ◽

Fuzzy Logic Systems ◽

Location Method ◽

Indoor Location ◽

Data Mining Algorithms ◽

Logic Systems ◽

Using Data ◽

Mining Algorithms

Download Full-text

Benchmarking Data Mining Algorithms

Data Warehousing and Web Engineering ◽

10.4018/978-1-931777-02-5.ch003 ◽

2011 ◽

pp. 77-99

Author(s):

Balaji Rajagopalan ◽

Ravi Krovi

Keyword(s):

Machine Learning ◽

Data Mining ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Successful Implementation ◽

Basic Premise ◽

Data Mining Algorithms ◽

External Data ◽

Mining Algorithms ◽

Careful Assessment

Data mining is the process of sifting through the mass of organizational (internal and external) data to identify patterns critical for decision support. Successful implementation of the data mining effort requires a careful assessment of the various tools and algorithms available. The basic premise of this study is that machine-learning algorithms, which are assumption free, should outperform their traditional counterparts when mining business databases. The objective of this study is to test this proposition by investigating the performance of the algorithms for several scenarios. The scenarios are based on simulations designed to reflect the extent to which typical statistical assumptions are violated in the business domain. The results of the computational experiments support the proposition that machine learning algorithms generally outperform their statistical counterparts under certain conditions. These can be used as prescriptive guidelines for the applicability of data mining techniques.

Download Full-text

Bio inspired Ensemble Feature Selection (BEFS) Model with Machine Learning and Data Mining Algorithms for Disease Risk Prediction

2019 5th International Conference On Computing, Communication, Control And Automation (ICCUBEA) ◽

10.1109/iccubea47591.2019.9129304 ◽

2019 ◽

Cited By ~ 1

Author(s):

Syed Javeed Pasha ◽

E. Syed Mohamed

Keyword(s):

Machine Learning ◽

Data Mining ◽

Feature Selection ◽

Risk Prediction ◽

Disease Risk ◽

Data Mining Algorithms ◽

Mining Algorithms

Download Full-text

Classification Techniques on Twitter Data: A Review

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2019.8.s2.2022 ◽

2019 ◽

Vol 8 (S2) ◽

pp. 66-69

Author(s):

S. Shafina Banu ◽

K. Syed Kousar Niasi ◽

E. Kannan

Keyword(s):

Data Mining ◽

Sentiment Analysis ◽

Analysis Data ◽

Business Decision ◽

Data Mining Algorithms ◽

Twitter Data ◽

The World ◽

Other Information ◽

Mining Algorithms ◽

Effective Analysis

Data mining is the practice of examining unknown patterns of data according to diverse viewpoints for classification into valuable information, which is composed and gathered in collective areas, such as data warehouses.For effective analysis, data mining algorithms enabling business decision making and other information necessities to eventually cut costs and raise revenue. Sentiment analysis is the method of defining the emotional tone behind a sequence of words, used to gain an accepting of the attitudes, opinions and emotions conveyed within an online mention. Sentiment analysis is tremendously useful in social media observing as it allows us to gain a synopsis of the broader public opinion behind definite topics. The applications of sentiment analysis are extensive and influential. The ability to abstract insights from social data is a practice that is being broadly adopted by organizations across the world. In this paper, we focused on sentiment analysis on the twitter data.

Download Full-text

Case Study: Political profiling based on Twitter Sentiment analysis for Big Data using Data Mining Algorithms

International Journal of Engineering Research and ◽

10.17577/ijertv5is020239 ◽

2016 ◽

Vol V5 (02) ◽

Author(s):

Shirin Hijaz Matwankar ◽

Dr. Shubhash K. Shinde ◽

Keyword(s):

Data Mining ◽

Big Data ◽

Sentiment Analysis ◽

Data Mining Algorithms ◽

Using Data ◽

Mining Algorithms

Download Full-text

Phishing websites blacklisting using machine learning algorithms

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i1.7.10646 ◽

2018 ◽

Vol 7 (1.7) ◽

pp. 179

Author(s):

Nivedhitha G ◽

Carmel Mary Belinda M.J ◽

Rupavathy N

Keyword(s):

Machine Learning ◽

Data Mining ◽

Feature Extraction ◽

Learning Algorithms ◽

Source Code ◽

Machine Learning Algorithms ◽

Paper Machine ◽

Data Mining Algorithms ◽

Mining Algorithms ◽

The Web

The development of the phishing sites is by all accounts amazing. Despite the fact that the web clients know about these sorts of phishing assaults, part of clients move toward becoming casualty to these assaults. Quantities of assaults are propelled with the point of making web clients trust that they are speaking with a trusted entity. Phishing is one among them. Phishing is consistently developing since it is anything but difficult to duplicate a whole site utilizing the HTML source code. By rolling out slight improvements in the source code, it is conceivable to guide the victim to the phishing site. Phishers utilize part of strategies to draw the unsuspected web client. Consequently an efficient mechanism is required to recognize the phishing sites from the real sites keeping in mind the end goal to spare credential data. To detect the phishing websites and to identify it as information leaking sites, the system proposes data mining algorithms. In this paper, machine-learning algorithms have been utilized for modeling the prediction task. The process of identity extraction and feature extraction are discussed in this paper and the various experiments carried out to discover the performance of the models are demonstrated.

Download Full-text