Rumor Detection on Twitter Using a Supervised Machine Learning Framework

This article describes how a rumor can be defined as a circulating unverified story or a doubtful truth. Rumor initiators seek social networks vulnerable to illimitable spread, therefore, online social media becomes their stage. Hence, this misinformation imposes colossal damage to individuals, organizations, and the government, etc. Existing work, analyzing temporal and linguistic characteristics of rumors seems to give ample time for rumor propagation. Meanwhile, with the huge outburst of data on social media, studying these characteristics for each tweet becomes spatially complex. Therefore, in this article, a two-fold supervised machine-learning framework is proposed that detects rumors by filtering and then analyzing their linguistic properties. This method attempts to automate filtering by training multiple classification algorithms with accuracy higher than 81.079%. Finally, using textual characteristics on the filtered data, rumors are detected. The effectiveness of the proposed framework is shown through extensive experiments on over 10,000 tweets.

Download Full-text

Exploring fake news identification using word and sentence embeddings

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189865 ◽

2021 ◽

pp. 1-8

Author(s):

V.T Priyanga ◽

J.P Sanjanasri ◽

Vijay Krishna Menon ◽

E.A Gopalakrishnan ◽

K.P Soman

Keyword(s):

Machine Learning ◽

Social Media ◽

Network Analysis ◽

Supervised Machine Learning ◽

Breeding Ground ◽

Fake News ◽

Data Set ◽

Highly Correlated ◽

Use Of Social Media ◽

The Liar

The widespread use of social media like Facebook, Twitter, Whatsapp, etc. has changed the way News is created and published; accessing news has become easy and inexpensive. However, the scale of usage and inability to moderate the content has made social media, a breeding ground for the circulation of fake news. Fake news is deliberately created either to increase the readership or disrupt the order in the society for political and commercial benefits. It is of paramount importance to identify and filter out fake news especially in democratic societies. Most existing methods for detecting fake news involve traditional supervised machine learning which has been quite ineffective. In this paper, we are analyzing word embedding features that can tell apart fake news from true news. We use the LIAR and ISOT data set. We churn out highly correlated news data from the entire data set by using cosine similarity and other such metrices, in order to distinguish their domains based on central topics. We then employ auto-encoders to detect and differentiate between true and fake news while also exploring their separability through network analysis.

Download Full-text

Fuzzy based feature engineering architecture for sentiment analysis of medical discussion over online social networks

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-202874 ◽

2021 ◽

pp. 1-13

Author(s):

C S Pavan Kumar ◽

L D Dhinesh Babu

Keyword(s):

Machine Learning ◽

Social Networks ◽

Social Media ◽

Sentiment Analysis ◽

Membership Function ◽

Online Social Networks ◽

Learning Model ◽

Feature Engineering ◽

Machine Learning Model ◽

Social Media Platforms

Sentiment analysis is widely used to retrieve the hidden sentiments in medical discussions over Online Social Networking platforms such as Twitter, Facebook, Instagram. People often tend to convey their feelings concerning their medical problems over social media platforms. Practitioners and health care workers have started to observe these discussions to assess the impact of health-related issues among the people. This helps in providing better care to improve the quality of life. Dementia is a serious disease in western countries like the United States of America and the United Kingdom, and the respective governments are providing facilities to the affected people. There is much chatter over social media platforms concerning the patients’ care, healthy measures to be followed to avoid disease, check early indications. These chatters have to be carefully monitored to help the officials take necessary precautions for the betterment of the affected. A novel Feature engineering architecture that involves feature-split for sentiment analysis of medical chatter over online social networks with the pipeline is proposed that can be used on any Machine Learning model. The proposed model used the fuzzy membership function in refining the outputs. The machine learning model has obtained sentiment score is subjected to fuzzification and defuzzification by using the trapezoid membership function and center of sums method, respectively. Three datasets are considered for comparison of the proposed and the regular model. The proposed approach delivered better results than the normal approach and is proved to be an effective approach for sentiment analysis of medical discussions over online social networks.

Download Full-text

Rumor Detection in Business Reviews Using Supervised Machine Learning

2018 5th International Conference on Behavioral, Economic, and Socio-Cultural Computing (BESC) ◽

10.1109/besc.2018.8697323 ◽

2018 ◽

Cited By ~ 2

Author(s):

Ammara Habib ◽

Saima Akbar ◽

Muhammad Zubair Asghar ◽

Asad Masood Khattak ◽

Rahman Ali ◽

...

Keyword(s):

Machine Learning ◽

Supervised Machine Learning ◽

Rumor Detection

Download Full-text

An Experimental Study of Spammer Detection on Chinese Microblogs

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s021819402040029x ◽

2020 ◽

Vol 30 (11n12) ◽

pp. 1759-1777

Author(s):

Jialing Liang ◽

Peiquan Jin ◽

Lin Mu ◽

Jie Zhao

Keyword(s):

Machine Learning ◽

Social Media ◽

User Behavior ◽

Real Data ◽

User Profile ◽

Data Set ◽

Sina Weibo ◽

Factors Affecting ◽

The Government ◽

Hot Event

With the development of Web 2.0, social media such as Twitter and Sina Weibo have become an essential platform for disseminating hot events. Simultaneously, due to the free policy of microblogging services, users can post user-generated content freely on microblogging platforms. Accordingly, more and more hot events on microblogging platforms have been labeled as spammers. Spammers will not only hurt the healthy development of social media but also introduce many economic and social problems. Therefore, the government and enterprises must distinguish whether a hot event on microblogging platforms is a spammer or is a naturally-developing event. In this paper, we focus on the hot event list on Sina Weibo and collect the relevant microblogs of each hot event to study the detecting methods of spammers. Notably, we develop an integral feature set consisting of user profile, user behavior, and user relationships to reflect various factors affecting the detection of spammers. Then, we employ typical machine learning methods to conduct extensive experiments on detecting spammers. We use a real data set crawled from the most prominent Chinese microblogging platform, Sina Weibo, and evaluate the performance of 10 machine learning models with five sampling methods. The results in terms of various metrics show that the Random Forest model and the over-sampling method achieve the best accuracy in detecting spammers and non-spammers.

Download Full-text

Location-Based Online Social Networks: Location-Based Online Social Media, Location-Based Online Social Services

Encyclopedia of Social Network Analysis and Mining ◽

10.1007/978-1-4614-6170-8_100408 ◽

2014 ◽

pp. 820-820

Keyword(s):

Social Networks ◽

Social Media ◽

Social Services ◽

Online Social Networks ◽

Online Social Media

Download Full-text

I Know Where You Are Coming From: On the Impact of Social Media Sources on AI Model Performance (Student Abstract)

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i10.7258 ◽

2020 ◽

Vol 34 (10) ◽

pp. 13971-13972

Author(s):

Yang Qi ◽

Farseev Aleksandr ◽

Filchenkov Andrey

Keyword(s):

Machine Learning ◽

Social Networks ◽

Social Media ◽

Social Network ◽

Model Performance ◽

User Profiling ◽

Personalized Recommendation ◽

Modal Data ◽

Social Media Networks ◽

The Impact

Nowadays, social networks play a crucial role in human everyday life and no longer purely associated with spare time spending. In fact, instant communication with friends and colleagues has become an essential component of our daily interaction giving a raise of multiple new social network types emergence. By participating in such networks, individuals generate a multitude of data points that describe their activities from different perspectives and, for example, can be further used for applications such as personalized recommendation or user profiling. However, the impact of the different social media networks on machine learning model performance has not been studied comprehensively yet. Particularly, the literature on modeling multi-modal data from multiple social networks is relatively sparse, which had inspired us to take a deeper dive into the topic in this preliminary study. Specifically, in this work, we will study the performance of different machine learning models when being learned on multi-modal data from different social networks. Our initial experimental results reveal that social network choice impacts the performance and the proper selection of data source is crucial.

Download Full-text

SVM based Supervised Machine Learning Framework for Glaucoma Classification using Retinal Fundus Images

2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT) ◽

10.1109/csnt51715.2021.9509708 ◽

2021 ◽

Author(s):

Deepak R. Parashar ◽

Dheeraj K Agarwal

Keyword(s):

Machine Learning ◽

Supervised Machine Learning ◽

Fundus Images ◽

Learning Framework ◽

Retinal Fundus Images ◽

Retinal Fundus

Download Full-text

A supervised machine learning framework for smart tires

10.1109/rtsi50628.2021.9597342 ◽

2021 ◽

Author(s):

Salvatore Strano ◽

Mario Terzo ◽

Ciro Tordela

Keyword(s):

Machine Learning ◽

Supervised Machine Learning ◽

Learning Framework

Download Full-text

Sentiment Analysis in Crisis Situations for Better Connected Government

Web 2.0 and Cloud Technologies for Implementing Connected Government - Advances in Electronic Government, Digital Divide, and Regional Development ◽

10.4018/978-1-7998-4570-6.ch008 ◽

2021 ◽

pp. 162-181

Author(s):

Asdrúbal López Chau ◽

David Valle-Cruz ◽

Rodrigo Sandoval-Almazán

Keyword(s):

Social Networks ◽

Social Media ◽

Sentiment Analysis ◽

Citizen Participation ◽

Important Means ◽

The Government ◽

Made In ◽

Crisis Situations

One of the pillars of connected government is citizen centricity: an approach in which citizen participation is essential. In Mexico, social networks are currently one of the most important means by which citizens express their needs and provide opinions to the government. The goal of this chapter is to contribute to citizen centricity by adapting the methodology of sentiment analysis of social media posts to an expanded version for crisis situations. The main difference in this approach from the normally accepted one is that instead of using pre-defined classes (positive and negative) for sentiments, the authors first determined the different data categories and then applied them to the classic process of sentiment analysis. This approach was tested using posts on Mexico's earthquake in 2017. They found that needs, demands, and claims made in the posts reflect sentiments in a better way, and this can help to improve the government-citizen connection.

Download Full-text