SOCIAL MEDIA DATA PROCESSING AND ANALYSIS BY MEANS OF MACHINE LEARNING FOR RAPID DETECTION, ASSESSMENT AND MAPPING THE IMPACT OF DISASTERS

Abstract. The main factors determining whether data obtained from social media can serve as a source of information about emergency threats are their relevance and accuracy. An important task, therefore, is to define metrics for evaluating these parameters for a specific social media publication. This information channel is also important as a source of eyewitness accounts from the scene. A comparison of social media data with official sources shows that social media contain a significant amount of unique information at different stages of an emergency's development. Moreover, when monitoring a specific event, social media provide more relevant information than official sources. Another important task is to find emergency messages and localize them in space as accurately as possible. A promising approach to analyzing and processing social media data during emergency response is the application of artificial intelligence methods, particularly machine learning techniques.

Sentiment Analysis in Social Media using Machine Learning Techniques

Iraqi Journal of Science ◽

10.24996/ijs.2020.61.1.22 ◽

2020 ◽

pp. 193-201 ◽

Cited By ~ 1

Author(s):

Hayder A. Alatabi ◽

Ayad R. Abbas

Keyword(s):

Machine Learning ◽

Social Media ◽

Sentiment Analysis ◽

Machine Learning Techniques ◽

Great Success ◽

Social Media Data ◽

Learning Techniques ◽

The World ◽

Analysis System ◽

Media Data

Social media have recently achieved widespread use worldwide: statistics indicate that more than three billion people are on social media, generating large quantities of online data. To analyze these data, a classification method known as sentiment analysis is used. This paper presents a new sentiment analysis system based on machine learning techniques, which aims to extract polarity from social media texts. Sentiment analysis based on machine learning techniques has achieved great success around the world. This paper investigates the topic and proposes a sentiment analysis system built on the Bayesian Rough Decision Tree (BRDT) algorithm. The experimental results show that the system achieves an accuracy of more than 95% on social media data.

Geo-Tagged Social Media Data-Based Analytical Approach for Perceiving Impacts of Social Events

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi8010015 ◽

2018 ◽

Vol 8 (1) ◽

pp. 15 ◽

Cited By ~ 4

Author(s):

Ruoxin Zhu ◽

Diao Lin ◽

Michael Jendryke ◽

Chenyu Zuo ◽

Linfang Ding ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Research Work ◽

City Management ◽

Social Media Data ◽

Social Events ◽

Time Space ◽

The Impact ◽

Media Data

Studying the impact of social events is important for the sustainable development of society. Given the growing popularity of social media applications, social sensing networks with users acting as smart social sensors provide a unique channel for understanding social events. Current research on social events through geo-tagged social media is mainly focused on the extraction of information about when, where, and what happened, i.e., event detection. There is a trend towards the machine learning of more complex events from even larger input data. This research work will undoubtedly lead to a better understanding of big geo-data. In this study, however, we start from known or detected events, raising further questions on how they happened, how they affect people’s lives, and for how long. By combining machine learning, natural language processing, and visualization methods in a generic analytical framework, we attempt to interpret the impact of known social events from the dimensions of time, space, and semantics based on geo-tagged social media data. The whole analysis process consists of four parts: (1) preprocessing; (2) extraction of event-related information; (3) analysis of event impact; and (4) visualization. We conducted a case study on the “2014 Shanghai Stampede” event on the basis of Chinese Sina Weibo data. The results are visualized in various ways, thus ensuring the feasibility and effectiveness of our proposed framework. Both the methods and the case study can serve as decision references for situational awareness and city management.

Social media data analysis to predict mental state of users using machine learning techniques

Journal of Education and Health Promotion ◽

10.4103/jehp.jehp_446_20 ◽

2021 ◽

Vol 10 (1) ◽

pp. 301

Author(s):

R Lokeshkumar ◽

OmAshish Mishra ◽

Shivam Kalra

Keyword(s):

Machine Learning ◽

Social Media ◽

Data Analysis ◽

Mental State ◽

Machine Learning Techniques ◽

Social Media Data ◽

Learning Techniques ◽

Media Data

Hybrid features prediction model of movie quality using Multi-machine learning techniques for effective business resource planning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201844 ◽

2021 ◽

Vol 40 (5) ◽

pp. 9361-9382 ◽

Cited By ~ 1

Author(s):

Naeem Iqbal ◽

Rashid Ahmad ◽

Faisal Jamil ◽

Do-Hyeun Kim

Keyword(s):

Machine Learning ◽

Social Media ◽

Resource Planning ◽

Experimental Results ◽

Quality Prediction ◽

Classification Models ◽

Hybrid Features ◽

Social Media Data ◽

Media Data

Quality prediction plays an essential role in the business outcome of a product. Because of its business relevance, it has been studied extensively in the last few years. With advances in machine learning (ML) techniques and the advent of robust and sophisticated ML algorithms, it has become necessary to analyze the factors influencing the success of movies. This paper presents a hybrid features prediction model based on pre-released and social media data features, using multiple ML techniques to predict the quality of pre-released movies for effective business resource planning. The study aims to integrate pre-released and social media data features into a hybrid features-based movie quality prediction (MQP) model. The proposed model comprises two experimental models: (i) predicting movie quality using the original set of features, and (ii) deriving a subset of features with principal component analysis to predict the movie success class. The work implements several ML-based classification models to predict movie quality: Decision Tree (DT), Support Vector Machines with linear and quadratic kernels (L-SVM and Q-SVM), Logistic Regression (LR), Bagged Tree (BT), and Boosted Tree (BOT). The classification models are evaluated with several performance measures: Accuracy (AC), Precision (PR), Recall (RE), and F-Measure (FM). The experimental results show that the BT and BOT classifiers produced higher accuracy than the other classifiers (DT, LR, L-SVM, and Q-SVM). The BT and BOT classifiers achieved accuracies of 90.1% and 89.7%, respectively, which shows the efficiency of the proposed MQP model compared to other state-of-the-art techniques. The proposed work is also compared with existing prediction models, and the experimental results indicate that the proposed MQP model performed slightly better than the other models. The experimental results will help the movie industry plan business resources effectively, such as investment, number of screens, and release date.
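The four performance measures named in the abstract (Accuracy, Precision, Recall, and F-Measure) can be illustrated with a minimal sketch for the binary case. The function name `classification_metrics` and the confusion-matrix counts below are illustrative assumptions, not values or code from the paper, and the F-Measure is assumed to be the standard F1 score (harmonic mean of precision and recall).

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute standard binary classification measures from confusion-matrix counts.

    tp, fp, fn, tn: true positives, false positives, false negatives, true negatives.
    Ratios with a zero denominator are reported as 0.0.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-Measure assumed here to be F1: the harmonic mean of precision and recall.
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Hypothetical counts, chosen only to exercise the function.
metrics = classification_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

For a multi-class task such as predicting a movie success class, these measures would typically be computed per class and then averaged; the averaging scheme used in the paper is not stated in the abstract.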

Predicting ethnicity with data on personal names in Russia

10.31235/osf.io/wf6p4 ◽

2021 ◽

Author(s):

Alexey Bessudnov ◽

Denis Tarasov ◽

Viacheslav Panasovets ◽

Veronica Kostenko ◽

Ivan Smirnov ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Ethnic Groups ◽

Geographical Location ◽

Ethnic Relations ◽

Social Media Data ◽

Personal Names ◽

Learning Classifier ◽

Media Data

In this paper we develop a machine learning classifier that predicts perceived ethnicity from data on personal names for the major ethnic groups populating Russia. We collect data from VK, the largest Russian social media website. Ethnicity is determined from the languages spoken by users and their geographical location, with the data manually cleaned by crowd workers. The classifier achieves an accuracy of 0.82 for a scheme with 24 ethnic groups and 0.92 for 15 aggregated ethnic groups. It can be used for research on ethnicity and ethnic relations in Russia, in particular with VK and other social media data.

Communication Sentiment Analyzer using Machine Learning with Naive Bayes Bernoullinb

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a1610.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 5976-5979

Keyword(s):

Machine Learning ◽

Social Media ◽

Major Part ◽

Naive Bayes ◽

Naïve Bayes ◽

User Preferences ◽

Social Media Data ◽

Machine Learning Model ◽

The World ◽

Media Data

In the current social media era, it is estimated that over 5 billion people use smartphones, of whom over 1.5 billion are active users. We are all part of this group, and before opening a message we are curious about what we have received, always hoping for good news. Sentiment analysis of social media data is therefore seen by many as an effective tool for monitoring user preferences and inclinations. We propose a scalable machine learning model that analyzes the polarity of a communicative text using the Bernoulli Naive Bayes classifier. The paper considers only two polarities: whether a sentence is positive or negative. The Bernoulli classifier is used because it is best suited to binary inputs, which in turn raises accuracy to as much as 97%.
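How a Bernoulli Naive Bayes classifier scores a text from binary word-presence features can be shown with a small, self-contained sketch. This is not the authors' implementation: the class name `TinyBernoulliNB`, the whitespace tokenizer, the Laplace smoothing parameter `alpha`, and the four training sentences are all assumptions made for illustration.

```python
import math


class TinyBernoulliNB:
    """Minimal Bernoulli Naive Bayes over binary word-presence features."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing strength (assumed value)

    @staticmethod
    def _tokens(text):
        # Each word counts once per document: presence or absence only.
        return set(text.lower().split())

    def fit(self, docs, labels):
        token_sets = [self._tokens(d) for d in docs]
        self.vocab = sorted(set().union(*token_sets))
        self.classes = sorted(set(labels))
        self.log_prior = {}
        self.word_prob = {}
        for c in self.classes:
            in_class = [t for t, y in zip(token_sets, labels) if y == c]
            self.log_prior[c] = math.log(len(in_class) / len(docs))
            # Smoothed estimate of P(word present | class).
            self.word_prob[c] = {
                w: (sum(w in t for t in in_class) + self.alpha)
                / (len(in_class) + 2 * self.alpha)
                for w in self.vocab
            }
        return self

    def predict(self, text):
        present = self._tokens(text)
        scores = {}
        for c in self.classes:
            score = self.log_prior[c]
            for w in self.vocab:
                p = self.word_prob[c][w]
                # Absent words also contribute, via log(1 - p).
                score += math.log(p) if w in present else math.log(1.0 - p)
            scores[c] = score
        return max(scores, key=scores.get)


# Hypothetical toy training data with two polarities.
train_docs = [
    "good great day",
    "love this great news",
    "bad awful day",
    "hate this awful news",
]
train_labels = ["positive", "positive", "negative", "negative"]

model = TinyBernoulliNB().fit(train_docs, train_labels)
print(model.predict("great news"))
print(model.predict("awful day"))
```

The distinguishing feature of the Bernoulli variant is that words absent from a message still contribute to the score, which is why it fits binary present/absent inputs.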

A Sentiment Analysis and Role of Twitter for Health Communications

10.4018/978-1-7998-8421-7.ch011 ◽

2022 ◽

pp. 188-205

Author(s):

Erkan Çiçek ◽

Uğur Gündüz

Keyword(s):

Social Media ◽

Crisis Management ◽

Semantic Analysis ◽

Social Media Data ◽

Global Pandemic ◽

The Difference ◽

Textual Content ◽

The Impact ◽

Media Data

Social media has become so much a part of our lives that global pandemics, which are themselves an important part of our lives, are undeniably affected by these networks, are present on them, and share the same users. The purpose of this hashtag analysis is to reveal differences in discourse and language in Twitter data and to evaluate, using social media data, the effects of a global pandemic crisis on language, messaging, and crisis management. This form of analysis is typically carried out by collecting textual data and then examining the “sentiment” conveyed. Within the scope of the study, 11,300 Twitter messages posted with the #stayhome hashtag between 30 May 2020 and 6 June 2020 were examined. The impact and reliability of social media in disaster management can be examined through a content analysis based entirely on semantic analysis of the Twitter messages, the phrases used, and their frequencies.
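Counting the phrases used in a set of hashtagged messages, as the abstract describes, might look like the following minimal sketch. The function name `term_frequencies`, the regular-expression tokenizer, and the two sample messages are assumptions for illustration. The study's actual preprocessing is not described in the abstract.

```python
import re
from collections import Counter


def term_frequencies(messages, hashtag="#stayhome"):
    """Count how often each token occurs across messages, excluding the query hashtag."""
    counts = Counter()
    for message in messages:
        # Lowercase, then keep words and hashtags; punctuation is dropped.
        tokens = re.findall(r"#?\w+", message.lower())
        counts.update(t for t in tokens if t != hashtag)
    return counts


# Hypothetical sample messages, not taken from the study's data set.
sample = [
    "Stay safe #StayHome",
    "Home is safe #stayhome",
]
for term, count in term_frequencies(sample).most_common(3):
    print(term, count)
```

The query hashtag is excluded because it appears in every collected message and so carries no information about differences in discourse.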

An unsupervised machine learning model for discovering latent infectious diseases using social media data

Journal of Biomedical Informatics ◽

10.1016/j.jbi.2016.12.007 ◽

2017 ◽

Vol 66 ◽

pp. 82-94 ◽

Cited By ~ 43

Author(s):

Sunghoon Lim ◽

Conrad S. Tucker ◽

Soundar Kumara

Keyword(s):

Machine Learning ◽

Social Media ◽

Infectious Diseases ◽

Learning Model ◽

Unsupervised Machine Learning ◽

Social Media Data ◽

Machine Learning Model ◽

Media Data

Social Media to Social Media Analytics

International Journal of Technoethics ◽

10.4018/ijt.2019070104 ◽

2019 ◽

Vol 10 (2) ◽

pp. 57-70 ◽

Cited By ~ 4

Author(s):

Vikas Kumar ◽

Pooja Nanda

Keyword(s):

Social Media ◽

Social Media Analytics ◽

Social Media Data ◽

Ethical Concerns ◽

Ethical Implications ◽

The Social ◽

Social Media Platforms ◽

The Individual ◽

The Impact ◽

Media Data

With the growth of social media platforms, the importance of social media analytics has increased sharply for many brands and organizations across the world. Tracking and analyzing social media data has contributed to the success of such organizations; however, the data is being poorly harnessed. The ethical implications of social media analytics therefore need to be identified and explored for both the organizations and the targeted users of social media data. The present work is an exploratory study to identify the various techno-ethical concerns of social media engagement, as well as of social media analytics. The impact of these concerns on individuals, organizations, and society as a whole is discussed. Ethical engagement for the most common social media platforms is outlined with a number of specific examples to help understand the prominent techno-ethical concerns. Both the individual and organizational perspectives are taken into account to identify the implications of social media analytics.
