scholarly journals SOCIAL MEDIA DATA PROCESSING AND ANALYSIS BY MEANS OF MACHINE LEARNING FOR RAPID DETECTION, ASSESSMENT AND MAPPING THE IMPACT OF DISASTERS

Author(s):  
P. M. Kikin ◽  
A. A. Kolesnikov ◽  
E. A. Panidi

Abstract. The main factor determining the possibility of using data obtained from social media as a source of information about the threat of emergencies is their relevance and accuracy. Thus, the important task is the determination of metrics for evaluating these parameters for a specific publication in a social media. It is worth noting the importance of this information channel as a source of eyewitness accounts from the scene. A comparison of social media data and official sources shows that social media contain a significant amount of unique information at different stages of emergency development. Also, when monitoring the situation for a specific event, social media allows to get more relevant information in comparison to official sources. Another important task is to search for emergency messages and their most accurate localization in space. A promising solution for the analysis and processing of social media data during emergency response is the application of artificial intelligence methods, and, particularly, machine learning techniques.

2020 ◽  
pp. 193-201 ◽  
Author(s):  
Hayder A. Alatabi ◽  
Ayad R. Abbas

Over the last period, social media achieved a widespread use worldwide where the statistics indicate that more than three billion people are on social media, leading to large quantities of data online. To analyze these large quantities of data, a special classification method known as sentiment analysis, is used. This paper presents a new sentiment analysis system based on machine learning techniques, which aims to create a process to extract the polarity from social media texts. By using machine learning techniques, sentiment analysis achieved a great success around the world. This paper investigates this topic and proposes a sentiment analysis system built on Bayesian Rough Decision Tree (BRDT) algorithm. The experimental results show the success of this system where the accuracy of the system is more than 95% on social media data.


2018 ◽  
Vol 8 (1) ◽  
pp. 15 ◽  
Author(s):  
Ruoxin Zhu ◽  
Diao Lin ◽  
Michael Jendryke ◽  
Chenyu Zuo ◽  
Linfang Ding ◽  
...  

Studying the impact of social events is important for the sustainable development of society. Given the growing popularity of social media applications, social sensing networks with users acting as smart social sensors provide a unique channel for understanding social events. Current research on social events through geo-tagged social media is mainly focused on the extraction of information about when, where, and what happened, i.e., event detection. There is a trend towards the machine learning of more complex events from even larger input data. This research work will undoubtedly lead to a better understanding of big geo-data. In this study, however, we start from known or detected events, raising further questions on how they happened, how they affect people’s lives, and for how long. By combining machine learning, natural language processing, and visualization methods in a generic analytical framework, we attempt to interpret the impact of known social events from the dimensions of time, space, and semantics based on geo-tagged social media data. The whole analysis process consists of four parts: (1) preprocessing; (2) extraction of event-related information; (3) analysis of event impact; and (4) visualization. We conducted a case study on the “2014 Shanghai Stampede” event on the basis of Chinese Sina Weibo data. The results are visualized in various ways, thus ensuring the feasibility and effectiveness of our proposed framework. Both the methods and the case study can serve as decision references for situational awareness and city management.


2021 ◽  
Vol 40 (5) ◽  
pp. 9361-9382 ◽  
Author(s):  
Naeem Iqbal ◽  
Rashid Ahmad ◽  
Faisal Jamil ◽  
Do-Hyeun Kim

Quality prediction plays an essential role in the business outcome of the product. Due to the business interest of the concept, it has extensively been studied in the last few years. Advancement in machine learning (ML) techniques and with the advent of robust and sophisticated ML algorithms, it is required to analyze the factors influencing the success of the movies. This paper presents a hybrid features prediction model based on pre-released and social media data features using multiple ML techniques to predict the quality of the pre-released movies for effective business resource planning. This study aims to integrate pre-released and social media data features to form a hybrid features-based movie quality prediction (MQP) model. The proposed model comprises of two different experimental models; (i) predict movies quality using the original set of features and (ii) develop a subset of features based on principle component analysis technique to predict movies success class. This work employ and implement different ML-based classification models, such as Decision Tree (DT), Support Vector Machines with the linear and quadratic kernel (L-SVM and Q-SVM), Logistic Regression (LR), Bagged Tree (BT) and Boosted Tree (BOT), to predict the quality of the movies. Different performance measures are utilized to evaluate the performance of the proposed ML-based classification models, such as Accuracy (AC), Precision (PR), Recall (RE), and F-Measure (FM). The experimental results reveal that BT and BOT classifiers performed accurately and produced high accuracy compared to other classifiers, such as DT, LR, LSVM, and Q-SVM. The BT and BOT classifiers achieved an accuracy of 90.1% and 89.7%, which shows an efficiency of the proposed MQP model compared to other state-of-art- techniques. The proposed work is also compared with existing prediction models, and experimental results indicate that the proposed MQP model performed slightly better compared to other models. The experimental results will help the movies industry to formulate business resources effectively, such as investment, number of screens, and release date planning, etc.


2021 ◽  
Author(s):  
Alexey Bessudnov ◽  
Denis Tarasov ◽  
Viacheslav Panasovets ◽  
Veronica Kostenko ◽  
Ivan Smirnov ◽  
...  

In this paper we develop a machine learning classifier that predicts perceived ethnicity from data on personal names for major ethnic groups populating Russia. We collect data from VK, the largest Russian social media website. Ethnicity has been determined from languages spoken by users and their geographical location, with the data manually cleaned by crowd workers. The classifier shows the accuracy of 0.82 for a scheme with 24 ethnic groups and 0.92 for 15 aggregated ethnic groups. It can be used for research on ethnicity and ethnic relations in Russia, in particular with VK and other social media data.


In this never-ending social media era it is estimated that over 5 billion people use smartphones. Out of these, there are over 1.5 billion active users in the world. In which we all are a major part and before opening our messages we all are curious about what message we have received. No doubt, we all always hope for a good message to be received. So Sentiment analysis on social media data has been seen by many as an effective tool to monitor user preferences and inclination. Finally, we propose a scalable machine learning model to analyze the polarity of a communicative text using Naive Bayes’ Bernoulli classifier. This paper works on only two polarities that is whether the sentence is positive or negative. Bernoulli classifier is used in this paper because it is best suited for binary inputs which in turn enhances the accuracy of up to 97%.


2022 ◽  
pp. 188-205
Author(s):  
Erkan Çiçek ◽  
Uğur Gündüz

Social media has been in our lives so much lately that it is an undeniable fact that global pandemics, which constitute an important part of our lives, are also affected by these networks and that they exist in these networks and share the users. The purpose of making this hashtag analysis is to reveal the difference in discourse and language while analyzing Twitter data and to evaluate the effects of a global pandemic crisis on language, message, and crisis management with social media data. This form of analysis is typically completed through amassing textual content data then investigating the “sentiment” conveyed. Within the scope of the study, 11,300 Twitter messages posted with the #stayhome hashtag between 30 May 2020 and 6 June 2020 were examined. The impact and reliability of social media in disaster management could be questioned by carrying out a content analysis based totally on the semantic analysis of the messages given on the Twitter posts with the phrases and frequencies used.


2019 ◽  
Vol 10 (2) ◽  
pp. 57-70 ◽  
Author(s):  
Vikas Kumar ◽  
Pooja Nanda

With the amplification of social media platforms, the importance of social media analytics has exponentially increased for many brands and organizations across the world. Tracking and analyzing the social media data has been contributing as a success parameter for such organizations, however, the data is being poorly harnessed. Therefore, the ethical implications of social media analytics need to be identified and explored for both the organizations and targeted users of social media data. The present work is an exploratory study to identify the various techno-ethical concerns of social media engagement, as well as social media analytics. The impact of these concerns on the individuals, organizations, and society as a whole are discussed. Ethical engagement for the most common social media platforms has been outlined with a number of specific examples to understand the prominent techno-ethical concerns. Both the individual and organizational perspectives have been taken into account to identify the implications of social media analytics.


Sign in / Sign up

Export Citation Format

Share Document