Using social media data to map the areas most affected by ISIS in Syria

This paper presents a model to collect, save, geocode, and analyze social media data. The model is used to collect and process the social media data concerned with the ISIS terrorist group (the Islamic State in Iraq and Syria), and to map the areas in Syria most affected by ISIS accordingly to the social media data. Mapping process is assumed automated compilation of a density map for the geocoded tweets. Data mined from social media (e.g., Twitter and Facebook) is recognized as dynamic and easily accessible resources that can be used as a data source in spatial analysis and geographical information system. Social media data can be represented as a topic data and geocoding data basing on the text of the mined from social media and processed using Natural Language Processing (NLP) methods. NLP is a subdomain of artificial intelligence concerned with the programming computers to analyze natural human language and texts. NLP allows identifying words used as an initial data by developed geocoding algorithm. In this study, identifying the needed words using NLP was done using two corpora. First corpus contained the names of populated places in Syria. The second corpus was composed in result of statistical analysis of the number of tweets and picking the words that have a location meaning (i.e., schools, temples, etc.). After identifying the words, the algorithm used Google Maps geocoding API in order to obtain the coordinates for posts.

Download Full-text

Using Twitter to explore consumers' sentiments and their social representations towards new food trends

British Food Journal ◽

10.1108/bfj-03-2020-0192 ◽

2020 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Emilio Pindado ◽

Ramo Barrena

Keyword(s):

Social Media ◽

Clustering Algorithm ◽

Social Representations ◽

Local Level ◽

Geographical Information ◽

Social Media Data ◽

Content Type ◽

The Social ◽

Food Trends ◽

Media Data

PurposeThis paper investigates the use of Twitter for studying the social representations of different regions across the world towards new food trends.Design/methodology/approachA density-based clustering algorithm was applied to 7,014 tweets to identify regions of consumers sharing content about food trends. The attitude of their social representations was addressed with the sentiment analysis, and grid maps were used to explore subregional differences.FindingsTwitter users have a weak, positive attitude towards food trends, and significant differences were found across regions identified, which suggests that factors at the regional level such as cultural context determine users' attitude towards food innovations. The subregional analysis showed differences at the local level, which reinforces the evidence that context matters in consumers' attitude expressed in social media.Research limitations/implicationsThe social media content is sensitive to spatio-temporal events. Therefore, research should take into account content, location and contextual information to understand consumers' perceptions. The methodology proposed here serves to identify consumers' regions and to characterize their attitude towards specific topics. It considers not only administrative but also cognitive boundaries in order to analyse subsequent contextual influences on consumers' social representations.Practical implicationsThe approach presented allows marketers to identify regions of interest and localize consumers' attitudes towards their products using social media data, providing real-time information to contrast with their strategies in different areas and adapt them to consumers' feelings.Originality/valueThis study presents a research methodology to analyse food consumers' understanding and perceptions using not only content but also geographical information of social media data, which provides a means to extract more information than the content analysis applied in the literature.

Download Full-text

Perceiving Residents’ Festival Activities Based on Social Media Data: A Case Study in Beijing, China

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10070474 ◽

2021 ◽

Vol 10 (7) ◽

pp. 474

Author(s):

Bingqing Wang ◽

Bin Meng ◽

Juan Wang ◽

Siyu Chen ◽

Jian Liu

Keyword(s):

Social Media ◽

Language Processing ◽

Topic Model ◽

Central Area ◽

Classification Model ◽

Social Media Data ◽

Ring Road ◽

Different Types ◽

Spatial Differences ◽

Media Data

Social media data contains real-time expressed information, including text and geographical location. As a new data source for crowd behavior research in the era of big data, it can reflect some aspects of the behavior of residents. In this study, a text classification model based on the BERT and Transformers framework was constructed, which was used to classify and extract more than 210,000 residents’ festival activities based on the 1.13 million Sina Weibo (Chinese “Twitter”) data collected from Beijing in 2019 data. On this basis, word frequency statistics, part-of-speech analysis, topic model, sentiment analysis and other methods were used to perceive different types of festival activities and quantitatively analyze the spatial differences of different types of festivals. The results show that traditional culture significantly influences residents’ festivals, reflecting residents’ motivation to participate in festivals and how residents participate in festivals and express their emotions. There are apparent spatial differences among residents in participating in festival activities. The main festival activities are distributed in the central area within the Fifth Ring Road in Beijing. In contrast, expressing feelings during the festival is mainly distributed outside the Fifth Ring Road in Beijing. The research integrates natural language processing technology, topic model analysis, spatial statistical analysis, and other technologies. It can also broaden the application field of social media data, especially text data, which provides a new research paradigm for studying residents’ festival activities and adds residents’ perception of the festival. The research results provide a basis for the design and management of the Chinese festival system.

Download Full-text

A Pipeline to Understand Emerging Illness Via Social Media Data Analysis: Case Study on Breast Implant Illness (Preprint)

10.2196/preprints.29768 ◽

2021 ◽

Author(s):

Vishal Dey ◽

Peter Krasniak ◽

Minh Nguyen ◽

Clara Lee ◽

Xia Ning

Keyword(s):

Mental Health ◽

Social Media ◽

Natural Language Processing ◽

Data Analysis ◽

Natural Language ◽

Language Processing ◽

Breast Implant ◽

Public Attention ◽

Social Media Data ◽

Media Data

BACKGROUND A new illness can come to public attention through social media before it is medically defined, formally documented, or systematically studied. One example is a condition known as breast implant illness (BII), which has been extensively discussed on social media, although it is vaguely defined in the medical literature. OBJECTIVE The objective of this study is to construct a data analysis pipeline to understand emerging illnesses using social media data and to apply the pipeline to understand the key attributes of BII. METHODS We constructed a pipeline of social media data analysis using natural language processing and topic modeling. Mentions related to signs, symptoms, diseases, disorders, and medical procedures were extracted from social media data using the clinical Text Analysis and Knowledge Extraction System. We mapped the mentions to standard medical concepts and then summarized these mapped concepts as topics using latent Dirichlet allocation. Finally, we applied this pipeline to understand BII from several BII-dedicated social media sites. RESULTS Our pipeline identified topics related to toxicity, cancer, and mental health issues that were highly associated with BII. Our pipeline also showed that cancers, autoimmune disorders, and mental health problems were emerging concerns associated with breast implants, based on social media discussions. Furthermore, the pipeline identified mentions such as rupture, infection, pain, and fatigue as common self-reported issues among the public, as well as concerns about toxicity from silicone implants. CONCLUSIONS Our study could inspire future studies on the suggested symptoms and factors of BII. Our study provides the first analysis and derived knowledge of BII from social media using natural language processing techniques and demonstrates the potential of using social media information to better understand similar emerging illnesses. CLINICALTRIAL

Download Full-text

The Social Media Macroscope: A science gateway for research using social media data

Future Generation Computer Systems ◽

10.1016/j.future.2019.10.029 ◽

2020 ◽

Vol 111 ◽

pp. 819-828 ◽

Cited By ~ 1

Author(s):

Joseph T. Yun ◽

Nickolas Vance ◽

Chen Wang ◽

Luigi Marini ◽

Joseph Troy ◽

...

Keyword(s):

Social Media ◽

Social Media Data ◽

Science Gateway ◽

The Social ◽

Media Data

Download Full-text

Anatomy of a Protest: Spatial Information, Social Media, and Urban Space

Social Media + Society ◽

10.1177/2056305119897320 ◽

2020 ◽

Vol 6 (1) ◽

pp. 205630511989732

Author(s):

Alireza Karduni ◽

Eric Sauda

Keyword(s):

Social Media ◽

Public Space ◽

Urban Space ◽

Language Processing ◽

Local Community ◽

Spatial Information ◽

Social Media Data ◽

Public Events ◽

Use Of Social Media ◽

Media Data

Black Lives Matter, like many modern movements in the age of information, makes significant use of social media as well as public space to demand justice. In this article, we study the protests in response to the shooting of Keith Lamont Scott by police in Charlotte, North Carolina, on September 2016. Our goal is to measure the significance of urban space within the virtual and physical network of protesters. Using a mixed-methods approach, we identify and study urban space and social media generated by these protests. We conducted interviews with protesters who were among the first to join the Keith Lamont Scott shooting demonstrations. From the interviews, we identify places that were significant in our interviewees’ narratives. Using a combination of natural language processing and social network analysis, we analyze social media data related to the Charlotte protests retrieved from Twitter. We found that social media, local community, and public space work together to organize and motivate protests and that public events such as protests cause a discernible increase in social media activity. Finally, we find that there are two distinct communities who engage social media in different ways; one group involved with social media, local community and urban space, and a second group connected almost exclusively through social media.

Download Full-text

Review of Data Visualization for Social Media Postings

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i4.38.27613 ◽

2018 ◽

Vol 7 (4.38) ◽

pp. 939

Author(s):

Nur Atiqah Sia Abdullah ◽

Hamizah Binti Anuar

Keyword(s):

Social Media ◽

Data Visualization ◽

Line Graph ◽

Data Types ◽

Social Media Data ◽

Data Analyst ◽

The Social ◽

Social Media Platforms ◽

Types Of Information ◽

Media Data

Facebook and Twitter are the most popular social media platforms among netizen. People are now more aggressive to express their opinions, perceptions, and emotions through social media platforms. These massive data provide great value for the data analyst to understand patterns and emotions related to a certain issue. Mining the data needs techniques and time, therefore data visualization becomes trending in representing these types of information. This paper aims to review data visualization studies that involved data from social media postings. Past literature used node-link diagram, node-link tree, directed graph, line graph, heatmap, and stream graph to represent the data collected from the social media platforms. An analysis by comparing the social media data types, representation, and data visualization techniques is carried out based on the previous studies. This paper critically discussed the comparison and provides a suggestion for the suitability of data visualization based on the type of social media data in hand.

Download Full-text

Social Media to Social Media Analytics

International Journal of Technoethics ◽

10.4018/ijt.2019070104 ◽

2019 ◽

Vol 10 (2) ◽

pp. 57-70 ◽

Cited By ~ 4

Author(s):

Vikas Kumar ◽

Pooja Nanda

Keyword(s):

Social Media ◽

Social Media Analytics ◽

Social Media Data ◽

Ethical Concerns ◽

Ethical Implications ◽

The Social ◽

Social Media Platforms ◽

The Individual ◽

The Impact ◽

Media Data

With the amplification of social media platforms, the importance of social media analytics has exponentially increased for many brands and organizations across the world. Tracking and analyzing the social media data has been contributing as a success parameter for such organizations, however, the data is being poorly harnessed. Therefore, the ethical implications of social media analytics need to be identified and explored for both the organizations and targeted users of social media data. The present work is an exploratory study to identify the various techno-ethical concerns of social media engagement, as well as social media analytics. The impact of these concerns on the individuals, organizations, and society as a whole are discussed. Ethical engagement for the most common social media platforms has been outlined with a number of specific examples to understand the prominent techno-ethical concerns. Both the individual and organizational perspectives have been taken into account to identify the implications of social media analytics.

Download Full-text

Corrigendum

Journal of Information Science ◽

10.1177/0165551518781507 ◽

2018 ◽

Vol 45 (1) ◽

pp. 136-136

Keyword(s):

Public Health ◽

Social Media ◽

Information Science ◽

Patient Data ◽

Online Version ◽

Social Media Data ◽

Medical Network ◽

The Social ◽

Media Data ◽

Prior Permission

Ji X, Chun SA, Cappellari P, et al. Linking and using social media data for enhancing public health analytics. Journal of Information Science 2016; 43: 221–245. DOI: 10.1177/0165551515625029 The authors regret that non-anonymised patient data was used from a social medical network without prior permission. With permission from the social medical network, the authors have anonymised the data and corrected the article. The online version of the article has been corrected.

Download Full-text

Mining social media for prescription medication abuse monitoring: a review and proposal for a data-centric framework

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocz162 ◽

2019 ◽

Vol 27 (2) ◽

pp. 315-329 ◽

Cited By ~ 6

Author(s):

Abeed Sarker ◽

Annika DeRoos ◽

Jeanmarie Perrone

Keyword(s):

Social Media ◽

Language Processing ◽

Prescription Medication ◽

Inclusion Criteria ◽

Major Health Problem ◽

Social Media Data ◽

Use Of Data ◽

Media Source ◽

Multiple Characteristics ◽

Media Data

Abstract Objective Prescription medication (PM) misuse and abuse is a major health problem globally, and a number of recent studies have focused on exploring social media as a resource for monitoring nonmedical PM use. Our objectives are to present a methodological review of social media–based PM abuse or misuse monitoring studies, and to propose a potential generalizable, data-centric processing pipeline for the curation of data from this resource. Materials and Methods We identified studies involving social media, PMs, and misuse or abuse (inclusion criteria) from Medline, Embase, Scopus, Web of Science, and Google Scholar. We categorized studies based on multiple characteristics including but not limited to data size; social media source(s); medications studied; and primary objectives, methods, and findings. Results A total of 39 studies met our inclusion criteria, with 31 (∼79.5%) published since 2015. Twitter has been the most popular resource, with Reddit and Instagram gaining popularity recently. Early studies focused mostly on manual, qualitative analyses, with a growing trend toward the use of data-centric methods involving natural language processing and machine learning. Discussion There is a paucity of standardized, data-centric frameworks for curating social media data for task-specific analyses and near real-time surveillance of nonmedical PM use. Many existing studies do not quantify human agreements for manual annotation tasks or take into account the presence of noise in data. Conclusion The development of reproducible and standardized data-centric frameworks that build on the current state-of-the-art methods in data and text mining may enable effective utilization of social media data for understanding and monitoring nonmedical PM use.

Download Full-text