Identifying Patients on Twitter and Learning from Their Personal Experience: The Case of IBD (Preprint)

2021 ◽  
Author(s):  
Maya Stemmer ◽  
Yisrael Parmet ◽  
Gilad Ravid

BACKGROUND Social media serve as an alternate information source for patients, who use them to share information and provide social support. Though large amounts of health-related data are being posted on Twitter and other social networking platforms each day, research using social media data for understanding chronic conditions and patients' lifestyles is still lacking. OBJECTIVE In this research we contribute to closing this gap by providing a framework for identifying patients with Inflammatory Bowel Disease (IBD) on Twitter and learning from their personal experience. We enable the analysis of patients' tweets by building a classifier of Twitter users that distinguishes patients from other entities. The research aims to assess the feasibility of using social media data to promote chronically ill patients' wellbeing, by relying on the wisdom of the crowd for identifying healthy lifestyles. We seek to leverage posts describing patients' daily activities and the influence on their wellbeing for characterizing different treatments and understanding what works for whom. METHODS In the first stage of the research, a machine learning method combining both social network analysis and natural language processing was used to classify users as patients or not automatically. Three types of features were considered: (1) the user's behavior on Twitter, (2) the content of the user's tweets, and (3) the social structure of the user's network. Different classification algorithms were examined and compared using two measures (F1-score and precision) over 10-fold cross-validation. In the second stage of the research, the obtained classification methods were used to collect tweets of patients, in which they refer to the different lifestyle changes they endure in order to deal with their disease. Using IBM Watson Service for entity sentiment analysis, we calculated the average sentiment of 420 lifestyle-related words that IBD patients use when describing their daily routine. RESULTS The best classification results (F1-score 0.808 and precision 0.809) for identifying IBD patients among Twitter users were achieved by a multiple-instance learning approach, which constitutes the novelty of this research. The sentiment analysis of tweets written by IBD patients identified frequently mentioned lifestyles and their influence on patients' wellbeing. The findings reinforced what is known about suitable nutrition for IBD, and several foods that are known to cause inflammation were highlighted as words with negative sentiment. CONCLUSIONS Patients everywhere use social media to share health and treatment information, learn from each other's experiences, and provide social support. Mining these informative conversations may shed some light on patients' ways of life and support chronic conditions research.

2021 ◽  
Author(s):  
Vadim Moshkin ◽  
Andrew Konstantinov ◽  
Nadezhda Yarushkina ◽  
Alexander Dyrnochkin

2020 ◽  
pp. 193-201 ◽  
Author(s):  
Hayder A. Alatabi ◽  
Ayad R. Abbas

Over the last period, social media achieved a widespread use worldwide where the statistics indicate that more than three billion people are on social media, leading to large quantities of data online. To analyze these large quantities of data, a special classification method known as sentiment analysis, is used. This paper presents a new sentiment analysis system based on machine learning techniques, which aims to create a process to extract the polarity from social media texts. By using machine learning techniques, sentiment analysis achieved a great success around the world. This paper investigates this topic and proposes a sentiment analysis system built on Bayesian Rough Decision Tree (BRDT) algorithm. The experimental results show the success of this system where the accuracy of the system is more than 95% on social media data.


Author(s):  
S. M. Mazharul Hoque Chowdhury ◽  
Sheikh Abujar ◽  
Ohidujjaman ◽  
Khalid Been Md. Badruzzaman ◽  
Syed Akhter Hossain

Author(s):  
Shalin Hai-Jew

Sentiment analysis has been used to assess people's feelings, attitudes, and beliefs, ranging from positive to negative, on a variety of phenomena. Several new autocoding features in NVivo 11 Plus enable the capturing of sentiment analysis and extraction of themes from text datasets. This chapter describes eight scenarios in which these tools may be applied to social media data, to (1) profile egos and entities, (2) analyze groups, (3) explore metadata for latent public conceptualizations, (4) examine trending public issues, (5) delve into public concepts, (6) observe public events, (7) analyze brand reputation, and (8) inspect text corpora for emergent insights.


Healthcare ◽  
2020 ◽  
Vol 8 (3) ◽  
pp. 307
Author(s):  
Li Zhang ◽  
Haimeng Fan ◽  
Chengxia Peng ◽  
Guozheng Rao ◽  
Qing Cong

The widespread use of social media provides a large amount of data for public sentiment analysis. Based on social media data, researchers can study public opinions on human papillomavirus (HPV) vaccines on social media using machine learning-based approaches that will help us understand the reasons behind the low vaccine coverage. However, social media data is usually unannotated, and data annotation is costly. The lack of an abundant annotated dataset limits the application of deep learning methods in effectively training models. To tackle this problem, we propose three transfer learning approaches to analyze the public sentiment on HPV vaccines on Twitter. One was transferring static embeddings and embeddings from language models (ELMo) and then processing by bidirectional gated recurrent unit with attention (BiGRU-Att), called DWE-BiGRU-Att. The others were fine-tuning pre-trained models with limited annotated data, called fine-tuning generative pre-training (GPT) and fine-tuning bidirectional encoder representations from transformers (BERT). The fine-tuned GPT model was built on the pre-trained generative pre-training (GPT) model. The fine-tuned BERT model was constructed with BERT model. The experimental results on the HPV dataset demonstrated the efficacy of the three methods in the sentiment analysis of the HPV vaccination task. The experimental results on the HPV dataset demonstrated the efficacy of the methods in the sentiment analysis of the HPV vaccination task. The fine-tuned BERT model outperforms all other methods. It can help to find strategies to improve vaccine uptake.


Sentiment analysis is one of the heated topic in the field of text mining. As the social media data is increased day by day the main need of the data scientists is to classify the data so that it can be further used for decision making or knowledge discovery. Now –a-days everything and everyone available online so to check the latest trends in business or in daily life one must consider the online data. The main focus of sentiment analysis is to focus on positive or negative comments so that a well define picture is created that what is trending or not but the sarcasm manipulates the data as in sarcastic comment negative comment consider as positive because of the presence of positive words in the comment or data so it is necessary to detect the sarcasm in online data . The data on social media is available in various languages so sentiment analysis in regional languages is also a main step . In the proposed work we focus on two languages i.e Punjabi and English. Here we use deep learning based neural networks for the sarcasm detection in English as well as Punjabi language. In the proposed work we consider three datasets i.e. balanced English dataset, Balanced Punjabi Dataset and unbalanced Punjabi dataset. We used six different models to check the accuracy of the classified data the models we used are LSTM with word embedding layer, BiLSTM with , LSTM+LSTM, BiLSTM+BiLSTM, LSTM+BiLSTM, CNN respectively. LSTM provide better accuracy for balanced Punjabi and English dataset i.e. 95.63% and 94.17% respectively. The accuracy for unbalanced Punjabi dataset is provided by BiLSTM i.e.96.31%.


Sign in / Sign up

Export Citation Format

Share Document