Exploring the Use of Machine Learning to Automate the Qualitative Coding of Church-related Tweets

2020 ◽  
Vol 14 (2) ◽  
pp. 140-159
Author(s):  
Anthony-Paul Cooper ◽  
Emmanuel Awuni Kolog ◽  
Erkki Sutinen

This article builds on previous research around the exploration of the content of church-related tweets. It does so by exploring whether the qualitative thematic coding of such tweets can, in part, be automated by the use of machine learning. It compares three supervised machine learning algorithms to understand how useful each algorithm is at a classification task, based on a dataset of human-coded church-related tweets. The study finds that one such algorithm, Naïve-Bayes, performs better than the other algorithms considered, returning Precision, Recall and F-measure values which each exceed an acceptable threshold of 70%. This has far-reaching consequences at a time where the high volume of social media data, in this case, Twitter data, means that the resource-intensity of manual coding approaches can act as a barrier to understanding how the online community interacts with, and talks about, church. The findings presented in this article offer a way forward for scholars of digital theology to better understand the content of online church discourse.

Author(s):  
Amrita Mishra ◽  

Sentiment Analysis has paved routes for opinion analysis of masses over unrestricted territorial limits. With the advent and growth of social media like Twitter, Facebook, WhatsApp, Snapchat in today’s world, stakeholders and the public often takes to expressing their opinion on them and drawing conclusions. While these social media data are extremely informative and well connected, the major challenge lies in incorporating efficient Text Classification strategies which not only overcomes the unstructured and humongous nature of data but also generates correct polarity of opinions (i.e. positive, negative, and neutral). This paper is a thorough effort to provide a brief study about various approaches to SA including Machine Learning, Lexicon Based, and Automatic Approaches. The paper also highlights the comparison of positive, negative, and neutral tweets of the Sputnik V, Moderna, and Covaxin vaccines used for preventive and emergency use of COVID-19 disease.


Quick data acquisition and analysis became an important tool in the contemporary era. Real time data is made available in World Wide Web (WWW) and social media. Especially social media data is rich in opinions of people of all walks of life. Searching and analysing such data provides required business intelligence (BI) for applications of various domains in the real world. The application may be in the area of politics or banking or insurance or healthcare industry. With the emergence of cloud computing, volumes of data are added to cloud storage infrastructure and it is growing exponentially. In this context, Elasticsearch is the distributed search and analytics engine that is very crucial part of Elastic Stack. For data collection, aggregation and enriching it Beats and Logstash are used and such data is stored in Elasticsearch. For interactive exploration and visualization Kibana is used. Elasticsearch helps in indexing of data, searching efficiently and performing data analytics. In this paper, the utility of Elasticsearch is evaluated for optimising search and data analytics of Twitter data. Empirical study is made with the Elasticsearch tool configured for Windows and also using Amazon Elasticsearch and the results are compared with state of art. The experimental results revealed that the Elasticsearch performs better than the existing ones.


Author(s):  
Noraini Seman ◽  
Nurul Atiqah Razmi

A huge amount of data is generated every minute for social networking and content sharing via Social media sites that can be in a form of structured, unstructured or semi-structured data.  One of the largest used social media sites is Twitter, where each and every day millions of data generated in the form of unstructured tweets. Tweets or opinions of the people can be used to extract sentiments of the people. Sentiment analysis is beneficial for organizations to improve their products and make required changes on demand to increase their profit. In this paper, three machine learning algorithms Support Vector Machine (SVM), Decision Trees (DT), and Naive Bayes (NB) for classifying sentiments of twitters data. The purpose of this research is to compare the outcomes of these algorithms to identify best machine learning method which gives most accurate and efficient results for classifying twitter data. Our experimental result shows that same preprocessing methods on a different dataset affect similarly the classifiers performance. After analyzing the results it is observed that SVM provides 64.96%, 71.26% and 91.25% precision which is better than other two algorithms. Also, overall Recall and F-measure rate of SVM is greater than NB and DT for three datasets. However, it is important to further study current available preprocessing techniques that help us to improve results of various classifiers.


2020 ◽  
Author(s):  
Dong Whi Yoo ◽  
Sindhu Kiranmai Ernala ◽  
Bahador Saket ◽  
Domino Weir ◽  
Elizabeth Arenare ◽  
...  

BACKGROUND Extant studies have suggested that social media data, along with machine learning algorithms, can be used to generate computational mental health insights. Those computational insights have potential to support clinician-patient communication during psychotherapy consultations. However, it has been underexplored how clinicians would perceive and envision utilizing computational insights during consultations. OBJECTIVE We sought to understand clinician perspectives regarding computational mental health insights from patients’ social media. We focused on the opportunities and challenges of utilizing these insights during psychotherapy consultations. METHODS Following user-centered design approaches, we developed a prototype that can analyze consented patients’ Facebook data and visually represent the computational insights. We incorporated the insights into existing clinician-facing assessment tools, the Hamilton Depression Rating Scale and Global Functioning: Social Scale. The design intent is that a clinician will verbally interview a patient (e.g., How was your mood in the past week) while they review relevant insights from the patient’s social media (e.g., number of depression indicative posts.) Using the prototype, we conducted interviews (N=15) and 3 focus groups (N=13) with mental health clinicians: psychiatrists, clinical psychologists, and licensed clinical social workers. The transcribed qualitative data was analyzed using thematic analysis. RESULTS Clinicians reported the potential usefulness of the prototype as collateral information regarding collaborative agenda setting, tracking symptoms, and navigating patient verbal reports. They suggested alternative use scenarios such as collaborative use between clinicians, reviewing the prototype prior to consultations, and using the prototype when patients miss their consultations. They also shared their concerns regarding computational insights: they were not sure whether patients’ social media represent their actual behaviors; they wanted to learn how and when the machine learning algorithm can fail to set their expectations of trust; they worried about situations where they cannot properly respond to the insights, especially emergency situations outside of clinical settings. They also pointed out that reviewing computational insights may increase their workload. CONCLUSIONS Our findings support the touted potential of computational mental health insights from patients’ social media data, especially in the context of psychotherapy consultations. However, sociotechnical issues, such as transparent algorithmic information and institutional support, should be addressed in future endeavors to design an implementable and sustainable technology.


Sign in / Sign up

Export Citation Format

Share Document