scholarly journals A Framework for using Machine Learning to Support Qualitative Data Coding

2021 ◽  
Author(s):  
Peter Baumgartner ◽  
Amanda Smith ◽  
Murrey Olmsted ◽  
Dawn Ohse

Open-ended survey questions provide qualitative data that are useful for a multitude of reasons. However, qualitative data analysis is labor intensive, and researchers often lack the needed time and resources resulting in underutilization of qualitative data. In attempting to address these issues, we looked to machine learning and recent advances in language models and transfer learning to assist in qualitative coding of responses. We trained a machine learning model following the BERT architecture to predict thematic codes that were then adjudicated by human coders. Results suggest this is a promising approach that can be used to support traditional coding methods and has the potential to alleviate some of the burden associated with qualitative data analysis.

2020 ◽  
Vol 223 (3) ◽  
pp. 437.e1-437.e15
Author(s):  
Joshua Guedalia ◽  
Michal Lipschuetz ◽  
Michal Novoselsky-Persky ◽  
Sarah M. Cohen ◽  
Amihai Rottenstreich ◽  
...  

Machine learning is a prominent tool for getting data from large amounts of information. Whereas a good amount of machine learning analysis has targeted on increasing the accuracy and potency of coaching and reasoning algorithms, there is less attention within the equally vital issues of observing the standard of information fed into the machine learning model. The standard of huge information is far away from good. Recent studies have shown that poor quality will bring serious errors to the result of big data analysis and this could have an effect on in making additional precise results from the information. Advantages of data preprocessing within the context of ML are advanced detection of errors, model-quality improves by the usage of better data, savings in engineering hours to debug issues


2021 ◽  
Vol 12 (2) ◽  
pp. 49-66
Author(s):  
Janmenjoy Nayak ◽  
Bighnaraj Naik ◽  
Pandit Byomakesha Dash ◽  
Danilo Pelusi

Biomedical data is often more unstructured in nature, and biomedical data processing task is becoming more complex day by day. Thus, biomedical informatics requires competent data analysis and data mining techniques for designing decision support system's framework to solve clinical and heathcare-related issues. Due to increasingly large and complex data sets and demand of biomedical informatics research, researchers are attracted towards automated machine learning models. This paper is proposed to design an efficient machine learning model based on fuzzy c-means with meta-heuristic optimizations for biomedical data analysis and clustering. The main contributions of this paper are 1) projecting an efficient machine learning model based on fuzzy c-means and meta-heuristic optimization for biomedical data classification, 2) employing benchmark validation techniques and critical hypothesises testing, and 3) providing a background for biomedical data processing with a view of data processing and mining.


Author(s):  
Gyeonghwa Lee

Handling a large number of diverse types of qualitative data can be challenging for qualitative researchers. Also, there seems to be no clear standard for qualitative data analysis unlike the quantitative form. This is one of the main reasons for difficulties in analysing qualitative data and may cause concerns about the issue of research credibility. Dr Adu’s book may help to solve such worries and challenges by providing a step-by-step qualitative data coding process from setting up the research paradigms to presenting findings systematically, concretely, and comprehensively.


2020 ◽  
Vol 11 (2) ◽  
pp. 124
Author(s):  
Sihono Setyo Budi

<p>This research in aimed to find out if Problem  Based Learning Model can improve the  independence of learning and learning achievment  at class XII IPS<sub>2</sub> in MAN 1 Kulon Progo. The subject of this reasearch the students of MAN 1 Kulon Progo at class XII IPS<sub>2</sub> for 33 students. The datum are  obtained by using observation sheet and quantitative   and  qualitative data analysis. It is shown that the are significant improvemens on cycle I, cycle II , and cycle III. The everage scores of learning achievement are 65,7 on cycle I, 71,6 on cycle II and 73,5 on cycle III. The evarega score of independence of learning are 73,06, on cycle I, 76,51 on cycle II and 88,18 on cycle III.</p><p> </p><p>Keywords : Problem  Based Learning Model, Learning Achievement, independence of learning, <strong></strong></p>


Author(s):  
Susan Zholl

Pat Bazeley’s book advances the procedures of qualitative data coding and analysis. Her work provides a highly accessible framework for both new and seasoned researchers to consider in the analysis of interview, survey, photo and video data collected in qualitative studies. In her work Bazeley acknowledges the use of technology to support data analysis (i.e., NVivo), but also promotes the use of more traditional paper - and - pencil methods. This text would be a great addition to a graduate level course in qualitative research.


10.2196/23943 ◽  
2020 ◽  
Vol 22 (11) ◽  
pp. e23943 ◽  
Author(s):  
Roschelle L Fritz ◽  
Marian Wilson ◽  
Gordana Dermody ◽  
Maureen Schmitter-Edgecombe ◽  
Diane J Cook

Background Poorly managed pain can lead to substance use disorders, depression, suicide, worsening health, and increased use of health services. Most pain assessments occur in clinical settings away from patients’ natural environments. Advances in smart home technology may allow observation of pain in the home setting. Smart homes recognizing human behaviors may be useful for quantifying functional pain interference, thereby creating new ways of assessing pain and supporting people living with pain. Objective This study aimed to determine if a smart home can detect pain-related behaviors to perform automated assessment and support intervention for persons with chronic pain. Methods A multiple methods, secondary data analysis was conducted using historic ambient sensor data and weekly nursing assessment data from 11 independent older adults reporting pain across 1-2 years of smart home monitoring. A qualitative approach was used to interpret sensor-based data of 27 unique pain events to support clinician-guided training of a machine learning model. A periodogram was used to calculate circadian rhythm strength, and a random forest containing 100 trees was employed to train a machine learning model to recognize pain-related behaviors. The model extracted 550 behavioral markers for each sensor-based data segment. These were treated as both a binary classification problem (event, control) and a regression problem. Results We found 13 clinically relevant behaviors, revealing 6 pain-related behavioral qualitative themes. Quantitative results were classified using a clinician-guided random forest technique that yielded a classification accuracy of 0.70, sensitivity of 0.72, specificity of 0.69, area under the receiver operating characteristic curve of 0.756, and area under the precision-recall curve of 0.777 in comparison to using standard anomaly detection techniques without clinician guidance (0.16 accuracy achieved; P<.001). The regression formulation achieved moderate correlation, with r=0.42. Conclusions Findings of this secondary data analysis reveal that a pain-assessing smart home may recognize pain-related behaviors. Utilizing clinicians’ real-world knowledge when developing pain-assessing machine learning models improves the model’s performance. A larger study focusing on pain-related behaviors is warranted to improve and test model performance.


Sign in / Sign up

Export Citation Format

Share Document