Characteristic Latent Features for Analyzing Digital Mental Health Interaction and Improved Explainability

Abstract Background: Using smartphones and wearable sensor technology has sparked a broad engagement of data science and machine learning methods to leverage the complex, assorted amount of data. Despite verified processes, there is a reported underdevelopment of user engagement concepts, and the desire for high accuracy or significance has shown to lead to low explicability and irreproducibility. To overcome these issues, we aim to analyze principal characteristics of everyday behavior in digital mental health. Methods: We generated five latent features based on previous research, expert opinions from digital mental health, and informed by data. The features were analyzed with descriptive statistics and data visualization. We carried out two rounds of evaluations with data from 12,400 users of IntelliCare, a mental health platform with 12 apps. First, we focused to proof concept and second, we assessed reproducibility by drawing conclusion from distribution differences. User data was drawn from both research trials and public deployment on Google Play. Results: Our algorithms showed increased rationale for the basic usage of apps with different underlying behavioral strategies. Measures of the distribution of user’s allocated attention, the user’s circadian behavior, their consecutive commitment to a specific strategy, and users’ interaction trajectory are perceived as transferable to the public data set. Because distributions between research trial and public deployment were similar, consistency was shown regarding the underlying behavioral strategies: psychoeducation and goal setting are used as a catalyst to overcome the users’ primary obstacles, sleep hygiene is addressed most regularly, while regular self-reflective thinking is avoided. Relaxation as well as cognitive reframing have increased variance in commitment among public users, indicating the challenging nature of these apps. The relative course of the engagement (learning curve) is similar in research and public data. Conclusions: The deliberate, a-priori engineered features were reproducible across app users from both data sets. These features led to improved results as well as increased interpretability, providing an increased understanding of how people engage with multiple mental health apps over time. Since we based the generation of features on generic interaction proxies, these methods are applicable to other cases in artificial intelligence and digital health.

Download Full-text

Characteristic Latent Features for Analyzing Digital Mental Health Interaction and Improved Explainability

10.21203/rs.3.rs-68076/v1 ◽

2020 ◽

Author(s):

Max-Marcel Theilig ◽

Ashley Arehart Knapp ◽

Jennifer Nicholas ◽

Rüdiger Zarnekow ◽

David Curtis Mohr

Keyword(s):

Mental Health ◽

Data Science ◽

Digital Health ◽

A Priori ◽

Sensor Technology ◽

User Engagement ◽

Behavioral Strategies ◽

Data Set ◽

Public Data ◽

Latent Features

Abstract Background: Using smartphones and wearable sensor technology has sparked a broad engagement of data science and machine learning methods to leverage the complex, assorted amount of data. Despite verified processes, there is a reported underdevelopment of user engagement concepts, and the desire for high accuracy or significance has shown to lead to low explicability and irreproducibility. To overcome these issues, we aim to analyze principal characteristics of everyday behavior in digital mental health. Methods: We generated five latent features based on previous research, expert opinions from digital mental health, and informed by data. The features were analyzed with descriptive statistics and data visualization. We carried out two rounds of evaluations with data from 12,400 users of IntelliCare, a mental health platform with 12 apps. First, we focused to proof concept and second, we assessed reproducibility by drawing conclusion from distribution differences. User data was drawn from both research trials and public deployment on Google Play. Results: Our algorithms showed increased rationale for the basic usage of apps with different underlying behavioral strategies. Measures of the distribution of user’s allocated attention, the user’s circadian behavior, their consecutive commitment to a specific strategy, and users’ interaction trajectory curve are perceived as transferable to the public data set. Because distributions between research trial and public deployment were similar, consistency was shown regarding the underlying behavioral strategies: psychoeducation and goal setting are used as a catalyst to overcome the users’ primary obstacles, sleep hygiene is addressed most regularly, while regular emotional exposure is avoided. Relaxation as well as cognitive reframing have increased variance in commitment among public users, indicating the challenging nature of these apps. The relative course of the engagement (learning curve) is similar in research and public data. Conclusions: The deliberate, a-priori engineered features were reproducible across app users from both data sets. These features led to improved results as well as increased interpretability, providing an increased understanding of how people engage with multiple mental health apps over time. Since we based the generation of features on generic interaction proxies, these methods are applicable to other cases in artificial intelligence and digital health.

Download Full-text

Characteristic Latent Features for Analyzing Digital Mental Health Interaction and Improved Explainability (Preprint)

10.2196/preprints.28014 ◽

2021 ◽

Author(s):

Max-Marcel Theilig ◽

Ashley A Knapp ◽

Jennifer M Nicholas ◽

Rüdiger Zarnekow ◽

David C Mohr

Keyword(s):

Mental Health ◽

Data Science ◽

Digital Health ◽

A Priori ◽

User Interaction ◽

User Engagement ◽

Behavioral Strategies ◽

Data Set ◽

Public Data ◽

Latent Features

BACKGROUND Using mobile health technology has sparked a broad engagement of data science and machine learning methods to leverage the complex, assorted amount of data for mental health purposes. Despite many studies, there is a reported underdevelopment of user engagement concepts, and the desire for high accuracy or significance has shown to lead to low explicability and irreproducibility. OBJECTIVE To overcome such reasons of poor analysis input and facilitate the reproducibility and credibility of artificial intelligence applications, we aim to explore principal characteristics of user interaction with digital mental health. METHODS We generated five latent features based on previous research, expert opinions from digital mental health, and informed by data. The features were analyzed with descriptive statistics and data visualization. We carried out two rounds of evaluations with data from 12,400 users of IntelliCare, a mental health platform with 12 apps. First, we focused to proof concept and second, we assessed reproducibility by drawing conclusion from distribution differences. User data was drawn from both research trials and public deployment on Google Play. RESULTS Our algorithms showed advantages over commonly used concepts and reproduce in our public data set with different underlying behavioral strategies. These measures relate to the distribution of a user’s allocated attention, users’ circadian behavior, their consecutive commitment to a specific strategy, and users’ interaction trajectory. Because distributions between research trial and public deployment were similar, consistency was implied regarding the underlying behavioral strategies: psychoeducation and goal setting are used as a catalyst to overcome the users’ primary obstacles, sleep hygiene is addressed most regularly, while regular self-reflective thinking is avoided. Relaxation as well as cognitive reframing have increased variance in commitment among public users, indicating the challenging nature of these apps. The relative course of users’ engagement is similar in research and public data. CONCLUSIONS We argue that deliberate, a-priori feature engineering is essential for reproducible, tangible, and explainable study analyses. Our features enable improved results as well as interpretability, providing an increased understanding of how people engage with multiple mental health apps over time. Since we based the generation of features on generic interaction, these methods are applicable to further methods of study analysis and digital health.

Download Full-text

A public data set of spatio-temporal match events in soccer competitions

Scientific Data ◽

10.1038/s41597-019-0247-7 ◽

2019 ◽

Vol 6 (1) ◽

Cited By ~ 12

Author(s):

Luca Pappalardo ◽

Paolo Cintia ◽

Alessio Rossi ◽

Emanuele Massucco ◽

Paolo Ferragina ◽

...

Keyword(s):

Data Science ◽

Social Systems ◽

Team Sports ◽

Data Set ◽

Evaluation Of Performance ◽

Public Data ◽

Wide Range ◽

Spatio Temporal ◽

Measurement And Evaluation ◽

Temporal Events

Abstract Soccer analytics is attracting increasing interest in academia and industry, thanks to the availability of sensing technologies that provide high-fidelity data streams for every match. Unfortunately, these detailed data are owned by specialized companies and hence are rarely publicly available for scientific research. To fill this gap, this paper describes the largest open collection of soccer-logs ever released, containing all the spatio-temporal events (passes, shots, fouls, etc.) that occured during each match for an entire season of seven prominent soccer competitions. Each match event contains information about its position, time, outcome, player and characteristics. The nature of team sports like soccer, halfway between the abstraction of a game and the reality of complex social systems, combined with the unique size and composition of this dataset, provide an ideal ground for tackling a wide range of data science problems, including the measurement and evaluation of performance, both at individual and at collective level, and the determinants of success and failure.

Download Full-text

Measurement in an on-demand virtual mental health system: a review and viewpoint (Preprint)

10.2196/preprints.24895 ◽

2020 ◽

Author(s):

Sarah Kunkle ◽

John A Naslund ◽

Justin Hunt ◽

Dana Udall ◽

Erica Hayes ◽

...

Keyword(s):

Mental Health ◽

Data Science ◽

New Technologies ◽

Digital Health ◽

Clinical Care ◽

The United States ◽

On Demand ◽

Measurement Based Care ◽

Multidisciplinary Perspective ◽

Development And Management

UNSTRUCTURED Mental health is a growing public health priority in the United States and globally. Measurement-based care (MBC) has been shown to improve outcomes in clinical care, yet there are significant challenges to implementation. New technologies offer opportunities to address these obstacles and to measure new and existing constructs at a scale that was previously not possible. This paper aims to summarize existing literature on MBC, focusing on mental health and digital health. Specifically, we describe a case example called Ginger, a novel on-demand virtual system for delivering mental health services, and demonstrate how this platform aligns with core principles of MBC in mental health based on existing frameworks and findings from the literature. Additionally, we integrate feedback from multidisciplinary stakeholders (clinical practitioners, data science, product development and management, and research) to develop a perspective on key tenets of measurement in this system. To navigate the challenges of translating traditional measurement tools into new technologies in addition to developing new measurements, this multidisciplinary perspective is required. Ultimately, this will enhance our understanding of mental health and ability to develop interventions to improve outcomes.

Download Full-text

A Convenient and Low-Cost Model of Depression Screening and Early Warning Based on Voice Data Using for Public Mental Health

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18126441 ◽

2021 ◽

Vol 18 (12) ◽

pp. 6441

Author(s):

Xin Chen ◽

Zhigeng Pan

Keyword(s):

Mental Health ◽

Early Warning ◽

Cost Model ◽

Low Cost ◽

Ethical Review ◽

Algorithm Selection ◽

Data Set ◽

Public Data ◽

Voice Data ◽

The Voice

Depression is a common mental health disease, which has great harm to public health. At present, the diagnosis of depression mainly depends on the interviews between doctors and patients, which is subjective, slow and expensive. Voice data are a kind of data that are easy to obtain and have the advantage of low cost. It has been proved that it can be used in the diagnosis of depression. The voice data used for modeling in this study adopted the authoritative public data set, which had passed the ethical review. The features of voice data were extracted by Python programming, and the voice features were stored in the format of CSV files. Through data processing, a big database, containing 1479 voice feature samples, was generated for modeling. Then, the decision tree screening model of depression was established by 10-fold cross validation and algorithm selection. The experiment achieved 83.4% prediction accuracy on voice data set. According to the prediction results of the model, the patients can be given early warning and intervention in time, so as to realize the health management of personal depression.

Download Full-text

Plant PhysioSpace: a robust tool to compare stress response across plant species

10.1101/2020.11.16.384305 ◽

2020 ◽

Author(s):

Ali Hadizadeh Esfahani ◽

Janina Maß ◽

Asis Hallab ◽

Bernhard M. Schuldt ◽

David Nevarez ◽

...

Keyword(s):

Stress Response ◽

A Priori ◽

Plant Stress ◽

Marker Genes ◽

Specific Gene ◽

Data Sets ◽

Data Set ◽

Plant Stress Response ◽

Gene Sets ◽

Public Data

AbstractGeneralization of transcriptomics results can be achieved by comparison across experiments, which is based on integration of interrelated transcriptomics studies into a compendium. Both characterization of the fate of the organism under study as well as distinguishing between generic and specific responses can be gained in such a broader context. We have built such a compendium for plant stress response, which is based on integrating publicly available data sets for plant stress response to generalize results across studies and extract the most robust and meaningful information possible from them.There are numerous methods and tools to analyze such data sets, most focusing on gene-wise dimension reduction of data to obtain marker genes and gene sets, e.g. for pathway analysis. Relying only on isolated biological modules might lead to missing of important confounders and relevant context. Therefore, we have chosen a different approach: Our novel tool, which we called Plant PhysioSpace, provides the ability to compute experimental conditions across species and platforms without a priori reducing the reference information to specific gene-sets. It extracts physiologically relevant signatures from a reference data set, a collection of public data sets, by integrating and transforming heterogeneous reference gene expression data into a set of physiology-specific patterns, called PhysioSpace. New experimental data can be mapped to these PhysioSpaces, resulting in similarity scores, providing quantitative similarity of the new experiment to an a priori compendium.Here we report the implementation of two R packages, one software and one data package, and a shiny web application, which provides plant biologists convenient ways to access the method and a precomputed compendium of more than 900 PhysioSpace basis vectors from 4 different species (Arabidopsis thaliana, Oryza sativa, Glycine max, and Triticum aestivum).The tool reduces the dimensionality of data sample-wise (and not gene-wise), which results in a vector containing all genes. This method is very robust against noise and change of platform while still being sensitive. Plant PhysioSpace can therefore be used as an inter-species or cross-platform similarity measure. We demonstrate that Plant PhysioSpace can successfully translate stress responses between different species and platforms (including single cell technologies).

Download Full-text

Prevalence of Self-Reported Depression Symptoms and Perceived Anxiety Among Community-Dwelling U.S. Adults Reporting Tinnitus

Perspectives of the ASHA Special Interest Groups ◽

10.1044/2020_persp-19-00178 ◽

2020 ◽

Vol 5 (4) ◽

pp. 959-970

Author(s):

Kelly M. Reavis ◽

James A. Henry ◽

Lynn M. Marshall ◽

Kathleen F. Carlson

Keyword(s):

Mental Health ◽

Fold Increase ◽

Community Dwelling ◽

Self Report ◽

Depression Symptoms ◽

Community Based Interventions ◽

Data Set ◽

Health Distress ◽

Validated Questionnaire ◽

Mental Health Distress

Purpose The aim of this study was to examine the relationship between tinnitus and self-reported mental health distress, namely, depression symptoms and perceived anxiety, in adults who participated in the National Health and Nutrition Examinations Survey between 2009 and 2012. A secondary aim was to determine if a history of serving in the military modified the associations between tinnitus and mental health distress. Method This was a cross-sectional study design of a national data set that included 5,550 U.S. community-dwelling adults ages 20 years and older, 12.7% of whom were military Veterans. Bivariable and multivariable logistic regression was used to estimate the association between tinnitus and mental health distress. All measures were based on self-report. Tinnitus and perceived anxiety were each assessed using a single question. Depression symptoms were assessed using the Patient Health Questionnaire, a validated questionnaire. Multivariable regression models were adjusted for key demographic and health factors, including self-reported hearing ability. Results Prevalence of tinnitus was 15%. Compared to adults without tinnitus, adults with tinnitus had a 1.8-fold increase in depression symptoms and a 1.5-fold increase in perceived anxiety after adjusting for potential confounders. Military Veteran status did not modify these observed associations. Conclusions Findings revealed an association between tinnitus and both depression symptoms and perceived anxiety, independent of potential confounders, among both Veterans and non-Veterans. These results suggest, on a population level, that individuals with tinnitus have a greater burden of perceived mental health distress and may benefit from interdisciplinary health care, self-help, and community-based interventions. Supplemental Material https://doi.org/10.23641/asha.12568475

Download Full-text

Facts for Policymakers: Complex Trauma and Mental Health of Children Placed in Foster Care: Highlights from the National Center for Child Traumatic Stress (NCCTS) Core Data Set

PsycEXTRA Dataset ◽

10.1037/e617062012-001 ◽

2011 ◽

Keyword(s):

Mental Health ◽

Foster Care ◽

Traumatic Stress ◽

Complex Trauma ◽

Data Set ◽

Core Data ◽

Child Traumatic Stress

Download Full-text

Implementation Strategy for a Digital Health Tool Influences User Engagement

Diabetes ◽

10.2337/db18-1320-p ◽

2018 ◽

Vol 67 (Supplement 1) ◽

pp. 1320-P

Author(s):

MANSUR SHOMALI ◽

MALINDA PEEPLES

Keyword(s):

Implementation Strategy ◽

Digital Health ◽

User Engagement

Download Full-text

Comparative Analysis of Machine Learning Techniques Using Predictive Modeling

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813999200904164539 ◽

2020 ◽

Vol 13 ◽

Author(s):

Ritu Khandelwal ◽

Hemlata Goyal ◽

Rajveer Singh Shekhawat

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Data Science ◽

Training Data ◽

Machine Learning Techniques ◽

Future Trends ◽

Data Set ◽

Learning Stage ◽

Learning Techniques ◽

Different Types

Introduction: Machine learning is an intelligent technology that works as a bridge between businesses and data science. With the involvement of data science, the business goal focuses on findings to get valuable insights on available data. The large part of Indian Cinema is Bollywood which is a multi-million dollar industry. This paper attempts to predict whether the upcoming Bollywood Movie would be Blockbuster, Superhit, Hit, Average or Flop. For this Machine Learning techniques (classification and prediction) will be applied. To make classifier or prediction model first step is the learning stage in which we need to give the training data set to train the model by applying some technique or algorithm and after that different rules are generated which helps to make a model and predict future trends in different types of organizations. Methods: All the techniques related to classification and Prediction such as Support Vector Machine(SVM), Random Forest, Decision Tree, Naïve Bayes, Logistic Regression, Adaboost, and KNN will be applied and try to find out efficient and effective results. All these functionalities can be applied with GUI Based workflows available with various categories such as data, Visualize, Model, and Evaluate. Result: To make classifier or prediction model first step is learning stage in which we need to give the training data set to train the model by applying some technique or algorithm and after that different rules are generated which helps to make a model and predict future trends in different types of organizations Conclusion: This paper focuses on Comparative Analysis that would be performed based on different parameters such as Accuracy, Confusion Matrix to identify the best possible model for predicting the movie Success. By using Advertisement Propaganda, they can plan for the best time to release the movie according to the predicted success rate to gain higher benefits. Discussion: Data Mining is the process of discovering different patterns from large data sets and from that various relationships are also discovered to solve various problems that come in business and helps to predict the forthcoming trends. This Prediction can help Production Houses for Advertisement Propaganda and also they can plan their costs and by assuring these factors they can make the movie more profitable.

Download Full-text