scholarly journals EFAR-MMLA: An Evaluation Framework to Assess and Report Generalizability of Machine Learning Models in MMLA

Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2863
Author(s):  
Pankaj Chejara ◽  
Luis P. Prieto ◽  
Adolfo Ruiz-Calleja ◽  
María Jesús Rodríguez-Triana ◽  
Shashi Kant Shankar ◽  
...  

Multimodal Learning Analytics (MMLA) researchers are progressively employing machine learning (ML) techniques to develop predictive models to improve learning and teaching practices. These predictive models are often evaluated for their generalizability using methods from the ML domain, which do not take into account MMLA’s educational nature. Furthermore, there is a lack of systematization in model evaluation in MMLA, which is also reflected in the heterogeneous reporting of the evaluation results. To overcome these issues, this paper proposes an evaluation framework to assess and report the generalizability of ML models in MMLA (EFAR-MMLA). To illustrate the usefulness of EFAR-MMLA, we present a case study with two datasets, each with audio and log data collected from a classroom during a collaborative learning session. In this case study, regression models are developed for collaboration quality and its sub-dimensions, and their generalizability is evaluated and reported. The framework helped us to systematically detect and report that the models achieved better performance when evaluated using hold-out or cross-validation but quickly degraded when evaluated across different student groups and learning contexts. The framework helps to open up a “wicked problem” in MMLA research that remains fuzzy (i.e., the generalizability of ML models), which is critical to both accumulating knowledge in the research community and demonstrating the practical relevance of these techniques.

2020 ◽  
Vol 10 (17) ◽  
pp. 5942 ◽  
Author(s):  
Juan de la Torre ◽  
Javier Marin ◽  
Sergio Ilarri ◽  
Jose J. Marin

Given the exponential availability of data in health centers and the massive sensorization that is expected, there is an increasing need to manage and analyze these data in an effective way. For this purpose, data mining (DM) and machine learning (ML) techniques would be helpful. However, due to the specific characteristics of the field of healthcare, a suitable DM and ML methodology adapted to these particularities is required. The applied methodology must structure the different stages needed for data-driven healthcare, from the acquisition of raw data to decision-making by clinicians, considering the specific requirements of this field. In this paper, we focus on a case study of cervical assessment, where the goal is to predict the potential presence of cervical pain in patients affected with whiplash diseases, which is important for example in insurance-related investigations. By analyzing in detail this case study in a real scenario, we show how taking care of those particularities enables the generation of reliable predictive models in the field of healthcare. Using a database of 302 samples, we have generated several predictive models, including logistic regression, support vector machines, k-nearest neighbors, gradient boosting, decision trees, random forest, and neural network algorithms. The results show that it is possible to reliably predict the presence of cervical pain (accuracy, precision, and recall above 90%). We expect that the procedure proposed to apply ML techniques in the field of healthcare will help technologists, researchers, and clinicians to create more objective systems that provide support to objectify the diagnosis, improve test treatment efficacy, and save resources.


Author(s):  
Xiaowei Wang ◽  
YeongAe Heo

Abstract Machine learning (ML) approaches have gained increasing attention in the structural engineering field to evaluate structural performance using structural health monitoring (SHM) data. Supervised ML approaches can accelerate the learning process by using labeled training datasets to map an input to output dataset. But, SHM data are not informative to drive a mapping function to determine the real-world performance of large-scale complex structures in particular for future events. To leverage a framework for evaluating the system-level structural performance, this study couples supervised ML approaches with an advanced finite element (FE) model considering pre- and post-event model validation and updating. A well-instrumented system experiencing multiple seismic events is employed as a case study to demonstrate the proposed framework. An FE model of the instrumented system is created and validated using pre-event SHM datasets. Numerical data obtained from the FE model are used for datasets to develop ML prediction models, which are then validated by a post-event SHM dataset. Eight popular ML algorithms are examined and compared to shed light on the effectiveness of the ML algorithms for the proposed framework. The case study results indicate that the Random Forests and Neural Network algorithms provide better estimation for the structural system. The results also imply the need of post-event updating for numerical models used in the case study.


2021 ◽  
Vol 5 (1) ◽  
pp. 76-85
Author(s):  
Maisha Islam ◽  
Tiffany-Lily Burnett ◽  
Sarah-Louise Collins

This case study describes a staff-student partnership project from the perspective of three staff members based across independent departments within a UK higher education institution (HEI) and its students’ union. The authors, drawing upon an intersecting passion for advancing student equality, diversity, inclusion, widening participation and student engagement, developed a cross-collaborative and student-centred partnership project to create a series of guides specifically for underrepresented student groups. The guides, which sought to provide appropriate information and guidance in order to actively enhance students’ overall experience whilst navigating university life, were developed and co-created through lived student experience. This case study critically reflects upon this form of partnership, along with its benefits and challenges, and considers its contribution to literature on staff-student partnership beyond the formal realm of learning and teaching.


2021 ◽  
Author(s):  
Norberto Sánchez-Cruz ◽  
Jose L. Medina-Franco

<p>Epigenetic targets are a significant focus for drug discovery research, as demonstrated by the eight approved epigenetic drugs for treatment of cancer and the increasing availability of chemogenomic data related to epigenetics. This data represents a large amount of structure-activity relationships that has not been exploited thus far for the development of predictive models to support medicinal chemistry efforts. Herein, we report the first large-scale study of 26318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity. Through a systematic comparison of machine learning models trained on molecular fingerprints of different design, we built predictive models with high accuracy for the epigenetic target profiling of small molecules. The models were thoroughly validated showing mean precisions up to 0.952 for the epigenetic target prediction task. Our results indicate that the herein reported models have considerable potential to identify small molecules with epigenetic activity. Therefore, our results were implemented as freely accessible and easy-to-use web application.</p>


Sign in / Sign up

Export Citation Format

Share Document