Data science education through education data: an end-to-end perspective

Author(s):  
A. Ravishankar Rao ◽  
Yashvi Desai ◽  
Kavita Mishra
2020 ◽  
Vol 30 (Supplement_5) ◽  
Author(s):  
J Doetsch ◽  
I Lopes ◽  
R Redinha ◽  
H Barros

Abstract The usage and exchange of “big data” is at the forefront of the data science agenda where Record Linkage plays a prominent role in biomedical research. In an era of ubiquitous data exchange and big data, Record Linkage is almost inevitable, but raises ethical and legal problems, namely personal data and privacy protection. Record Linkage refers to the general merging of data information to consolidate facts about an individual or an event that are not available in a separate record. This article provides an overview of ethical challenges and research opportunities in linking routine data on health and education with cohort data from very preterm (VPT) infants in Portugal. Portuguese, European and International law has been reviewed on data processing, protection and privacy. A three-stage analysis was carried out: i) interplay of threefold law-levelling for Record Linkage at different levels; ii) impact of data protection and privacy rights for data processing, iii) data linkage process' challenges and opportunities for research. A framework to discuss the process and its implications for data protection and privacy was created. The GDPR functions as utmost substantial legal basis for the protection of personal data in Record Linkage, and explicit written consent is considered the appropriate basis for the processing sensitive data. In Portugal, retrospective access to routine data is permitted if anonymised; for health data if it meets data processing requirements declared with an explicit consent; for education data if the data processing rules are complied. Routine health and education data can be linked to cohort data if rights of the data subject and requirements and duties of processors and controllers are respected. A strong ethical context through the application of the GDPR in all phases of research need to be established to achieve Record Linkage between cohort and routine collected records for health and education data of VPT infants in Portugal. Key messages GDPR is the most important legal framework for the protection of personal data, however, its uniform approach granting freedom to its Member states hampers Record Linkage processes among EU countries. The question remains whether the gap between data protection and privacy is adequately balanced at three legal levels to guarantee freedom for research and the improvement of health of data subjects.


Author(s):  
Katherine Leu

Postsecondary education is awash in data. Postsecondary institutions track data on students’ demographics, academic performance, course-taking, and financial aid, and have put these data to use, applying data analytics and data science to issues in college completion. Meanwhile, an extensive amount of higher education data are being collected outside of institutions, opening possibilities for data linkages. Newer sources of postsecondary education data could provide an even richer view of student success and improve equity. To explore this potential, this brief describes existing applications of analytics to student success, presents a framework to structure understanding of postsecondary data topics, suggests potential extensions of these data to student success, and describes practical and ethical challenges.


2020 ◽  
Vol 9 (2) ◽  
pp. 25-36
Author(s):  
Necmi Gürsakal ◽  
Ecem Ozkan ◽  
Fırat Melih Yılmaz ◽  
Deniz Oktay

The interest in data science is increasing in recent years. Data science, including mathematics, statistics, big data, machine learning, and deep learning, can be considered as the intersection of statistics, mathematics and computer science. Although the debate continues about the core area of data science, the subject is a huge hit. Universities have a high demand for data science. They are trying to live up to this demand by opening postgraduate and doctoral programs. Since the subject is a new field, there are significant differences between the programs given by universities in data science. Besides, since the subject is close to statistics, most of the time, data science programs are opened in the statistics departments, and this also causes differences between the programs. In this article, we will summarize the data science education developments in the world and in Turkey specifically and how data science education should be at the graduate level.


JAMIA Open ◽  
2018 ◽  
Vol 1 (2) ◽  
pp. 159-165
Author(s):  
Robert Hoyt ◽  
Victoria Wangia-Anderson

Abstract Objective To discuss and illustrate the utility of two open collaborative data science platforms, and how they would benefit data science and informatics education. Methods and Materials The features of two online data science platforms are outlined. Both are useful for new data projects and both are integrated with common programming languages used for data analysis. One platform focuses more on data exploration and the other focuses on containerizing, visualization, and sharing code repositories. Results Both data science platforms are open, free, and allow for collaboration. Both are capable of visual, descriptive, and predictive analytics Discussion Data science education benefits by having affordable open and collaborative platforms to conduct a variety of data analyses. Conclusion Open collaborative data science platforms are particularly useful for teaching data science skills to clinical and nonclinical informatics students. Commercial data science platforms exist but are cost-prohibitive and generally limited to specific programming languages.


Sign in / Sign up

Export Citation Format

Share Document