CiDAEN: An Online Data Science Course

Author(s):  
Luis de la Ossa ◽  
Pablo Bermejo ◽  
Juan A. Aledo ◽  
José A. Gámez ◽  
José M. Puerta ◽  
...  
Keyword(s):  
JAMIA Open ◽  
2018 ◽  
Vol 1 (2) ◽  
pp. 159-165
Author(s):  
Robert Hoyt ◽  
Victoria Wangia-Anderson

Abstract Objective To discuss and illustrate the utility of two open collaborative data science platforms, and how they would benefit data science and informatics education. Methods and Materials The features of two online data science platforms are outlined. Both are useful for new data projects and both are integrated with common programming languages used for data analysis. One platform focuses more on data exploration and the other focuses on containerizing, visualization, and sharing code repositories. Results Both data science platforms are open, free, and allow for collaboration. Both are capable of visual, descriptive, and predictive analytics Discussion Data science education benefits by having affordable open and collaborative platforms to conduct a variety of data analyses. Conclusion Open collaborative data science platforms are particularly useful for teaching data science skills to clinical and nonclinical informatics students. Commercial data science platforms exist but are cost-prohibitive and generally limited to specific programming languages.


2018 ◽  
Author(s):  
Jen Schradie

With a growing interest in data science and online analytics, researchers are increasingly using data derived from the Internet. Whether for qualitative or quantitative analysis, online data, including “Big Data,” can often exclude marginalized populations, especially those from the poor and working class, as the digital divide remains a persistent problem. This methodological commentary on the current state of digital data and methods disentangles the hype from the reality of digitally produced data for sociological research. In the process, it offers strategies to address the weaknesses of data that is derived from the Internet in order to represent marginalized populations.


2019 ◽  
Vol 112 (6) ◽  
pp. 473-476 ◽  
Author(s):  
Gemma F. Mojica ◽  
Christina N. Azmy ◽  
Hollylynne S. Lee

Concord Consortium's Common Online Data Analysis Platform (CODAP), a free Web-based data tool designed for students in grades 6-12 and higher, is continuously being updated and developed for diverse projects in data science, science education, and mathematics/statistics education (https://codap.concord.org/). Teachers and students can access CODAP without downloading software or registering for accounts. Although some Web-based technology tools provide certain features for free and require users to pay a fee to use additional features, CODAP has no hidden costs. Devices need only be connected to the Internet using an updated Web browser (Chrome is preferred). CODAP is not optimized (yet) for use on such touchscreen devices as tablets or iPads®.


Author(s):  
Sean Kross ◽  
Roger D Peng ◽  
Brian S Caffo ◽  
Ira Gooding ◽  
Jeffrey T Leek

Over the last three decades data has become ubiquitous and cheap. This transition has accelerated over the last five years and training in statistics, machine learning, and data analysis have struggled to keep up. In April 2014 we launched a program of nine courses, the Johns Hopkins Data Science Specialization, which has now had more than 4 million enrollments over the past three years. Here the program is described and compared to both standard and more recently developed data science curricula. We show that novel pedagogical and administrative decisions introduced in our program are now standard in online data science programs. The impact of the Data Science Specialization on data science education in the US is also discussed. Finally we conclude with some thoughts about the future of data science education in a data democratized world.


Author(s):  
Sean Kross ◽  
Roger D Peng ◽  
Brian S Caffo ◽  
Ira Gooding ◽  
Jeffrey T Leek

Over the last three decades data has become ubiquitous and cheap. This transition has accelerated over the last five years and training in statistics, machine learning, and data analysis have struggled to keep up. In April 2014 we launched a program of nine courses, the Johns Hopkins Data Science Specialization, which has now had more than 4 million enrollments over the past three years. Here the program is described and compared to both standard and more recently developed data science curricula. We show that novel pedagogical and administrative decisions introduced in our program are now standard in online data science programs. The impact of the Data Science Specialization on data science education in the US is also discussed. Finally we conclude with some thoughts about the future of data science education in a data democratized world.


2021 ◽  
Vol 0 (0) ◽  
pp. 0
Author(s):  
Lovisa Sundin ◽  
Nourhan Sakr ◽  
Juho Leinonen ◽  
Quintin Cutts

<p style='text-indent:20px;'>With the rising demand for data science skills, the ability to wrangle data programmatically becomes a crucial barrier. In this paper, we discuss the centrality of API (application programming interface) lookup to data wrangling, and how an ontology-structured command menu could facilitate it. We design thumbnail graphics as visual alternatives to explaining data wrangling operations and use a survey to validate their quality. We furthermore predict that thumbnail graphics make the menu more navigable, improving lookup efficiency and performance. Our predictions are tested using Slice N Dice, an online data wrangling tutorial platform that collects learner activity. It includes both non-programmatic and programmatic data wrangling exercises. Participants from a multi-institutional sample (<i>n</i> = 200) were randomly assigned the tutorial either with or without thumbnail graphics. Our results show that thumbnail graphics reduce the need for clarifications, thereby assisting API lookup for novices learning data wrangling. We further present some negative results regarding performance gain and follow up with a discussion on why the differences are subtle and how they can be improved. Last but not least, we complement our statistical results with a qualitative study where we receive positive feedback from our participants on the design and helpfulness of the thumbnail graphics.</p>


Author(s):  
Charles Bouveyron ◽  
Gilles Celeux ◽  
T. Brendan Murphy ◽  
Adrian E. Raftery

2020 ◽  
Vol 41 (1) ◽  
pp. 30-36
Author(s):  
Steven V. Rouse

Abstract. Previous research has supported the use of Amazon’s Mechanical Turk (MTurk) for online data collection in individual differences research. Although MTurk Masters have reached an elite status because of strong approval ratings on previous tasks (and therefore gain higher payment for their work) no research has empirically examined whether researchers actually obtain higher quality data when they require that their MTurk Workers have Master status. In two different online survey studies (one using a personality test and one using a cognitive abilities test), the psychometric reliability of MTurk data was compared between a sample that required a Master qualification type and a sample that placed no status-level qualification requirement. In both studies, the Master samples failed to outperform the standard samples.


Sign in / Sign up

Export Citation Format

Share Document