Defining Data Science by a Data-Driven Quantification of the Community

Frank Emmert-Streib; Matthias Dehmer

doi:10.3390/make1010015

Defining Data Science by a Data-Driven Quantification of the Community

Machine Learning and Knowledge Extraction ◽

10.3390/make1010015 ◽

2018 ◽

Vol 1 (1) ◽

pp. 235-251 ◽

Cited By ~ 15

Author(s):

Frank Emmert-Streib ◽

Matthias Dehmer

Keyword(s):

Quantitative Analysis ◽

Data Science ◽

Data Driven ◽

Statistical Regression ◽

Important Research ◽

Academic Field ◽

Research Fields ◽

Science Community ◽

Fully Automatic ◽

Using Data

Data science is a new academic field that has received much attention in recent years. One reason for this is that our increasingly digitalized society generates more and more data in all areas of our lives and science and we are desperately seeking for solutions to deal with this problem. In this paper, we investigate the academic roots of data science. We are using data of scientists and their citations from Google Scholar, who have an interest in data science, to perform a quantitative analysis of the data science community. Furthermore, for decomposing the data science community into its major defining factors corresponding to the most important research fields, we introduce a statistical regression model that is fully automatic and robust with respect to a subsampling of the data. This statistical model allows us to define the ‘importance’ of a field as its predictive abilities. Overall, our method provides an objective answer to the question ‘What is data science?’.

Download Full-text

Data-Driven Management and Interoperable Metrics for Special Collections and Archives User Services

RBM A Journal of Rare Books Manuscripts and Cultural Heritage ◽

10.5860/rbm.13.2.379 ◽

2012 ◽

Vol 13 (2) ◽

pp. 129-151 ◽

Cited By ~ 3

Author(s):

Joyce Chapman ◽

Elizabeth Yakel

Keyword(s):

Decision Making ◽

Quantitative Analysis ◽

Data Driven ◽

Significant Challenge ◽

Special Collections ◽

Quantitative Metrics ◽

Day To Day Operations ◽

Using Data ◽

Institutional Boundaries ◽

Operational Data

While special collections and archives managers have at times recognized the importance of using data to drive decision making, translating this objective into reality and integrating data analysis into day-to-day operations has proven to be a significant challenge. There have also been obstacles to formulating quantitative metrics for special collections and archives and rendering them interoperable across institutional boundaries. This article attempts to focus a conversation around two issues: 1) the importance of quantitative analysis of operational data for improving research services in special collections and archives; and 2) the need for the profession to achieve consensus on definitions for . . .

Download Full-text

Four Generations in Data Engineering for Data Science

Datenbank-Spektrum ◽

10.1007/s13222-021-00399-3 ◽

2021 ◽

Author(s):

Meike Klettke ◽

Uta Störl

Keyword(s):

Data Science ◽

Data Curation ◽

Data Driven ◽

Environmental Sciences ◽

Scientific Methods ◽

Domain Experts ◽

The Past ◽

Research Fields ◽

Data Engineering ◽

The Moment

AbstractData-driven methods and data science are important scientific methods in many research fields. All data science approaches require professional data engineering components. At the moment, computer science experts are needed for solving these data engineering tasks. Simultaneously, scientists from many fields (like natural sciences, medicine, environmental sciences, and engineering) want to analyse their data autonomously. The arising task for data engineering is the development of tools that can support an automated data curation and are utilisable for domain experts. In this article, we will introduce four generations of data engineering approaches classifying the data engineering technologies of the past and presence. We will show which data engineering tools are needed for the scientific landscape of the next decade.

Download Full-text

Using Data Expedition as a Formative Assessment Tool in Data Science Education: Reasoning, Justification, and Evaluation

International Journal of Emerging Technologies in Learning (iJET) ◽

10.3991/ijet.v14i11.10202 ◽

2019 ◽

Vol 14 (11) ◽

pp. 107

Author(s):

Olga Maksimenkova ◽

Alexey Neznanov ◽

Irina Radchenko

Keyword(s):

Science Education ◽

Digital Media ◽

Data Science ◽

Evaluation Method ◽

Assessment Tool ◽

Structural Features ◽

Data Driven ◽

Distinctive Features ◽

Accurate Evaluation ◽

Using Data

The paper addresses the questions of data science education of current im-portance. It aims to introduce and justify the framework that allows flexibly evaluate the processes of a data expedition and a digital media created during it. For these purposes, the authors explore features of digital media artefacts which are specific to data expeditions and are essential to accurate evaluation. The ru-brics as a power but hardly formalizable evaluation method in application to digi-tal media artefacts are also discussed. Moreover, the paper documents the experi-ence of rubrics creation according to the suggested framework. The rubrics were successfully adopted to two data-driven journalism courses. The authors also formulate recommendations on data expedition evaluation which should take into consideration structural features of a data expedition, distinctive features of digital media, etc.

Download Full-text

Development of a Pediatric Early Warning System Using Data-Driven Vital Signs

PEDIATRICS ◽

10.1542/peds.137.supplement_3.256a ◽

2016 ◽

Vol 137 (Supplement 3) ◽

pp. 256A-256A

Author(s):

Catherine Ross ◽

Iliana Harrysson ◽

Lynda Knight ◽

Veena Goel ◽

Sarah Poole ◽

...

Keyword(s):

Early Warning ◽

Early Warning System ◽

Vital Signs ◽

Warning System ◽

Data Driven ◽

Using Data

Download Full-text

A Coordinated Tracking Control of Multi-Agent Systems Using Data-Driven Methods

2018 37th Chinese Control Conference (CCC) ◽

10.23919/chicc.2018.8483722 ◽

2018 ◽

Author(s):

You Wu ◽

Guo-Ping Liu

Keyword(s):

Tracking Control ◽

Data Driven ◽

Multi Agent Systems ◽

Agent Systems ◽

Multi Agent ◽

Using Data ◽

Coordinated Tracking

Download Full-text

ACCURATELY ESTIMATING SHEAR SLOWNESS USING DATA-DRIVEN QUADRUPOLE SONIC LOGGING-WHILE-DRILLING DATA PROCESSING

10.30632/t60als-2019_u ◽

2019 ◽

Author(s):

Ruijia Wang ◽

Richard Coates

Keyword(s):

Data Processing ◽

Data Driven ◽

Drilling Data ◽

Logging While Drilling ◽

Using Data ◽

Sonic Logging

Download Full-text

Review for "Adaptive frequency control support of a DFIG based on second-order derivative controller using data-driven method"

10.1002/2050-7038.12424/v2/review1 ◽

2020 ◽

Keyword(s):

Frequency Control ◽

Second Order ◽

Data Driven ◽

Order Derivative ◽

Using Data

Download Full-text

Decision letter for "Adaptive frequency control support of a DFIG based on second-order derivative controller using data-driven method"

10.1002/2050-7038.12424/v2/decision1 ◽

2020 ◽

Keyword(s):

Frequency Control ◽

Second Order ◽

Data Driven ◽

Order Derivative ◽

Using Data

Download Full-text

Enhanced Resilient State Estimation Using Data-Driven Auxiliary Models

IEEE Transactions on Industrial Informatics ◽

10.1109/tii.2019.2924246 ◽

2020 ◽

Vol 16 (1) ◽

pp. 639-647 ◽

Cited By ~ 6

Author(s):

Olugbenga Moses Anubi ◽

Charalambos Konstantinou

Keyword(s):

State Estimation ◽

Data Driven ◽

Using Data

Download Full-text

Lifestyle carbon footprints and changes in lifestyles to limit global warming to 1.5 °C, and ways forward for related research

Sustainability Science ◽

10.1007/s11625-021-01018-6 ◽

2021 ◽

Author(s):

Ryu Koide ◽

Michael Lettenmeier ◽

Lewis Akenji ◽

Viivi Toivio ◽

Aryanie Amellina ◽

...

Keyword(s):

Global Warming ◽

Quantitative Analysis ◽

Sustainability Science ◽

Scenario Development ◽

Carbon Footprints ◽

Related Research ◽

Living Lab ◽

Per Capita ◽

Using Data ◽

Subnational Analysis

AbstractThis paper presents an approach for assessing lifestyle carbon footprints and lifestyle change options aimed at achieving the 1.5 °C climate goal and facilitating the transition to decarbonized lifestyles through stakeholder participatory research. Using data on Finland and Japan it shows potential impacts of reducing carbon footprints through changes in lifestyles for around 30 options covering food, housing, and mobility domains, in comparison with the 2030 and 2050 per-capita targets (2.5–3.2 tCO2e by 2030; 0.7–1.4 tCO2e by 2050). It discusses research opportunities for expanding the footprint-based quantitative analysis to incorporate subnational analysis, living lab, and scenario development aiming at advancing sustainability science on the transition to decarbonized lifestyles.

Download Full-text