scholarly journals Four Generations in Data Engineering for Data Science

Author(s):  
Meike Klettke ◽  
Uta Störl

AbstractData-driven methods and data science are important scientific methods in many research fields. All data science approaches require professional data engineering components. At the moment, computer science experts are needed for solving these data engineering tasks. Simultaneously, scientists from many fields (like natural sciences, medicine, environmental sciences, and engineering) want to analyse their data autonomously. The arising task for data engineering is the development of tools that can support an automated data curation and are utilisable for domain experts. In this article, we will introduce four generations of data engineering approaches classifying the data engineering technologies of the past and presence. We will show which data engineering tools are needed for the scientific landscape of the next decade.

2021 ◽  
pp. 026638212110619
Author(s):  
Sharon Richardson

During the past two decades, there have been a number of breakthroughs in the fields of data science and artificial intelligence, made possible by advanced machine learning algorithms trained through access to massive volumes of data. However, their adoption and use in real-world applications remains a challenge. This paper posits that a key limitation in making AI applicable has been a failure to modernise the theoretical frameworks needed to evaluate and adopt outcomes. Such a need was anticipated with the arrival of the digital computer in the 1950s but has remained unrealised. This paper reviews how the field of data science emerged and led to rapid breakthroughs in algorithms underpinning research into artificial intelligence. It then discusses the contextual framework now needed to advance the use of AI in real-world decisions that impact human lives and livelihoods.


2019 ◽  
Vol 5 (1) ◽  
Author(s):  
Alberto Prieto

Can data-driven approaches help researchers reconstruct Roman history? Scientific methods are now being used to reexamine ancient slavery, wealth distribution, health, and the costs of trade. Such approaches are demonstrated in The Science of Roman History: Biology, Climate, and the Future of the Past, edited by Walter Scheidel. But Alberto Prieto finds not enough of the book’s data to be Roman.


Author(s):  
Antoine Mazières ◽  
Camille Roth

The study of surnames as both linguistic and geographical markers of the past has proven valuable in several research fields spanning from biology and genetics to demography and social mobility. This article builds on the existing literature to conceive and develop a surname origin classifier based on a data-driven typology. This enables us to explore a methodology to describe large-scale estimates of the relative diversity of social groups, especially when such data is scarcely available. We subsequently analyze the representativeness of surname origins for 15 socio-professional groups in France.


2018 ◽  
Vol 1 (1) ◽  
pp. 235-251 ◽  
Author(s):  
Frank Emmert-Streib ◽  
Matthias Dehmer

Data science is a new academic field that has received much attention in recent years. One reason for this is that our increasingly digitalized society generates more and more data in all areas of our lives and science and we are desperately seeking for solutions to deal with this problem. In this paper, we investigate the academic roots of data science. We are using data of scientists and their citations from Google Scholar, who have an interest in data science, to perform a quantitative analysis of the data science community. Furthermore, for decomposing the data science community into its major defining factors corresponding to the most important research fields, we introduce a statistical regression model that is fully automatic and robust with respect to a subsampling of the data. This statistical model allows us to define the ‘importance’ of a field as its predictive abilities. Overall, our method provides an objective answer to the question ‘What is data science?’.


Abstract Artificial intelligence (AI) and machine learning (ML) have become important tools for environmental scientists and engineers, both in research and in applications. Although these methods have become quite popular in recent years, they are not new. The use of AI methods began in the 1950s and environmental scientists were adopting them by the 1980s. Although an “AI Winter” temporarily slowed the growth, a more recent resurgence has brought it back with gusto. This paper tells the story of the evolution of AI in the field through the lens of the AMS Committee on Artificial Intelligence Applications to Environmental Science. The environmental sciences possess a host of problems amenable to advancement by intelligent techniques. We review a few of the early applications along with the ML methods of the time and how their progression has impacted these sciences. While AI methods have changed from expert systems in the eighties to neural networks and other data-driven methods, and more recently deep learning, the environmental problems tackled have remained similar. We discuss the types of applications that have shown some of the biggest advances due to AI usage and how they have evolved over the past decades, including topics in weather forecasting, probabilistic prediction, climate estimation, optimization problems, image processing, and improving forecasting models. We finish with a look at where AI as employed in environmental science appears to be headed and some thoughts on how it might be best blended with physical / dynamical modeling approaches to further advance our science.


Chelovek RU ◽  
2020 ◽  
pp. 18-53
Author(s):  
Sergei Avanesov ◽  

Abstract. The article analyzes the autobiography of the famous Russian philosopher, theologian and scientist Pavel Florensky, as well as those of his texts that retain traces of memories. According to Florensky, the personal biography is based on family history and continues in children. He addresses his own biography to his children. Memories based on diary entries are designed as a memory diary, that is, as material for future memories. The past becomes actual in autobiography, turns into a kind of present. The past, from the point of view of its realization in the present, gains meaning and significance. The au-thor is active in relation to his own past, transforming it from a collection of disparate facts into a se-quence of events. A person can only see the true meaning of such events from a great distance. Therefore, the philosopher remembers not so much the circumstances of his life as the inner impressions of the en-counter with reality. The most powerful personality-forming experiences are associated with childhood. Even the moment of birth can decisively affect the character of a person and the range of his interests. The foundations of a person's worldview are laid precisely in childhood. Florensky not only writes mem-oirs about himself, but also tries to analyze the problems of time and memory. A person is immersed in time, but he is able to move into the past through memory and into the future through faith. An autobi-ography can never be written to the end because its author lives on. However, reaching the depths of life, he is able to build his path in such a way that at the end of this path he will unite with the fullness of time, with eternity.


CounterText ◽  
2020 ◽  
Vol 6 (1) ◽  
pp. 26-32
Author(s):  
Norbert Bugeja

In this retrospective piece, the Guest Editor of the first number of CounterText (a special issue titled Postcolonial Springs) looks back at the past five years from various scholarly and personal perspectives. He places particular focus on an event that took place mid-way between the 2011 uprisings across a number of Arab countries and the moment of writing: the March 2015 terror attack on the Bardo National Museum in Tunis, which killed twenty-two people and had a profound effect on Tunisian popular consciousness and that of the post-2011 Arab nations. In this context, the author argues for a renewed perspective on memoir as at once a memorial practice and a political gesture in writing, one that exceeds concerns of genre and form to encompass an ongoing project of political re-cognition following events that continue to remap the agenda for the region. The piece makes a brief final pitch for Europe's need to re-cognise, within those modes of ‘articulacy-in-difficulty’ active on its southern borders, specific answers to its own present quandaries.


Author(s):  
Rafael Komiljonov

The article examines the Genesis of the institution of jury trial in the Russian Empire from the moment of its introduction to the end of the Provisional government. It is noted that the emergence of a trial with the participation of jurors was influenced by Western models of the judicial process, and the forms of participation of citizens in the administration of justice that previously existed on the territory of the Russian state were taken into account. The role that the jury system has played with some success in the search for truth, justice, and the implementation of effective and independent justice in the past centuries is particularly highlighted.


2021 ◽  
Vol 13 (12) ◽  
pp. 6779
Author(s):  
Aleksander Owerczuk

In the past centuries, small towns in Podlaskie Voivodeship were dominated by wooden buildings. Nowadays, there are not many of them left. However, they can still be found in the centers of towns, including some market squares. These are often inconspicuous objects, mainly wooden houses. This paper discusses the issue of the significance of wooden buildings, especially houses, in maintaining and restoring historic values of market spaces in small towns of Podlaskie Voivodeship in the examples of Bielsk Podlaski and Kleszczele. The research determined the moment of rapid changes, during which most marketplace buildings lost their historic form. The existing condition was analyzed in terms of its historical values. Conclusions were formulated on the scope and type of restoration works for individual market squares. Finally, general conclusions from the research on the market squares of Bielsk Podlaski and Kleszczele were presented.


Animals ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 2323
Author(s):  
Lloyd A. Courtenay ◽  
Darío Herranz-Rodrigo ◽  
José Yravedra ◽  
José Mª Vázquez-Rodríguez ◽  
Rosa Huguet ◽  
...  

Human populations have been known to develop complex relationships with large carnivore species throughout time, with evidence of both competition and collaboration to obtain resources throughout the Pleistocene. From this perspective, many archaeological and palaeontological sites present evidence of carnivore modifications to bone. In response to this, specialists in the study of microscopic bone surface modifications have resorted to the use of 3D modeling and data science techniques for the inspection of these elements, reaching novel limits for the discerning of carnivore agencies. The present research analyzes the tooth mark variability produced by multiple Iberian wolf individuals, with the aim of studying how captivity may affect the nature of tooth marks left on bone. In addition to this, four different populations of both wild and captive Iberian wolves are also compared for a more in-depth comparison of intra-species variability. This research statistically shows that large canid tooth pits are the least affected by captivity, while tooth scores appear more superficial when produced by captive wolves. The superficial nature of captive wolf tooth scores is additionally seen to correlate with other metric features, thus influencing overall mark morphologies. In light of this, the present study opens a new dialogue on the reasons behind this, advising caution when using tooth scores for carnivore identification and contemplating how elements such as stress may be affecting the wolves under study.


Sign in / Sign up

Export Citation Format

Share Document