scholarly journals Causal discovery in machine learning: Theories and applications

2021 ◽  
Vol 0 (0) ◽  
pp. 0
Author(s):  
Ana Rita Nogueira ◽  
João Gama ◽  
Carlos Abreu Ferreira
Author(s):  
Dómhnall J. Jennings ◽  
Eduardo Alonso ◽  
Esther Mondragón ◽  
Charlotte Bonardi

Standard associative learning theories typically fail to conceptualise the temporal properties of a stimulus, and hence cannot easily make predictions about the effects such properties might have on the magnitude of conditioning phenomena. Despite this, in intuitive terms we might expect that the temporal properties of a stimulus that is paired with some outcome to be important. In particular, there is no previous research addressing the way that fixed or variable duration stimuli can affect overshadowing. In this chapter we report results which show that the degree of overshadowing depends on the distribution form - fixed or variable - of the overshadowing stimulus, and argue that conditioning is weaker under conditions of temporal uncertainty. These results are discussed in terms of models of conditioning and timing. We conclude that the temporal difference model, which has been extensively applied to the reinforcement learning problem in machine learning, accounts for the key findings of our study.


2019 ◽  
Vol 19 (228) ◽  
Author(s):  
Andrew Tiffin

Machine learning tools are well known for their success in prediction. But prediction is not causation, and causal discovery is at the core of most questions concerning economic policy. Recently, however, the literature has focused more on issues of causality. This paper gently introduces some leading work in this area, using a concrete example—assessing the impact of a hypothetical banking crisis on a country’s growth. By enabling consideration of a rich set of potential nonlinearities, and by allowing individually-tailored policy assessments, machine learning can provide an invaluable complement to the skill set of economists within the Fund and beyond.


Author(s):  
Fernando Antonanzas-Torres ◽  
Andres Sanz-Garcia ◽  
Javier Antonanzas-Torres ◽  
Oscar Perpiñán-Lamiguero ◽  
Francisco Javier Martínez-de-Pisón-Ascacibar

Most of the research on estimating Solar Global Irradiation (SGI) is based on the development of parametric models. However, the use of methods based on the use of statistics and machine-learning theories can provide a significant improvement in reducing the prediction errors. The chapter evaluates the performance of different Soft Computing (SC) methods, such as support vector regression and artificial neural networks-multilayer perceptron, in SGI modeling against classical parametric and lineal models. SC methods demonstrate a higher generalization capacity applied to SGI modeling than classic parametric models. As a result, SC models suppose an alternative to satellite-derived models to estimate SGI in near-to-present time in areas in which no pyranometers are installed nearby.


Author(s):  
Zhaneta Tasheva ◽  
Rosen Bogdanov

The relationship between cognitivism as learning theory in education and machine learning is characterized in this survey paper. The cognitivism describes how learning occurs through internal processing of information and thus leads to understanding and retention. Cognitive information processing plays an active role to understand and process information that learner receives and relates it to already known and stored within learner’s memory. Thus, the cognitive approach defines learning as a change in knowledge which is stored in learner’s memory, and not a change in learner’s behaviour. In regard with importance of various learning problems to designing cognitive communications systems the two main classification categories of learning techniques are explained. Furthermore, the cognitive radio learning algorithms that have been proposed are described. Finally, the similarities and differences among the principles of learning theories and machine learning are discussed.


AERA Open ◽  
2020 ◽  
Vol 6 (4) ◽  
pp. 233285842097720
Author(s):  
Shayan Doroudi

In addition to providing a set of techniques to analyze educational data, I claim that data science as a field can provide broader insights to education research. In particular, I show how the bias-variance tradeoff from machine learning can be formally generalized to be applicable to several prominent educational debates, including debates around learning theories (cognitivist vs. situativist and constructivist theories) and pedagogy (direct instruction vs. discovery learning). I then look to see how various data science techniques that have been proposed to navigate the bias-variance tradeoff can yield insights for productively navigating these educational debates going forward.


2020 ◽  
Author(s):  
Shayan Doroudi

In addition to providing a set of techniques to analyze educational data, I claim that data science as a field can provide broader insights to education research. In particular, I show how the bias-variance tradeoff from machine learning can be formally generalized to be applicable to several prominent educational debates, including debates around learning theories (cognitivist vs. situativist and constructivist theories) and pedagogy (direct instruction vs. discovery learning). We then look to see how various data science techniques that have been proposed to navigate the bias-variance tradeoff can yield insights for productively navigating these educational debates going forward.


2021 ◽  
Vol 2021 ◽  
pp. 1-17
Author(s):  
Qian Luo ◽  
Lin Zhang ◽  
Zhiwei Xing ◽  
Huan Xia ◽  
Zhao-Xin Chen

The development of the civil aviation industry has continuously increased the requirements for the efficiency of airport ground support services. In the existing ground support research, there has not yet been a process model that directly obtains support from the ground support log to study the causal relationship between service nodes and flight delays. Most ground support studies mainly use machine learning methods to predict flight delays, and the flight support model they are based on is an ideal model. The study did not conduct an in-depth study of the causal mechanism behind the ground support link and did not reveal the true cause of flight delays. Therefore, there is a certain deviation in the prediction of flight delays by machine learning, and there is a certain deviation between the ideal model based on the research and the actual service process. Therefore, it is of practical significance to obtain the process model from the guarantee log and analyze its causality. However, the existing process causal factor discovery methods only do certain research when the assumption of causal sufficiency is established and does not consider the existence of latent variables. Therefore, this article proposes a framework to realize the discovery of process causal factors without assuming causal sufficiency. The optimized fuzzy mining process model is used as the service benchmark model, and the local causal discovery algorithm is used to discover the causal factors. Under this framework, this paper proposes a new Markov blanket discovery algorithm that does not assume causal sufficiency to discover causal factors and uses benchmark data sets for testing. Finally, the actual flight service data are used for causal discovery among flight service nodes. The local causal discovery algorithm proposed in this paper has a certain competitive advantage in accuracy, F1, and other aspects of the existing causal discovery algorithm. It avoids the occurrence of its dimensional disaster. Through the in-depth analysis of the flight safety reason node discovered by this method, it is found that the unreasonable scheduling of flight support personnel is an important reason for frequent flight delays at the airport.


Author(s):  
Svitlana Volkova ◽  
Dustin Arendt ◽  
Emily Saldanha ◽  
Maria Glenski ◽  
Ellyn Ayton ◽  
...  

AbstractGround Truth program was designed to evaluate social science modeling approaches using simulation test beds with ground truth intentionally and systematically embedded to understand and model complex Human Domain systems and their dynamics Lazer et al. (Science 369:1060–1062, 2020). Our multidisciplinary team of data scientists, statisticians, experts in Artificial Intelligence (AI) and visual analytics had a unique role on the program to investigate accuracy, reproducibility, generalizability, and robustness of the state-of-the-art (SOTA) causal structure learning approaches applied to fully observed and sampled simulated data across virtual worlds. In addition, we analyzed the feasibility of using machine learning models to predict future social behavior with and without causal knowledge explicitly embedded. In this paper, we first present our causal modeling approach to discover the causal structure of four virtual worlds produced by the simulation teams—Urban Life, Financial Governance, Disaster and Geopolitical Conflict. Our approach adapts the state-of-the-art causal discovery (including ensemble models), machine learning, data analytics, and visualization techniques to allow a human-machine team to reverse-engineer the true causal relations from sampled and fully observed data. We next present our reproducibility analysis of two research methods team’s performance using a range of causal discovery models applied to both sampled and fully observed data, and analyze their effectiveness and limitations. We further investigate the generalizability and robustness to sampling of the SOTA causal discovery approaches on additional simulated datasets with known ground truth. Our results reveal the limitations of existing causal modeling approaches when applied to large-scale, noisy, high-dimensional data with unobserved variables and unknown relationships between them. We show that the SOTA causal models explored in our experiments are not designed to take advantage from vasts amounts of data and have difficulty recovering ground truth when latent confounders are present; they do not generalize well across simulation scenarios and are not robust to sampling; they are vulnerable to data and modeling assumptions, and therefore, the results are hard to reproduce. Finally, when we outline lessons learned and provide recommendations to improve models for causal discovery and prediction of human social behavior from observational data, we highlight the importance of learning data to knowledge representations or transformations to improve causal discovery and describe the benefit of causal feature selection for predictive and prescriptive modeling.


2021 ◽  
Vol 14 (1) ◽  
pp. 10
Author(s):  
Leonardo Ranaldi ◽  
Francesca Fallucchi ◽  
Fabio Massimo Zanzotto

Modern AI technologies make use of statistical learners that lead to self-empiricist logic, which, unlike human minds, use learned non-symbolic representations. Nevertheless, it seems that it is not the right way to progress in AI. The structure of symbols—the operations by which the intellectual solution is realized—and the search for strategic reference points evoke important issues in the analysis of AI. Studying how knowledge can be represented through methods of theoretical generalization and empirical observation is only the latest step in a long process of evolution. For many years, humans, seeing language as innate, have carried out symbolic theories. Everything seems to have skipped ahead with the advent of Machine Learning. In this paper, after a long analysis of history, the rule-based and the learning-based vision, we would investigate the syntax as possible meeting point between the different learning theories. Finally, we propose a new vision of knowledge in AI models based on a combination of rules, learning, and human knowledge.


2020 ◽  
Vol 43 ◽  
Author(s):  
Myrthe Faber

Abstract Gilead et al. state that abstraction supports mental travel, and that mental travel critically relies on abstraction. I propose an important addition to this theoretical framework, namely that mental travel might also support abstraction. Specifically, I argue that spontaneous mental travel (mind wandering), much like data augmentation in machine learning, provides variability in mental content and context necessary for abstraction.


Sign in / Sign up

Export Citation Format

Share Document