Interpretable, Data-Efficient and Verifiable Autonomy with High-Level Knowledge

2020 ◽  
Author(s):  
Zhe Xu

Despite the fact that artificial intelligence boosted with data-driven methods (e.g., deep neural networks) has surpassed human-level performance in various tasks, its application to autonomous systems still faces fundamental challenges such as lack of interpretability, intensive need for data and lack of verifiability. In this overview paper, I survey some attempts to address these fundamental challenges by explaining, guiding and verifying autonomous systems, taking into account the limited availability of simulated and real data, the expressivity of high-level knowledge representations and the uncertainties of the underlying model. Specifically, this paper covers learning high-level knowledge from data for interpretable autonomous systems, guiding autonomous systems with high-level knowledge, and verifying and controlling autonomous systems against high-level specifications.
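As a rough illustration of the last point (verifying behavior against high-level specifications), the sketch below checks a sampled trajectory against a simple "always"-type temporal requirement via a robustness value, which is positive exactly when the requirement holds. The function, trace, and threshold are invented for illustration and are not taken from the paper.

```python
# Illustrative sketch: robustness of the specification
# "always, the distance to the obstacle stays above a threshold"
# evaluated over a finite, sampled trace.

def always_above(signal, threshold):
    """Robustness of G(signal > threshold); positive iff satisfied."""
    return min(s - threshold for s in signal)

# Hypothetical trace of obstacle distances along a trajectory.
distances = [2.4, 1.9, 1.7, 2.1, 2.6]
rho = always_above(distances, threshold=1.5)
print(f"robustness = {rho:.2f} -> {'satisfied' if rho > 0 else 'violated'}")
```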



2021 ◽  
Author(s):  
Matan Fintz ◽  
Margarita Osadchy ◽  
Uri Hertz

Deep neural network (DNN) models have the potential to provide new insights in the study of human decision making, due to their high capacity and data-driven design. While these models may be able to go beyond theory-driven models in predicting human behaviour, their opaque nature limits their ability to explain how an operation is carried out. This explainability problem remains unresolved. Here we demonstrate the use of a DNN model as an exploratory tool to identify predictable and consistent human behaviour in value-based decision making beyond the scope of theory-driven models. We then propose using theory-driven models to characterise the operation of the DNN model. We trained a DNN model to predict human decisions in a four-armed bandit task. We found that this model was more accurate than a reinforcement-learning reward-oriented model geared towards choosing the most rewarding option. This disparity in accuracy was more pronounced during times when the expected reward from all options was similar, i.e., when there was no unambiguously good option. To investigate this disparity, we introduced a reward-oblivious model, which was trained to predict human decisions without information about the rewards obtained from each option. This model captured decision-sequence patterns made by participants (e.g., a-b-c-d). In a series of experimental offline simulations of all models, we found that the general model was in line with a reward-oriented model's predictions when one option was clearly better than the others. However, when the options' expected rewards were similar to each other, it was in line with the reward-oblivious model's pattern-completion predictions. These results indicate the contribution of predictable but task-irrelevant decision patterns to human decisions, especially when task-relevant choices are not immediately apparent. Importantly, we demonstrate how theory-driven cognitive models can be used to characterise the operation of DNNs, making them a useful explanatory tool in scientific investigation.

Author Summary: Deep neural network (DNN) models are an extremely useful tool across multiple domains, and specifically for performing tasks that mimic and predict human behaviour. However, due to their opaque nature and high level of complexity, their ability to explain human behaviour is limited. Here we used DNN models to uncover hitherto overlooked aspects of human decision making, i.e., their reliance on predictable patterns for exploration. For this purpose, we trained a DNN model to predict human choices in a decision-making task. We then characterised this data-driven model using explicit, theory-driven cognitive models, in a set of offline experimental simulations. This relationship between explicit and data-driven approaches, where high-capacity models are used to explore beyond the scope of established models and theory-driven models are used to explain and characterise this new ground, makes DNN models a powerful scientific tool.
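For readers unfamiliar with the reward-oriented baseline the abstract refers to, the following is a minimal sketch of a softmax Q-learning model for a four-armed bandit. All parameter values and the simulated rewards are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Minimal reward-oriented bandit model: value learning via prediction
# errors, choices via a softmax over learned values.

rng = np.random.default_rng(0)
n_arms, n_trials = 4, 200
alpha, beta = 0.3, 5.0                 # learning rate, inverse temperature (illustrative)
true_means = rng.uniform(0, 1, n_arms)

q = np.zeros(n_arms)
for t in range(n_trials):
    p = np.exp(beta * q) / np.exp(beta * q).sum()   # softmax choice probabilities
    choice = rng.choice(n_arms, p=p)
    reward = rng.normal(true_means[choice], 0.1)
    q[choice] += alpha * (reward - q[choice])        # prediction-error update

print("learned values:", np.round(q, 2))
```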


2019 ◽  
Vol 0 (9/2019) ◽  
pp. 13-18
Author(s):  
Karol Antczak

The paper discusses regularization properties of artificial data for deep learning. Artificial datasets make it possible to train neural networks when real data are in short supply. It is demonstrated that the artificial data generation process, described as injecting noise into high-level features, bears several similarities to existing regularization methods for deep neural networks. One can treat this property of artificial data as a kind of "deep" regularization. It is thus possible to regularize hidden layers of the network by generating the training data in a certain way.
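A minimal sketch of the mechanism described here, assuming a PyTorch-style model: Gaussian noise is injected into the hidden ("high-level") features during training only, acting as a regularizer on the hidden layers. The architecture, layer sizes, and noise scale are illustrative, not the paper's.

```python
import torch
import torch.nn as nn

class NoisyFeatureNet(nn.Module):
    def __init__(self, sigma=0.1):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(784, 128), nn.ReLU())
        self.head = nn.Linear(128, 10)
        self.sigma = sigma

    def forward(self, x):
        h = self.encoder(x)                       # high-level features
        if self.training:                         # perturb only during training
            h = h + self.sigma * torch.randn_like(h)
        return self.head(h)

net = NoisyFeatureNet()
logits = net(torch.randn(32, 784))                # noisy features in train mode
net.eval()
clean_logits = net(torch.randn(32, 784))          # no noise at evaluation time
```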


Solar Energy ◽  
2021 ◽  
Vol 218 ◽  
pp. 48-56
Author(s):  
Max Pargmann ◽  
Daniel Maldonado Quinto ◽  
Peter Schwarzbözl ◽  
Robert Pitz-Paal

Author(s):  
Wael H. Awad ◽  
Bruce N. Janson

Three different modeling approaches were applied to explain truck accidents at interchanges in Washington State during a 27-month period. Three models were developed for each ramp type: linear regression, neural networks, and a hybrid system using fuzzy logic and neural networks. The study showed that linear regression was able to predict accident frequencies that fell within one standard deviation of the overall mean of the dependent variable. However, the coefficient of determination was very low in all cases. The other two artificial intelligence (AI) approaches showed a high level of performance in identifying different patterns of accidents in the training data and presented a better fit than the regression model. However, when predicting test data that were not included in the training process, these AI models produced unsatisfactory results.
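To make the statistical point concrete: predictions that hug the overall mean can stay within one standard deviation of it and still yield a near-zero coefficient of determination, since R² measures explained variance rather than closeness to the mean. The sketch below uses synthetic data, not the study's.

```python
import numpy as np

rng = np.random.default_rng(1)
y = rng.poisson(3.0, size=200).astype(float)        # synthetic accident frequencies
y_pred = y.mean() + rng.normal(0, 0.2, size=200)    # predictions hugging the mean

ss_res = ((y - y_pred) ** 2).sum()
ss_tot = ((y - y.mean()) ** 2).sum()
r2 = 1 - ss_res / ss_tot
print(f"R^2 = {r2:.3f}")   # close to zero: little variance explained
```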


Author(s):  
Vishal Babu Siramshetty ◽  
Dac-Trung Nguyen ◽  
Natalia J. Martinez ◽  
Anton Simeonov ◽  
Noel T. Southall ◽  
...  

The rise of novel artificial intelligence methods necessitates a comparison of this wave of new approaches with classical machine learning for a typical drug discovery project. Inhibition of the potassium ion channel, whose alpha subunit is encoded by human Ether-à-go-go-Related Gene (hERG), leads to prolonged QT interval of the cardiac action potential and is a significant safety pharmacology target for the development of new medicines. Several computational approaches have been employed to develop prediction models for assessment of hERG liabilities of small molecules, including recent work using deep learning methods. Here we perform a comprehensive comparison of prediction models based on classical (random forests and gradient boosting) and modern (deep neural networks and recurrent neural networks) artificial intelligence methods. The training set (~9000 compounds) was compiled by integrating hERG bioactivity data from the ChEMBL database with experimental data generated from an in-house, high-throughput thallium flux assay. We utilized different molecular descriptors including the latent descriptors, which are real-valued continuous vectors derived from chemical autoencoders trained on a large chemical space (> 1.5 million compounds). The models were prospectively validated on ~840 in-house compounds screened in the same thallium flux assay. The deep neural networks performed significantly better than the classical methods with the latent descriptors. The recurrent neural networks that operate on SMILES provided the highest model sensitivity. The best models were merged into a consensus model that offered superior performance compared to reference models from academic and commercial domains. Further, we shed light on the potential of artificial intelligence methods to exploit big data in chemistry and generate novel chemical representations useful in predictive modeling and tailoring new chemical space.
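As an illustration of the consensus step, the sketch below averages predicted probabilities from classical and neural models. The data, descriptors, and model settings are placeholders rather than the paper's actual pipeline.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 64))          # stand-in for molecular descriptors
y = rng.integers(0, 2, size=500)        # stand-in activity labels

models = [
    RandomForestClassifier(n_estimators=100, random_state=0),
    GradientBoostingClassifier(random_state=0),
    MLPClassifier(hidden_layer_sizes=(64,), max_iter=500, random_state=0),
]
for m in models:
    m.fit(X, y)

# Consensus: mean of the per-model probabilities of the active class.
consensus = np.mean([m.predict_proba(X)[:, 1] for m in models], axis=0)
print("first five consensus scores:", np.round(consensus[:5], 3))
```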


2021 ◽  
pp. 1-35
Author(s):  
Aaron R. Voelker ◽  
Peter Blouw ◽  
Xuan Choo ◽  
Nicole Sandra-Yaffa Dumont ◽  
Terrence C. Stewart ◽  
...  

While neural networks are highly effective at learning task-relevant representations from data, they typically do not learn representations with the kind of symbolic structure that is hypothesized to support high-level cognitive processes, nor do they naturally model such structures within problem domains that are continuous in space and time. To fill these gaps, this work exploits a method for defining vector representations that bind discrete (symbol-like) entities to points in continuous topological spaces in order to simulate and predict the behavior of a range of dynamical systems. These vector representations are spatial semantic pointers (SSPs), and we demonstrate that they can (1) be used to model dynamical systems involving multiple objects represented in a symbol-like manner and (2) be integrated with deep neural networks to predict the future of physical trajectories. These results help unify what have traditionally appeared to be disparate approaches in machine learning.
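A minimal sketch of an SSP-style encoding, assuming the standard construction (fractional binding of unitary base vectors via circular convolution in the Fourier domain); the dimensionality and the similarity check are illustrative, not from the paper.

```python
import numpy as np

d = 256
rng = np.random.default_rng(0)

def unitary_vector():
    # Random phases with conjugate symmetry give a real vector whose
    # Fourier coefficients all have magnitude 1 (unitary under binding).
    phases = rng.uniform(-np.pi, np.pi, d // 2 - 1)
    spectrum = np.concatenate(([0.0], phases, [0.0], -phases[::-1]))
    return np.fft.ifft(np.exp(1j * spectrum)).real

def power(v, exponent):
    # Fractional binding: exponentiate the Fourier coefficients.
    return np.fft.ifft(np.fft.fft(v) ** exponent).real

def bind(a, b):
    # Circular convolution as elementwise multiplication in Fourier space.
    return np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)).real

X, Y = unitary_vector(), unitary_vector()

def encode(x, y):
    # Encode the continuous point (x, y) as X^x (*) Y^y.
    return bind(power(X, x), power(Y, y))

p = encode(1.0, 2.0)
print(np.dot(p, encode(1.0, 2.1)))   # nearby point: similarity near 1
print(np.dot(p, encode(4.0, -3.0)))  # distant point: similarity near 0
```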


AI Magazine ◽  
2013 ◽  
Vol 34 (3) ◽  
pp. 93-98 ◽  
Author(s):  
Vita Markman ◽  
Georgi Stojanov ◽  
Bipin Indurkhya ◽  
Takashi Kido ◽  
Keiki Takadama ◽  
...  

The Association for the Advancement of Artificial Intelligence was pleased to present the AAAI 2013 Spring Symposium Series, held Monday through Wednesday, March 25-27, 2013. The titles of the eight symposia were Analyzing Microtext, Creativity and (Early) Cognitive Development, Data Driven Wellness: From Self-Tracking to Behavior Change, Designing Intelligent Robots: Reintegrating AI II, Lifelong Machine Learning, Shikakeology: Designing Triggers for Behavior Change, Trust and Autonomous Systems, and Weakly Supervised Learning from Multimedia. This report contains summaries of the symposia, written, in most cases, by the cochairs of the symposium.

