Key Concepts in AI Safety: Specification in Machine Learning

Mapping Intimacies ◽

10.51593/20210031 ◽

2021 ◽

Author(s):

Tim G. J. Rduner ◽

◽

Helen Toner

Keyword(s):

Machine Learning ◽

Learning Systems ◽

Safety Issues ◽

Key Concepts ◽

Safety Specification ◽

Learning Research ◽

Modern Machine ◽

Ai Safety

This paper is the fourth installment in a series on “AI safety,” an area of machine learning research that aims to identify causes of unintended behavior in machine learning systems and develop tools to ensure these systems work safely and reliably. The first paper in the series, “Key Concepts in AI Safety: An Overview,” outlined three categories of AI safety issues—problems of robustness, assurance, and specification—and the subsequent two papers described problems of robustness and assurance, respectively. This paper introduces specification as a key element in designing modern machine learning systems that operate as intended.

Download Full-text

Key Concepts in AI Safety: Robustness and Adversarial Examples

10.51593/20190041 ◽

2021 ◽

Author(s):

Tim Rudner ◽

Helen Toner

Keyword(s):

Machine Learning ◽

Learning Systems ◽

Safety Issues ◽

Key Concepts ◽

Adversarial Examples ◽

Learning Research ◽

Modern Machine ◽

Ai Safety

This paper is the second installment in a series on “AI safety,” an area of machine learning research that aims to identify causes of unintended behavior in machine learning systems and develop tools to ensure these systems work safely and reliably. The first paper in the series, “Key Concepts in AI Safety: An Overview,” described three categories of AI safety issues: problems of robustness, assurance, and specification. This paper introduces adversarial examples, a major challenge to robustness in modern machine learning systems.

Download Full-text

Key Concepts in AI Safety: Interpretability in Machine Learning

10.51593/20190042 ◽

2021 ◽

Author(s):

Tim Rudner ◽

Helen Toner

Keyword(s):

Machine Learning ◽

Learning Systems ◽

Safety Issues ◽

The Third ◽

Key Concepts ◽

Learning Research ◽

Modern Machine ◽

Ai Safety

This paper is the third installment in a series on “AI safety,” an area of machine learning research that aims to identify causes of unintended behavior in machine learning systems and develop tools to ensure these systems work safely and reliably. The first paper in the series, “Key Concepts in AI Safety: An Overview,” described three categories of AI safety issues: problems of robustness, assurance, and specification. This paper introduces interpretability as a means to enable assurance in modern machine learning systems.

Download Full-text

Key Concepts in AI Safety: An Overview

10.51593/20190040 ◽

2021 ◽

Author(s):

Tim Rudner ◽

Helen Toner

Keyword(s):

Machine Learning ◽

Learning Systems ◽

Safety Issues ◽

Key Concepts ◽

Learning Research ◽

Ai Safety

This paper is the first installment in a series on “AI safety,” an area of machine learning research that aims to identify causes of unintended behavior in machine learning systems and develop tools to ensure these systems work safely and reliably. In it, the authors introduce three categories of AI safety issues: problems of robustness, assurance, and specification. Other papers in this series elaborate on these and further key concepts.

Download Full-text

Machine Learning with Neural Networks

10.1017/9781108860604 ◽

2021 ◽

Author(s):

Bernhard Mehlig

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Statistical Physics ◽

State Of The Art ◽

Science And Engineering ◽

Key Concepts ◽

Learning Research ◽

Programming Skills ◽

Key Aspects ◽

Modern Machine

This modern and self-contained book offers a clear and accessible introduction to the important topic of machine learning with neural networks. In addition to describing the mathematical principles of the topic, and its historical evolution, strong connections are drawn with underlying methods from statistical physics and current applications within science and engineering. Closely based around a well-established undergraduate course, this pedagogical text provides a solid understanding of the key aspects of modern machine learning with artificial neural networks, for students in physics, mathematics, and engineering. Numerous exercises expand and reinforce key concepts within the book and allow students to hone their programming skills. Frequent references to current research develop a detailed perspective on the state-of-the-art in machine learning research.

Download Full-text

AI Accidents: An Emerging Threat

10.51593/20200072 ◽

2021 ◽

Author(s):

Zachary Arnold ◽

◽

Helen Toner

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Learning Systems ◽

Wide Range ◽

Policy Suggestions ◽

The Future ◽

Modern Machine ◽

Artificial Intelligence Systems

As modern machine learning systems become more widely used, the potential costs of malfunctions grow. This policy brief describes how trends we already see today—both in newly deployed artificial intelligence systems and in older technologies—show how damaging the AI accidents of the future could be. It describes a wide range of hypothetical but realistic scenarios to illustrate the risks of AI accidents and offers concrete policy suggestions to reduce these risks.

Download Full-text

Interacting with an Inferred World: The Challenge of Machine Learning for Humane Computer Interaction

Aarhus Series on Human Centered Computing ◽

10.7146/aahcc.v1i1.21197 ◽

2015 ◽

Vol 1 (1) ◽

pp. 12 ◽

Cited By ~ 11

Author(s):

Alan F. Blackwell

Keyword(s):

Machine Learning ◽

Statistical Models ◽

Critical Evaluation ◽

User Interaction ◽

Interactive Systems ◽

Learning Systems ◽

The World ◽

Cognitive Theories ◽

New Generation ◽

Modern Machine

<div class="page" title="Page 1"><div class="layoutArea"><div class="column"><p><span>Classic theories of user interaction have been framed in relation to symbolic models of planning and problem solving, responding in part to the cognitive theories associated with AI research. However, the behavior of modern machine-learning systems is determined by statistical models of the world rather than explicit symbolic descriptions. Users increasingly interact with the world and with others in ways that are mediated by such models. This paper explores the way in which this new generation of technology raises fresh challenges for the critical evaluation of interactive systems. It closes with some proposed measures for the design of inference-based systems that are more open to humane design and use. </span></p></div></div></div>

Download Full-text

Platform for Analysing and Encouraging Student Activity on Contest and E-learning Systems

OLYMPIADS IN INFORMATICS ◽

10.15388/ioi.2018.07 ◽

2018 ◽

Vol 12 ◽

pp. 85-98

Author(s):

Bojan Kostadinov ◽

Mile Jovanov ◽

Emil STANKOV

Keyword(s):

Machine Learning ◽

Data Collection ◽

Educational Policy ◽

Learning Systems ◽

Data Sources ◽

Or Education ◽

Student Activity ◽

The World ◽

E Learning ◽

Analyse Data

Data collection and machine learning are changing the world. Whether it is medicine, sports or education, companies and institutions are investing a lot of time and money in systems that gather, process and analyse data. Likewise, to improve competitiveness, a lot of countries are making changes to their educational policy by supporting STEM disciplines. Therefore, it’s important to put effort into using various data sources to help students succeed in STEM. In this paper, we present a platform that can analyse student’s activity on various contest and e-learning systems, combine and process the data, and then present it in various ways that are easy to understand. This in turn enables teachers and organizers to recognize talented and hardworking students, identify issues, and/or motivate students to practice and work on areas where they’re weaker.

Download Full-text

A Comparative Study on Contemporary Intrusion Detection Datasets for Machine Learning Research

2020 IEEE International Conference on Intelligence and Security Informatics (ISI) ◽

10.1109/isi49825.2020.9280519 ◽

2020 ◽

Author(s):

Smirti Dwibedi ◽

Medha Pujari ◽

Weiqing Sun

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Comparative Study ◽

Learning Research

Download Full-text

A Review of Recent Deep Learning Approaches in Human-Centered Machine Learning

Sensors ◽

10.3390/s21072514 ◽

2021 ◽

Vol 21 (7) ◽

pp. 2514

Author(s):

Tharindu Kaluarachchi ◽

Andrew Reis ◽

Suranga Nanayakkara

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Review Paper ◽

Learning Systems ◽

Learning Approaches ◽

Application Development ◽

Research Gaps ◽

Domain Experts ◽

Working Definition ◽

Real World Application

After Deep Learning (DL) regained popularity recently, the Artificial Intelligence (AI) or Machine Learning (ML) field is undergoing rapid growth concerning research and real-world application development. Deep Learning has generated complexities in algorithms, and researchers and users have raised concerns regarding the usability and adoptability of Deep Learning systems. These concerns, coupled with the increasing human-AI interactions, have created the emerging field that is Human-Centered Machine Learning (HCML). We present this review paper as an overview and analysis of existing work in HCML related to DL. Firstly, we collaborated with field domain experts to develop a working definition for HCML. Secondly, through a systematic literature review, we analyze and classify 162 publications that fall within HCML. Our classification is based on aspects including contribution type, application area, and focused human categories. Finally, we analyze the topology of the HCML landscape by identifying research gaps, highlighting conflicting interpretations, addressing current challenges, and presenting future HCML research opportunities.

Download Full-text

Explainable AI: A Review of Machine Learning Interpretability Methods

Entropy ◽

10.3390/e23010018 ◽

2020 ◽

Vol 23 (1) ◽

pp. 18

Author(s):

Pantelis Linardatos ◽

Vasilis Papastefanopoulos ◽

Sotiris Kotsiantis

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Black Box ◽

Learning Systems ◽

Model Complexity ◽

Learning Models ◽

New Methods ◽

Industrial Adoption ◽

Machine Learning Models ◽

The Way

Recent advances in artificial intelligence (AI) have led to its widespread industrial adoption, with machine learning systems demonstrating superhuman performance in a significant number of tasks. However, this surge in performance, has often been achieved through increased model complexity, turning such systems into “black box” approaches and causing uncertainty regarding the way they operate and, ultimately, the way that they come to decisions. This ambiguity has made it problematic for machine learning systems to be adopted in sensitive yet critical domains, where their value could be immense, such as healthcare. As a result, scientific interest in the field of Explainable Artificial Intelligence (XAI), a field that is concerned with the development of new methods that explain and interpret machine learning models, has been tremendously reignited over recent years. This study focuses on machine learning interpretability methods; more specifically, a literature review and taxonomy of these methods are presented, as well as links to their programming implementations, in the hope that this survey would serve as a reference point for both theorists and practitioners.

Download Full-text