Fundamentals of Machine Learning

Published by Oxford University Press

ISBN: 9780198828044, 9780191883873

Author(s):  
Thomas P. Trappenberg

This chapter discusses the basic operation of artificial neural networks, the major paradigm of deep learning. The name derives from an analogy to a biological brain. The discussion begins by outlining the basic operations of neurons in the brain and how these operations are abstracted by simple neuron models. It then builds networks of artificial neurons, which constitute much of the recent success of AI. The focus of this chapter is on using such techniques; their theoretical embedding is considered in later chapters.
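
To make the neuron abstraction concrete, the following minimal sketch (in Python with NumPy, the environment the book itself uses) computes the output of a single artificial neuron as a nonlinear function of a weighted input sum. The weights, input values, and choice of a sigmoid nonlinearity are illustrative assumptions, not the book's own example:

    import numpy as np

    def sigmoid(u):
        """Logistic activation function."""
        return 1.0 / (1.0 + np.exp(-u))

    def neuron(x, w, b):
        """One artificial neuron: weighted sum of inputs plus bias,
        passed through a nonlinearity."""
        return sigmoid(np.dot(w, x) + b)

    x = np.array([0.5, -1.0, 2.0])   # example input vector
    w = np.array([0.8, 0.2, -0.5])   # example weights
    print(neuron(x, w, b=0.1))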


Author(s):  
Thomas P. Trappenberg

The discussion provides a refresher of probability theory, in particular the formulations that build the theoretical language of modern machine learning. Probability theory is the formalism of random numbers, and this chapter outlines what these are and how they are characterized by probability density or probability mass functions. It covers how such functions have traditionally been characterized and reviews how to work with such mathematical objects, including transforming density functions and measuring differences between density functions. Definitions and basic operations with multiple random variables, including Bayes' law, are covered. The chapter ends with an outline of some important approximation techniques, the so-called Monte Carlo methods.
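
As a small illustration of the Monte Carlo idea mentioned above, the following sketch estimates an expectation by averaging over random samples. The particular integrand (E[x^2] under a standard normal, whose true value is 1) and the sample size are arbitrary choices for illustration:

    import numpy as np

    # Monte Carlo estimate of E[x^2] for x ~ N(0, 1) by sample averaging.
    rng = np.random.default_rng(0)
    samples = rng.normal(loc=0.0, scale=1.0, size=100_000)
    estimate = np.mean(samples ** 2)
    print(f"Monte Carlo estimate of E[x^2]: {estimate:.4f}")  # close to 1.0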


Author(s):  
Thomas P. Trappenberg

This chapter offers a brief introduction to scientific programming with Python, with an emphasis on some mathematical operations that form the basis of many algorithms, specifically working with matrices and convolutions. Python is a high-level programming language, similar to Matlab and R, that has gained increasing popularity in the machine learning community. The main reason this book uses Python is that it is freely available and now provides considerable support for machine learning through packages such as sklearn and Keras, which are discussed and utilized in this book. Some familiarity with programming concepts is assumed; the chapter concentrates on introducing the specific environment and supporting libraries used throughout, as well as basic operations such as convolutions that will be important in later algorithms.
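
As a taste of the operations the chapter covers, here is a minimal one-dimensional convolution with NumPy. The signal and kernel values are arbitrary illustrations, not taken from the book:

    import numpy as np

    signal = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
    kernel = np.array([1.0, 0.0, -1.0])   # a simple difference filter

    # mode='same' keeps the output the same length as the input
    result = np.convolve(signal, kernel, mode='same')
    print(result)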


Author(s):  
Thomas P. Trappenberg

The concluding chapter is a brief venture into a more general discussion of machine learning, how it relates to artificial intelligence (AI), and the recent impact of this technology on society. It starts by discussing the relation of machine learning models to the brain and human intelligence. The discussion then moves to the relation between machine learning and AI. While the two are now often equated, it is useful to highlight some possible sources of misconceptions. The chapter closes with some brief thoughts on the impact of machine learning technology on our society.


Author(s):  
Thomas P. Trappenberg

This chapter discusses models with cyclic dependencies, of which two principal architectures are covered. The first comprises directed graphs similar to Bayesian networks, except that they include loops. Formally, such networks represent dynamical systems and therefore some form of temporal modeling. The second type of model has bi-directional connections between neurons. These networks are discussed in the context of stochastic units in the second half of the chapter.
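
To illustrate how a directed graph with loops becomes a dynamical system over time, here is a minimal sketch of a recurrent state update. The network sizes, random weights, and tanh nonlinearity are illustrative assumptions, not the book's own example:

    import numpy as np

    # A recurrent (cyclic) network as a dynamical system: the hidden
    # state h evolves over time, feeding back into itself.
    rng = np.random.default_rng(1)
    n_hidden, n_input = 4, 3
    W_rec = rng.normal(scale=0.5, size=(n_hidden, n_hidden))  # recurrent weights
    W_in = rng.normal(scale=0.5, size=(n_hidden, n_input))    # input weights

    h = np.zeros(n_hidden)                # initial hidden state
    for t in range(5):                    # unroll the loop over time steps
        x_t = rng.normal(size=n_input)    # example input at time t
        h = np.tanh(W_rec @ h + W_in @ x_t)
        print(f"t={t}, h={np.round(h, 3)}")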


Author(s):  
Thomas P. Trappenberg

This chapter presents an introduction to the important topic of building generative models. These are models that aim to capture the variety of a class, such as cars or trees. A generative model should be able to generate feature vectors for instances of the class it represents, and such models should therefore be able to characterize the class with all its variations. The subject is discussed both in a Bayesian and in a deep learning context, and within both supervised and unsupervised settings. This area is related to important algorithms such as k-means clustering, expectation maximization (EM), naïve Bayes, generative adversarial networks (GANs), and variational autoencoders (VAEs).
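
Of the algorithms listed, k-means clustering is perhaps the simplest to show in code: each cluster centre can be read as the prototype of a sub-class. The following sketch uses sklearn; the synthetic data and the choice of three clusters are illustrative assumptions:

    import numpy as np
    from sklearn.cluster import KMeans
    from sklearn.datasets import make_blobs

    # Cluster synthetic 2D data into three groups and inspect the centres.
    X, _ = make_blobs(n_samples=300, centers=3, random_state=0)
    kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
    print("cluster centres:\n", np.round(kmeans.cluster_centers_, 2))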


Author(s):  
Thomas P. Trappenberg

This chapter’s goal is to show how to apply machine learning algorithms in a general setting using some classic methods. In particular, it demonstrates how to apply three important machine learning algorithms: a support vector classifier (SVC), a random forest classifier (RFC), and a multilayer perceptron (MLP). While many of the methods studied later go beyond these now classic methods, this does not mean they are obsolete. The algorithms discussed here also provide a baseline against which to discuss advanced methods like probabilistic reasoning and deep learning. The aim is to demonstrate that applying machine learning methods with the help of machine learning libraries is not very difficult, and this offers an opportunity to discuss evaluation techniques that are very important in practice.
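
Since the chapter's point is that library-based application is straightforward, a minimal sketch of this workflow with sklearn might look as follows. The iris data, the train/test split, and all hyperparameters here are illustrative assumptions, not the book's own example:

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.svm import SVC
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.neural_network import MLPClassifier

    # Hold out 30% of the data to evaluate each classifier.
    X_train, X_test, y_train, y_test = train_test_split(
        *load_iris(return_X_y=True), test_size=0.3, random_state=0)

    # All three classifiers share sklearn's fit/score interface.
    for clf in (SVC(),
                RandomForestClassifier(n_estimators=100, random_state=0),
                MLPClassifier(max_iter=1000, random_state=0)):
        clf.fit(X_train, y_train)
        print(type(clf).__name__, "accuracy:", clf.score(X_test, y_test))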


Author(s):  
Thomas P. Trappenberg

The discussion here considers a much more common learning condition in which an agent, such as a human or a robot, has to learn to make decisions in an environment from simple feedback. Such feedback is provided only after periods of actions, in the form of reward or punishment, without detailing which of the actions contributed to the outcome. This type of learning scenario is called reinforcement learning. The learning problem is formalized as a Markov decision process, with a variety of related algorithms. The second part of the chapter uses neural networks as function approximators, an approach that has made recent progress as deep reinforcement learning.
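
To make the formalism concrete, here is a minimal sketch of tabular Q-learning, one classic algorithm for Markov decision processes. The toy chain environment (reward only at the final state) and all parameter values are illustrative assumptions, not the book's example:

    import numpy as np

    # States 0..4 on a chain; actions 0 = left, 1 = right;
    # reward 1 only on reaching the terminal state 4.
    n_states, n_actions = 5, 2
    Q = np.zeros((n_states, n_actions))
    alpha, gamma, eps = 0.1, 0.9, 0.3   # learning rate, discount, exploration
    rng = np.random.default_rng(0)

    for episode in range(300):
        s = 0
        while s != n_states - 1:
            # epsilon-greedy action selection
            if rng.random() < eps:
                a = int(rng.integers(n_actions))
            else:
                a = int(np.argmax(Q[s]))
            s_next = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
            r = 1.0 if s_next == n_states - 1 else 0.0
            # temporal-difference update toward reward plus
            # discounted best next value
            Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
            s = s_next

    print(np.round(Q, 2))   # the 'right' action should dominate in each state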


Author(s):  
Thomas P. Trappenberg

This chapter returns to the more theoretical embedding of machine learning in regression. Prior chapters have shown that writing machine learning programs is easy using high-level computer languages and good machine learning libraries. However, applying such algorithms appropriately with superior performance requires considerable experience and a deeper knowledge of the underlying ideas and algorithms. This chapter takes a step back to consider basic regression in more detail, which in turn forms the foundation for discussing probabilistic models in the following chapters. This includes the important discussion of gradient descent as a learning algorithm.
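
As a small illustration of gradient descent in its simplest setting, the following sketch fits a linear regression by repeatedly stepping down the gradient of the mean squared error. The synthetic data, learning rate, and iteration count are illustrative assumptions:

    import numpy as np

    # Synthetic data from y = 2x + 0.5 plus noise.
    rng = np.random.default_rng(0)
    x = rng.uniform(-1, 1, size=100)
    y = 2.0 * x + 0.5 + rng.normal(scale=0.1, size=100)

    w, b, lr = 0.0, 0.0, 0.1
    for step in range(500):
        y_hat = w * x + b
        # gradients of the mean squared error with respect to w and b
        grad_w = 2 * np.mean((y_hat - y) * x)
        grad_b = 2 * np.mean(y_hat - y)
        w -= lr * grad_w
        b -= lr * grad_b

    print(f"learned w={w:.2f}, b={b:.2f}")  # close to 2.0 and 0.5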


Author(s):  
Thomas P. Trappenberg

This chapter provides a high-level overview of machine learning, in particular how it relates to building models from data. It starts by placing the basic concept in its historical context and phrases the learning problem in simple mathematical terms as function approximation, as well as in a probabilistic context. In contrast to more traditional models, machine learning can be characterized as non-linear regression in high-dimensional spaces. The chapter points out how diverse subareas such as deep learning and Bayesian networks fit into the scheme of things and motivates further study with some examples of recent progress.
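
As a tiny illustration of learning as function approximation, the following sketch fits a noisy sine curve with a small neural network regressor from sklearn. The data, network size, and training settings are illustrative assumptions, not the book's own example:

    import numpy as np
    from sklearn.neural_network import MLPRegressor

    # Learn an unknown function (here a sine curve) from noisy samples.
    rng = np.random.default_rng(0)
    X = rng.uniform(-3, 3, size=(200, 1))
    y = np.sin(X).ravel() + rng.normal(scale=0.1, size=200)

    model = MLPRegressor(hidden_layer_sizes=(20, 20), max_iter=5000,
                         random_state=0).fit(X, y)
    print("prediction at x=1.0:", model.predict([[1.0]]))  # near sin(1) ≈ 0.84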

