Why Use Automated Machine Learning?

Automated Machine Learning for Business ◽

10.1093/oso/9780190941659.003.0001 ◽

2021 ◽

pp. 1-22

Author(s):

Kai R. Larsen ◽

Daniel S. Becker

Keyword(s):

Machine Learning ◽

Human Resources ◽

Customer Service ◽

Resource Availability ◽

Ease Of Use ◽

Algorithm Selection ◽

College Dropout ◽

Exploratory Data ◽

Automated Machine Learning ◽

Marketing Operations

Machine learning is involved in search, translation, detecting depression, likelihood of college dropout, finding lost children, and to sell all kinds of products. While barely beyond its inception, the current machine learning revolution will affect people and organizations no less than the Industrial Revolution’s effect on weavers and many other skilled laborers. Machine learning will automate hundreds of millions of jobs that were considered too complex for machines ever to take over even a decade ago, including driving, flying, painting, programming, and customer service, as well as many of the jobs previously reserved for humans in the fields of finance, marketing, operations, accounting, and human resources. This section explains how automated machine learning addresses exploratory data analysis, feature engineering, algorithm selection, hyperparameter tuning, and model diagnostics. The section covers the eight criteria considered essential for AutoML to have significant impact: accuracy, productivity, ease of use, understanding and learning, resource availability, process transparency, generalization, and recommended actions.

Download Full-text

Auto-CaseRec: Automatically Selecting and Optimizing Recommendation-Systems Algorithms

10.31219/osf.io/4znmd ◽

2020 ◽

Author(s):

Srijan Gupta ◽

Joeran Beel

Keyword(s):

Machine Learning ◽

Recommender Systems ◽

Recommender System ◽

Parameter Tuning ◽

Machine Learning Algorithms ◽

Algorithm Selection ◽

Automated Algorithm ◽

Recommendation Algorithms ◽

Automated Machine Learning ◽

Human Effort

The advances in the ﬁeld of Automated Machine Learning (AutoML) have greatly reduced human effort in selecting and optimizing machine learning algorithms. These advances, however, have not yet widely made it to Recommender-Systems libraries. We introduce Auto-CaseRec, a Python framework based on the CaseRec recommender-system library. Auto-CaseRec provides automated algorithm selection and parameter tuning for recommendation algorithms. An initial evaluation of Auto-CaseRec against the baselines shows an average 13.88% improvement in RMSE for theMovielens100K dataset and an average 17.95% improvement in RMSE for the Last.fm dataset.

Download Full-text

MultiETSC: automated machine learning for early time series classification

Data Mining and Knowledge Discovery ◽

10.1007/s10618-021-00781-5 ◽

2021 ◽

Author(s):

Gilles Ottervanger ◽

Mitra Baratchi ◽

Holger H. Hoos

Keyword(s):

Machine Learning ◽

Time Series ◽

Optimal Algorithm ◽

Empirical Evaluation ◽

Early Time ◽

Time Series Classification ◽

Algorithm Selection ◽

Trade Off ◽

Conflicting Objectives ◽

Automated Machine Learning

AbstractEarly time series classification (EarlyTSC) involves the prediction of a class label based on partial observation of a given time series. Most EarlyTSC algorithms consider the trade-off between accuracy and earliness as two competing objectives, using a single dedicated hyperparameter. To obtain insights into this trade-off requires finding a set of non-dominated (Pareto efficient) classifiers. So far, this has been approached through manual hyperparameter tuning. Since the trade-off hyperparameters only provide indirect control over the earliness-accuracy trade-off, manual tuning is tedious and tends to result in many sub-optimal hyperparameter settings. This complicates the search for optimal hyperparameter settings and forms a hurdle for the application of EarlyTSC to real-world problems. To address these issues, we propose an automated approach to hyperparameter tuning and algorithm selection for EarlyTSC, building on developments in the fast-moving research area known as automated machine learning (AutoML). To deal with the challenging task of optimising two conflicting objectives in early time series classification, we propose MultiETSC, a system for multi-objective algorithm selection and hyperparameter optimisation (MO-CASH) for EarlyTSC. MultiETSC can potentially leverage any existing or future EarlyTSC algorithm and produces a set of Pareto optimal algorithm configurations from which a user can choose a posteriori. As an additional benefit, our proposed framework can incorporate and leverage time-series classification algorithms not originally designed for EarlyTSC for improving performance on EarlyTSC; we demonstrate this property using a newly defined, “naïve” fixed-time algorithm. In an extensive empirical evaluation of our new approach on a benchmark of 115 data sets, we show that MultiETSC performs substantially better than baseline methods, ranking highest (avg. rank 1.98) compared to conceptually simpler single-algorithm (2.98) and single-objective alternatives (4.36).

Download Full-text

Bandit-Based Automated Machine Learning

2018 7th Brazilian Conference on Intelligent Systems (BRACIS) ◽

10.1109/bracis.2018.00029 ◽

2018 ◽

Author(s):

Silvia Cristina Nunes das Dores ◽

Carlos Soares ◽

Duncan Ruiz

Keyword(s):

Machine Learning ◽

Automated Machine Learning

Download Full-text

A Robust Automated Machine Learning System with Pseudoinverse Learning

Cognitive Computation ◽

10.1007/s12559-021-09853-6 ◽

2021 ◽

Author(s):

Ke Wang ◽

Ping Guo

Keyword(s):

Machine Learning ◽

Learning System ◽

Automated Machine Learning

Download Full-text

Automated machine learning to predict the co‐occurrence of isocitrate dehydrogenase mutations and O 6 ‐methylguanine‐DNA methyltransferase promoter methylation in patients with gliomas

Journal of Magnetic Resonance Imaging ◽

10.1002/jmri.27498 ◽

2021 ◽

Author(s):

Simin Zhang ◽

Huaiqiang Sun ◽

Xiaorui Su ◽

Xibiao Yang ◽

Weina Wang ◽

...

Keyword(s):

Machine Learning ◽

Promoter Methylation ◽

Isocitrate Dehydrogenase ◽

Dna Methyltransferase ◽

Methylguanine Dna Methyltransferase ◽

Automated Machine Learning

Download Full-text

Soil Sensors Based Prediction System for Plant Diseases using Exploratory Data Analysis and Machine Learning

IEEE Sensors Journal ◽

10.1109/jsen.2020.3046295 ◽

2020 ◽

pp. 1-1

Author(s):

Manish Kumar ◽

Ahlad Kumar ◽

Vinay S. Palaparthy

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Exploratory Data Analysis ◽

Plant Diseases ◽

Prediction System ◽

Soil Sensors ◽

Exploratory Data

Download Full-text

Automated Machine-Learning Radiotherapy Planning for Pediatric and Adult Brain Tumours

Journal of Medical Imaging and Radiation Sciences ◽

10.1016/j.jmir.2021.03.005 ◽

2021 ◽

Vol 52 (2) ◽

pp. S3

Author(s):

Grace Tsui ◽

Derek S. Tsang ◽

Chris McIntosh ◽

Thomas G. Purdie ◽

Glenn Bauman ◽

...

Keyword(s):

Machine Learning ◽

Brain Tumours ◽

Adult Brain ◽

Radiotherapy Planning ◽

Automated Machine Learning

Download Full-text

Testing the Suitability of Automated Machine Learning for Weeds Identification

AI ◽

10.3390/ai2010004 ◽

2021 ◽

Vol 2 (1) ◽

pp. 34-47

Author(s):

Borja Espejo-Garcia ◽

Ioannis Malounas ◽

Eleanna Vali ◽

Spyros Fountas

Keyword(s):

Machine Learning ◽

Plant Protection ◽

Crop Protection ◽

Identification Problem ◽

Learning System ◽

Classifier Ensembles ◽

Automated Machine Learning ◽

A New Technique ◽

Plant Seedlings ◽

And Training

In the past years, several machine-learning-based techniques have arisen for providing effective crop protection. For instance, deep neural networks have been used to identify different types of weeds under different real-world conditions. However, these techniques usually require extensive involvement of experts working iteratively in the development of the most suitable machine learning system. To support this task and save resources, a new technique called Automated Machine Learning has started being studied. In this work, a complete open-source Automated Machine Learning system was evaluated with two different datasets, (i) The Early Crop Weeds dataset and (ii) the Plant Seedlings dataset, covering the weeds identification problem. Different configurations, such as the use of plant segmentation, the use of classifier ensembles instead of Softmax and training with noisy data, have been compared. The results showed promising performances of 93.8% and 90.74% F1 score depending on the dataset used. These performances were aligned with other related works in AutoML, but they are far from machine-learning-based systems manually fine-tuned by human experts. From these results, it can be concluded that finding a balance between manual expert work and Automated Machine Learning will be an interesting path to work in order to increase the efficiency in plant protection.

Download Full-text