New Machine Learning Developments in ROOT/TMVA

2019 ◽  
Vol 214 ◽  
pp. 06014
Author(s):  
Kim Albertsson ◽  
Sergei Gleyzer ◽  
Marc Huwiler ◽  
Vladimir Ilievski ◽  
Lorenzo Moneta ◽  
...  

The Toolkit for Multivariate Analysis (TMVA), the machine learning package integrated into the ROOT data analysis framework, has recently seen improvements to its deep learning module, parallelisation of multivariate methods, and cross validation. Performance benchmarks on datasets from high-energy physics are presented, with a particular focus on the new deep learning module, which contains robust fully-connected, convolutional and recurrent deep neural networks implemented on CPU and GPU architectures. Both dense and convolutional layers are shown to be competitive on small-scale networks suitable for high-level physics analyses, in both training and single-event evaluation. Parallelisation efforts show an asymptotic 3-fold reduction in boosted decision tree training time, while the cross validation implementation shows significant speed-up with parallel fold evaluation.
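The parallel fold evaluation described above can be sketched in plain Python. This is not the TMVA/ROOT API: the threshold-stump classifier and the toy dataset are invented stand-ins, and the point is only that the k folds are independent and can be trained and scored concurrently.

```python
# Sketch of k-fold cross validation with parallel fold evaluation,
# analogous to the scheme described for TMVA. Classifier and data are toys.
from concurrent.futures import ThreadPoolExecutor

def make_folds(data, k):
    """Split data into k contiguous folds."""
    n = len(data)
    return [data[i * n // k:(i + 1) * n // k] for i in range(k)]

def train_stump(train):
    """Fit a 1-D threshold classifier: predict 1 when x >= threshold."""
    pos = [x for x, y in train if y == 1]
    neg = [x for x, y in train if y == 0]
    return (min(pos) + max(neg)) / 2.0  # midpoint between the two classes

def evaluate_fold(folds, i):
    """Train on all folds except i, return accuracy on fold i."""
    test = folds[i]
    train = [p for j, f in enumerate(folds) for p in f if j != i]
    thr = train_stump(train)
    correct = sum(1 for x, y in test if (x >= thr) == (y == 1))
    return correct / len(test)

# Toy separable dataset: label 1 for x >= 5.
data = [(x, int(x >= 5)) for x in range(10)]
folds = make_folds(data, 5)

# Evaluate all folds concurrently, as in TMVA's parallel fold evaluation.
with ThreadPoolExecutor() as pool:
    scores = list(pool.map(lambda i: evaluate_fold(folds, i), range(5)))
print(scores)  # one accuracy per fold
```

Because each fold's training and evaluation touches only its own data, the folds map cleanly onto a thread or process pool, which is what yields the speed-up reported above.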

2018 ◽  
Vol 68 (1) ◽  
pp. 161-181 ◽  
Author(s):  
Dan Guest ◽  
Kyle Cranmer ◽  
Daniel Whiteson

Machine learning has played an important role in the analysis of high-energy physics data for decades. The emergence of deep learning in 2012 allowed for machine learning tools which could adeptly handle higher-dimensional and more complex problems than previously feasible. This review is aimed at the reader who is familiar with high-energy physics but not machine learning. The connections between machine learning and high-energy physics data analysis are explored, followed by an introduction to the core concepts of neural networks, examples of the key results demonstrating the power of deep learning for analysis of LHC data, and discussion of future prospects and concerns.


2021 ◽  
Vol 5 (1) ◽  
Author(s):  
Koji Terashi ◽  
Michiru Kaneda ◽  
Tomoe Kishimoto ◽  
Masahiko Saito ◽  
Ryu Sawada ◽  
...  

Abstract We present studies of quantum algorithms exploiting machine learning to classify events of interest from background events, one of the most representative machine learning applications in high-energy physics. We focus on a variational quantum approach to learning the properties of input data and evaluate the performance of the event classification using both simulators and quantum computing devices. Comparison with standard multivariate classification techniques based on a boosted decision tree and a deep neural network run on classical computers shows that the quantum algorithm has performance comparable with the standard techniques over the considered ranges of the number of input variables and the size of the training samples. The variational quantum algorithm is tested on quantum computers, demonstrating that the discrimination of interesting events from background is feasible. Characteristic behaviours observed during the learning process using quantum circuits with extended gate structures are discussed, as are the implications of the current performance for applications in high-energy physics experiments.
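The variational idea can be illustrated with a deliberately minimal toy: a one-qubit "circuit" encodes a feature x as a rotation RY(x), applies a trainable rotation RY(theta), and measures the probability of |1⟩, which in this case has the closed form sin²((x+θ)/2). The dataset and the grid-search optimiser below are invented for illustration; real studies like the one above use multi-qubit parameterised circuits and gradient-based optimisers.

```python
# Minimal sketch of a variational quantum classifier, simulated in closed
# form: P(|1>) after RY(x) then RY(theta) on |0> is sin^2((x+theta)/2).
import math

def p_one(x, theta):
    """Probability of measuring |1> after RY(x) followed by RY(theta)."""
    return math.sin((x + theta) / 2.0) ** 2

def loss(theta, data):
    """Squared error between the measured probability and the 0/1 label."""
    return sum((p_one(x, theta) - y) ** 2 for x, y in data)

# Toy events: background (y=0) near x ~ 0, signal (y=1) near x ~ pi.
data = [(0.1, 0), (0.2, 0), (-0.15, 0), (3.0, 1), (3.1, 1), (2.9, 1)]

# "Variational" step: classically optimise theta, here by coarse grid
# search standing in for the gradient-based optimisers used in practice.
thetas = [i * 0.01 for i in range(-314, 315)]
best = min(thetas, key=lambda t: loss(t, data))

preds = [int(p_one(x, best) > 0.5) for x, _ in data]
print(preds)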


2020 ◽  
Vol 245 ◽  
pp. 07036
Author(s):  
Christoph Beyer ◽  
Stefan Bujack ◽  
Stefan Dietrich ◽  
Thomas Finnern ◽  
Martin Flemming ◽  
...  

DESY is one of the largest accelerator laboratories in Europe. It develops and operates state-of-the-art accelerators for fundamental science in the areas of high-energy physics, photon science and accelerator development. While for decades high-energy physics (HEP) was the most prominent user of the DESY compute, storage and network infrastructure, other scientific areas such as photon science and accelerator development have caught up and now dominate the demands on DESY's infrastructure resources, with significant consequences for IT resource provisioning. In this contribution, we present an overview of the computational, storage and network resources serving the various physics communities on site, ranging from high-throughput computing (HTC) batch-like offline processing in the Grid and the interactive user analysis resources in the National Analysis Factory (NAF) for the HEP community, to the computing needs of accelerator development and of photon-science facilities such as PETRA III or the European XFEL. Since DESY is involved in these experiments and their data taking, their requirements include fast, low-latency online processing for data taking and calibration as well as offline processing, i.e. high-performance computing (HPC) workloads, which run on the dedicated Maxwell HPC cluster. As all communities face significant challenges from changing environments and increasing data rates in the coming years, we discuss how this will be reflected in necessary changes to the computing and storage infrastructures. We present DESY's compute cloud and container orchestration plans as a basis for infrastructure and platform services, and show examples of Jupyter notebooks for small-scale interactive analysis as well as their integration into large-scale resources such as batch systems or Spark clusters.
To overcome the fragmentation of the various resources for all scientific communities at DESY, we explore how to integrate them into a seamless user experience in an Interdisciplinary Data Analysis Facility.


Author(s):  
Bhanu Chander

Artificial intelligence (AI) refers to machines that can perform tasks a human can and, ideally, produce better results; in other words, AI systems derive solutions from data on their own. Within AI, machine learning (ML) comprises a wide variety of algorithms that produce increasingly accurate results. As technology improves, ever larger amounts of data become available, but with classical ML it is difficult to extract high-level, abstract features from raw data, and it is hard to know which features should be extracted. Deep learning addresses this: its algorithms are modelled on how the human brain processes data. Deep learning is a particular kind of machine learning that provides flexibility and great power by learning multiple levels of representation through the operations of multiple layers. A brief overview of deep learning, its platforms, models, autoencoders, CNNs, RNNs, and applications is given. Deep learning is likely to see many more successes in the near future because it requires very little engineering by hand.
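"Multiple levels of representation" can be shown with the smallest possible example: a two-layer network computes XOR, which no single linear-threshold layer can. The weights below are hand-set for illustration rather than learned, which is exactly the manual engineering deep learning automates.

```python
# Two-layer network computing XOR: layer 1 builds intermediate features
# (OR and NAND of the inputs), layer 2 combines them. Weights are hand-set.
def step(z):
    return 1 if z > 0 else 0

def neuron(inputs, weights, bias):
    return step(sum(w * x for w, x in zip(weights, inputs)) + bias)

def xor_net(x1, x2):
    h_or   = neuron([x1, x2], [1, 1], -0.5)    # feature: x1 OR x2
    h_nand = neuron([x1, x2], [-1, -1], 1.5)   # feature: NOT (x1 AND x2)
    return neuron([h_or, h_nand], [1, 1], -1.5)  # AND of the two features

print([xor_net(a, b) for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]])
# → [0, 1, 1, 0]
```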


2019 ◽  
Vol 10 (1) ◽  
Author(s):  
Primož Godec ◽  
Matjaž Pančur ◽  
Nejc Ilenič ◽  
Andrej Čopar ◽  
Martin Stražar ◽  
...  

Abstract Analysis of biomedical images requires computational expertise that is uncommon among biomedical scientists. Deep learning approaches to image analysis provide an opportunity to develop user-friendly tools for exploratory data analysis. Here, we use the visual programming toolbox Orange (http://orange.biolab.si) to simplify image analysis by integrating deep-learning embedding, machine learning procedures, and data visualization. Orange supports the construction of data analysis workflows by assembling components for data preprocessing, visualization, and modeling. We equipped Orange with components that use pre-trained deep convolutional networks to profile images with vectors of features. These vectors are used in image clustering and classification in a framework that enables mining of image sets by both novel and experienced users. We demonstrate the utility of the tool in image analysis of progenitor cells in mouse bone healing, identification of developmental competence in mouse oocytes, subcellular protein localization in yeast, and developmental morphology of social amoebae.
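The embed-then-cluster idea behind these components can be sketched in plain Python. This is not Orange's API: the 2-D vectors below are invented stand-ins for the deep-network embeddings, and the clustering step is a minimal k-means.

```python
# Sketch: images profiled as feature vectors, then grouped by k-means.
import math

def kmeans(points, centers, iters=10):
    """Minimal k-means: alternate assignment and centre update."""
    for _ in range(iters):
        # Assign each vector to its nearest centre.
        groups = [[] for _ in centers]
        for p in points:
            i = min(range(len(centers)),
                    key=lambda i: math.dist(p, centers[i]))
            groups[i].append(p)
        # Move each centre to the mean of its assigned vectors.
        centers = [tuple(sum(c) / len(g) for c in zip(*g)) for g in groups]
    return centers, groups

# Pretend embeddings of two visually distinct sets of images.
vectors = [(0.1, 0.2), (0.0, 0.1), (0.2, 0.0),
           (5.0, 5.1), (4.9, 5.0), (5.1, 4.8)]
centers, groups = kmeans(vectors, [(0.0, 0.0), (1.0, 1.0)])
print([len(g) for g in groups])
```

Because the heavy lifting (the embedding) is done once by a pre-trained network, the downstream clustering and classification stay cheap and interactive, which is what makes the workflow usable by non-programmers.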


2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Yu Zhang ◽  
Yahui Long ◽  
Chee Keong Kwoh

Abstract Background Long non-coding RNAs (lncRNAs) can exert functions by forming triplexes with DNA. Current methods for predicting triplex formation mainly rely on mathematical statistics based on the base-pairing rules. However, these methods have two main limitations: (1) they identify a large number of triplex-forming lncRNAs, but the limited number of experimentally verified triplex-forming lncRNAs suggests that not all of them may form triplexes in practice, and (2) their predictions consider only the theoretical relationship and lack features derived from experimentally verified data. Results In this work, we develop an integrated program named TriplexFPP (Triplex Forming Potential Prediction), which is the first machine learning model for DNA:RNA triplex prediction. TriplexFPP predicts the most likely triplex-forming lncRNAs and DNA sites based on experimentally verified data, with high-level features learned by convolutional neural networks. In fivefold cross validation, the average areas under the ROC and PRC curves for the redundancy-removed triplex-forming lncRNA dataset with threshold 0.8 are 0.9649 and 0.9996, and the corresponding values for triplex DNA site prediction are 0.8705 and 0.9671, respectively. In addition, we briefly summarize the cis and trans targeting of triplex-forming lncRNAs. Conclusions TriplexFPP is able to predict the most likely triplex-forming lncRNAs among all lncRNAs with computationally defined triplex-forming capacities, as well as the potential of a DNA site to form a triplex. It may provide insights for the exploration of lncRNA functions.
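How a convolutional model sees a sequence can be sketched as follows: one-hot encode the bases, then slide a 1-D filter over the encoding. The sequence and filter weights below are invented for illustration; TriplexFPP's actual filters are learned from the verified data.

```python
# Sketch: one-hot encoding of an RNA sequence plus a "valid" 1-D
# convolution, the basic input pipeline of sequence CNNs.
BASES = "ACGU"

def one_hot(seq):
    """Encode each base as a length-4 indicator vector."""
    return [[1 if b == base else 0 for base in BASES] for b in seq]

def conv1d(encoded, kernel):
    """Dot each window of base vectors with the kernel (one row each)."""
    k = len(kernel)
    out = []
    for i in range(len(encoded) - k + 1):
        window = encoded[i:i + k]
        out.append(sum(w * x for row_w, row_x in zip(kernel, window)
                       for w, x in zip(row_w, row_x)))
    return out

seq = "GAGGAAGGAA"   # toy purine-rich sequence
enc = one_hot(seq)
# A toy filter that responds to the motif "GGA" (rows for G, G, A).
kernel = [[0, 0, 1, 0], [0, 0, 1, 0], [1, 0, 0, 0]]
print(conv1d(enc, kernel))  # peaks where "GGA" occurs
```

A learned model stacks many such filters and lets training discover which motifs are predictive, rather than hand-coding them as done here.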


Diagnostics ◽  
2019 ◽  
Vol 9 (1) ◽  
pp. 29 ◽  
Author(s):  
Lea Pehrson ◽  
Michael Nielsen ◽  
Carsten Ammitzbøl Lauridsen

The aim of this study was to provide an overview of the literature on machine learning (ML) algorithms applied to the Lung Image Database Consortium Image Collection (LIDC-IDRI) database as a tool for optimizing the detection of lung nodules in thoracic CT scans. This systematic review was compiled according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Only original research articles concerning algorithms applied to the LIDC-IDRI database were included. The initial search yielded 1972 publications after removing duplicates, and 41 of these articles were included in this study. The articles were divided into two subcategories describing their overall architecture. The majority of feature-based algorithms achieved an accuracy >90%, compared to the deep learning (DL) algorithms, which achieved accuracies in the range of 82.2%–97.6%. In conclusion, ML and DL algorithms are able to detect lung nodules with a high level of accuracy, sensitivity, and specificity when applied to an annotated archive of CT scans of the lung. However, there is no consensus on the method used to determine the efficiency of ML algorithms.
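The three figures of merit the review compares follow directly from confusion-matrix counts; the counts below are invented for illustration.

```python
# Accuracy, sensitivity, and specificity from confusion-matrix counts.
def metrics(tp, fp, tn, fn):
    accuracy    = (tp + tn) / (tp + fp + tn + fn)
    sensitivity = tp / (tp + fn)   # true-positive rate (recall)
    specificity = tn / (tn + fp)   # true-negative rate
    return accuracy, sensitivity, specificity

# e.g. a detector that finds 90 of 100 nodules with 5 false alarms:
acc, sens, spec = metrics(tp=90, fp=5, tn=95, fn=10)
print(acc, sens, spec)  # 0.925 0.9 0.95
```

The lack of consensus noted above matters precisely because studies may report any subset of these (or area under the ROC curve instead), making direct comparison across the 41 articles difficult.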


2019 ◽  
Vol 214 ◽  
pp. 06037
Author(s):  
Moritz Kiehn ◽  
Sabrina Amrouche ◽  
Paolo Calafiura ◽  
Victor Estrade ◽  
Steven Farrell ◽  
...  

The High-Luminosity LHC (HL-LHC) is expected to reach unprecedented collision intensities, which in turn will greatly increase the complexity of tracking within the event reconstruction. To reach out to computer science specialists, a tracking machine learning challenge (TrackML) was set up on Kaggle by a team of ATLAS, CMS, and LHCb physicists, tracking experts, and computer scientists, building on the experience of the successful Higgs Machine Learning challenge in 2014. A training dataset based on a simulation of a generic HL-LHC experiment tracker has been created, listing for each event the measured 3D points and the list of 3D points associated with each true track. Participants in the challenge must find the tracks in the test dataset, i.e. build the list of 3D points belonging to each track. The emphasis is on exposing innovative approaches rather than hyper-optimising known ones. A metric reflecting the accuracy of a model at finding the associations that matter most to physics analysis will allow good candidates to be selected to augment or replace existing algorithms.
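The shape of such a metric can be sketched with a simplified majority-vote score: for each reconstructed track, find the truth particle contributing the most hits, and count a hit as correct when it belongs to that majority particle. This is not the official TrackML score, which additionally weights hits by their importance to physics analysis; the toy event below is invented.

```python
# Simplified track-scoring sketch: fraction of hits assigned to the
# majority truth particle of their reconstructed track.
from collections import Counter

def score(reco_tracks, truth):
    """reco_tracks: lists of hit ids; truth: hit id -> true particle id."""
    correct = 0
    total = sum(len(t) for t in reco_tracks)
    for track in reco_tracks:
        _, count = Counter(truth[h] for h in track).most_common(1)[0]
        correct += count
    return correct / total

# Toy event: hits 0-2 belong to particle A, hits 3-5 to particle B.
truth = {0: "A", 1: "A", 2: "A", 3: "B", 4: "B", 5: "B"}
perfect = [[0, 1, 2], [3, 4, 5]]
mixed   = [[0, 1, 3], [2, 4, 5]]   # one hit swapped between tracks
print(score(perfect, truth), score(mixed, truth))
```

A perfect reconstruction scores 1.0, while each mis-assigned hit lowers the score, so the metric directly rewards the hit-to-track associations that tracking algorithms must get right.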


First Monday ◽  
2019 ◽  
Author(s):  
Niel Chah

Interest in deep learning, machine learning, and artificial intelligence from industry and the general public has recently reached a fever pitch. However, these terms are frequently misused, confused, and conflated. This paper serves as a non-technical guide for those interested in a high-level understanding of these increasingly influential notions, briefly exploring the historical context of deep learning, its public presence, and growing concerns over the limitations of these techniques. As a first step, artificial intelligence and machine learning are defined. Next, an overview of the historical background of deep learning reveals its wide scope and deep roots. A case study of a major deep learning implementation is then presented in order to analyze public perceptions shaped by technology-focused companies. Finally, a review of deep learning's limitations illustrates systemic vulnerabilities and a growing sense of concern over these systems.


2016 ◽  
Vol 2016 ◽  
pp. 1-10 ◽  
Author(s):  
Shan Pang ◽  
Xinyi Yang

In recent years, deep learning methods such as the convolutional neural network (CNN) and the deep belief network (DBN) have been developed and applied to image classification. However, they suffer from problems such as local minima, slow convergence rates, and intensive human intervention. In this paper, we propose a rapid learning method, the deep convolutional extreme learning machine (DC-ELM), which combines the power of CNNs with the fast training of ELMs. It uses multiple alternating convolution and pooling layers to effectively abstract high-level features from input images. The abstracted features are then fed to an ELM classifier, which leads to better generalization performance with faster learning speed. DC-ELM also introduces stochastic pooling in the last hidden layer to greatly reduce the dimensionality of the features, saving much training time and computation. We systematically evaluated the performance of DC-ELM on two handwritten digit data sets, MNIST and USPS. Experimental results show that our method achieved better testing accuracy with significantly shorter training time than deep learning methods and other ELM methods.
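The stochastic pooling step mentioned above can be sketched as follows: within each pooling region, one activation is sampled with probability proportional to its value, rather than taking the max or the mean. The region values are invented, and a fixed seed makes the draw repeatable.

```python
# Sketch of stochastic pooling: sample an activation from a pooling
# region with probability proportional to its magnitude.
import random

def stochastic_pool(region, rng):
    """Return one activation, drawn with probability ~ its value."""
    return rng.choices(region, weights=region, k=1)[0]

rng = random.Random(0)                 # fixed seed for repeatability
region = [0.1, 0.6, 0.3]               # activations in one pooling region
draws = [stochastic_pool(region, rng) for _ in range(10000)]
print(round(draws.count(0.6) / len(draws), 2))  # ~0.6, matching its weight
```

Unlike max pooling, small activations still occasionally pass through, which acts as a regularizer while still favouring strong responses.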

