Reports of my demise are greatly exaggerated: $N$-subjettiness taggers take on jet images

Liam Moore; Karl Nordström; Sreedevi Varma; Malcolm Fairbairn

doi:10.21468/scipostphys.7.3.036

Reports of my demise are greatly exaggerated: $N$-subjettiness taggers take on jet images

SciPost Physics ◽

10.21468/scipostphys.7.3.036 ◽

2019 ◽

Vol 7 (3) ◽

Cited By ~ 9

Author(s):

Liam Moore ◽

Karl Nordström ◽

Sreedevi Varma ◽

Malcolm Fairbairn

Keyword(s):

Neural Network ◽

Neural Networks ◽

Top Quark ◽

Quark Decays ◽

Body Kinematic ◽

Highly Correlated ◽

Object Tagging ◽

Jet Images ◽

Mass Information ◽

Image Network

We compare the performance of a convolutional neural network (CNN) trained on jet images with dense neural networks (DNNs) trained on nn-subjettiness variables to study the distinguishing power of these two separate techniques applied to top quark decays. We find that they perform almost identically and are highly correlated once jet mass information is included, which suggests they are accessing the same underlying information which can be intuitively understood as being contained in 4-, 5-, 6-, and 8-body kinematic phase spaces depending on the sample. This suggests both of these methods are highly useful for heavy object tagging and provides a tentative answer to the question of what the image network is actually learning.

Download Full-text

Combine and conquer: event reconstruction with Bayesian Ensemble Neural Networks

Journal of High Energy Physics ◽

10.1007/jhep04(2021)296 ◽

2021 ◽

Vol 2021 (4) ◽

Author(s):

Jack Y. Araz ◽

Michael Spannowsky

Keyword(s):

Neural Network ◽

Neural Networks ◽

Top Quark ◽

Feature Space ◽

Training Sample ◽

Bayesian Techniques ◽

Training Sample Size ◽

Event Reconstruction ◽

Neural Network Classifiers ◽

Quark Jets

Abstract Ensemble learning is a technique where multiple component learners are combined through a protocol. We propose an Ensemble Neural Network (ENN) that uses the combined latent-feature space of multiple neural network classifiers to improve the representation of the network hypothesis. We apply this approach to construct an ENN from Convolutional and Recurrent Neural Networks to discriminate top-quark jets from QCD jets. Such ENN provides the flexibility to improve the classification beyond simple prediction combining methods by linking different sources of error correlations, hence improving the representation between data and hypothesis. In combination with Bayesian techniques, we show that it can reduce epistemic uncertainties and the entropy of the hypothesis by simultaneously exploiting various kinematic correlations of the system, which also makes the network less susceptible to a limitation in training sample size.

Download Full-text

Improving Nonlinear Process Modelling Through Selective Combination of Multiple Neural Networks using Combined Correlation Coefficient Analysis

Jurnal Teknologi ◽

10.11113/jt.v48.237 ◽

2012 ◽

Author(s):

Zainal Ahmad ◽

Rabiatul ‘Adawiyah Mat Noor

Keyword(s):

Neural Network ◽

Neural Networks ◽

Correlation Coefficient ◽

Network Models ◽

Nonlinear Process ◽

Process Modelling ◽

Combination Method ◽

Neural Network Models ◽

Multiple Neural Networks ◽

Highly Correlated

This paper proposed a selective combination method based on combined correlation coefficient analysis to increase the robustness of the single neural network. The main objective of the proposed approach is to improve the generalisation capability of the neural network models by combining networks that are less correlated. The assumption that we made is that combining networks that are highly correlated might not improve the final prediction performance due to the fact that these networks present the same contribution to the final prediction. This might even deteriorate the robustness of the combined network. The result shows that combination multiple neural networks using the proposed approach improved the performance of the two nonlinear process modelling case studies in which there is a significant reduction of validation sum square error (SSE) of the networks was obtained. Key words: Multiple neural networks, selective combination neural networks, correlation coefficient, nonlinear process modelling

Download Full-text

ATLAS Jet Reconstruction, Calibration, and Tagging of Lorentzboosted Objects

EPJ Web of Conferences ◽

10.1051/epjconf/201818202113 ◽

2018 ◽

Vol 182 ◽

pp. 02113

Author(s):

Steven Schramm

Keyword(s):

Neural Networks ◽

Top Quark ◽

Z Boson ◽

Mass Scale ◽

Hadronic Decay ◽

Jet Substructure ◽

In Situ Calibration ◽

Boosted Decision Trees ◽

Jet Images

Jet reconstruction in the ATLAS detector takes multiple forms, as motivated by the intended usage of the jet. Different jet definitions are used in particular for the study of QCD jets and jets containing the hadronic decay of boosted massive particles. These different types of jets are calibrated through a series of mostly sequential steps, providing excellent uncertainties, including a first in situ calibration of the jet mass scale. Jet tagging is investigated, including both not-top-quark vs gluon discrimination as well as W/Z boson, H → bb, and top-quark identification. This includes a first look at the use of Boosted Decision Trees and Deep Neural Networks built from jet substructure variables, as well as Convolutional Neural Networks built from jet images. In all cases, these advanced techniques are seen to provide gains over the standard approaches, with the magnitude of the gain depending on the use case. Future methods for improving jet tagging are briefly discussed, including jet substructure-oriented particle flow primarily for W/Z tagging and new subjet reconstruction strategies for H → bb tagging.

Download Full-text

Physics Inspired Deep Neural Networks for Top Quark Reconstruction

EPJ Web of Conferences ◽

10.1051/epjconf/202024506029 ◽

2020 ◽

Vol 245 ◽

pp. 06029

Author(s):

Kevin Greif ◽

Kevin Lannon

Keyword(s):

Neural Network ◽

Neural Networks ◽

Computer Vision ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Top Quark ◽

Deep Neural Networks ◽

Great Success ◽

Fully Connected

Deep neural networks (DNNs) have been applied to the fields of computer vision and natural language processing with great success in recent years. The success of these applications has hinged on the development of specialized DNN architectures that take advantage of specific characteristics of the problem to be solved, namely convolutional neural networks for computer vision and recurrent neural networks for natural language processing. This research explores whether a neural network architecture specific to the task of identifying t → Wb decays in particle collision data yields better performance than a generic, fully-connected DNN. Although applied here to resolved top quark decays, this approach is inspired by an DNN technique for tagging boosted top quarks, which consists of defining custom neural network layers known as the combination and Lorentz layers. These layers encode knowledge of relativistic kinematics applied to combinations of particles, and the output of these specialized layers can then be fed into a fully connected neural network to learn tasks such as classification. This research compares the performance of these physics inspired networks to that of a generic, fully-connected DNN, to see if there is any advantage in terms of classification performance, size of the network, or ease of training.

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

Color Space Transformation using Neural Networks

Color and Imaging Conference ◽

10.2352/issn.2169-2629.2019.27.29 ◽

2019 ◽

Vol 2019 (1) ◽

pp. 153-158

Author(s):

Lindsay MacDonald

Keyword(s):

Neural Network ◽

Neural Networks ◽

Color Space ◽

Reflectance Spectra ◽

Network Architectures ◽

Color Spaces ◽

Natural Materials ◽

Space Transformation ◽

Color Space Transformation

We investigated how well a multilayer neural network could implement the mapping between two trichromatic color spaces, specifically from camera R,G,B to tristimulus X,Y,Z. For training the network, a set of 800,000 synthetic reflectance spectra was generated. For testing the network, a set of 8,714 real reflectance spectra was collated from instrumental measurements on textiles, paints and natural materials. Various network architectures were tested, with both linear and sigmoidal activations. Results show that over 85% of all test samples had color errors of less than 1.0 ΔE2000 units, much more accurate than could be achieved by regression.

Download Full-text

Estimating Pigment Concentrations from Spectral Images Using an Encoder‐Decoder Neural Network

Journal of Imaging Science and Technology ◽

10.2352/j.imagingsci.technol.2020.64.3.030502 ◽

2020 ◽

Vol 64 (3) ◽

pp. 30502-1-30502-15

Author(s):

Kensuke Fukumoto ◽

Norimichi Tsumura ◽

Roy Berns

Keyword(s):

Neural Network ◽

Neural Networks ◽

Absorption Coefficient ◽

Spectral Data ◽

High Accuracy ◽

Pigment Concentration ◽

Scattering Coefficient ◽

A Value ◽

Input And Output ◽

Pigment Concentrations

Abstract A method is proposed to estimate the concentration of pigments mixed in a painting, using the encoder‐decoder model of neural networks. The model is trained to output a value that is the same as its input, and its middle output extracts a certain feature as compressed information about the input. In this instance, the input and output are spectral data of a painting. The model is trained with pigment concentration as the middle output. A dataset containing the scattering coefficient and absorption coefficient of each of 19 pigments was used. The Kubelka‐Munk theory was applied to the coefficients to obtain many patterns of synthetic spectral data, which were used for training. The proposed method was tested using spectral images of 33 paintings, which showed that the method estimates, with high accuracy, the concentrations that have a similar spectrum of the target pigments.

Download Full-text

Neural Network Techniques for Time Series Prediction: A Review

JOIV International Journal on Informatics Visualization ◽

10.30630/joiv.3.3.281 ◽

2019 ◽

Vol 3 (3) ◽

Author(s):

Muhammad Faheem Mushtaq ◽

Urooj Akram ◽

Muhammad Aamir ◽

Haseeb Ali ◽

Muhammad Zulqarnain

Keyword(s):

Neural Network ◽

Neural Networks ◽

Time Series ◽

Time Series Data ◽

Weather Prediction ◽

Time Series Prediction ◽

Series Data ◽

Prediction Problem ◽

Neural Network Models ◽

Physical Time

It is important to predict a time series because many problems that are related to prediction such as health prediction problem, climate change prediction problem and weather prediction problem include a time component. To solve the time series prediction problem various techniques have been developed over many years to enhance the accuracy of forecasting. This paper presents a review of the prediction of physical time series applications using the neural network models. Neural Networks (NN) have appeared as an effective tool for forecasting of time series. Moreover, to resolve the problems related to time series data, there is a need of network with single layer trainable weights that is Higher Order Neural Network (HONN) which can perform nonlinearity mapping of input-output. So, the developers are focusing on HONN that has been recently considered to develop the input representation spaces broadly. The HONN model has the ability of functional mapping which determined through some time series problems and it shows the more benefits as compared to conventional Artificial Neural Networks (ANN). The goal of this research is to present the reader awareness about HONN for physical time series prediction, to highlight some benefits and challenges using HONN.

Download Full-text

Neural networks approached for modelling river suspended sediment concentration due to tropical storms

Global NEST Journal ◽

10.30955/gnj.000628 ◽

2013 ◽

Vol 11 (4) ◽

pp. 457-466

Keyword(s):

Neural Network ◽

Neural Networks ◽

Suspended Sediment ◽

Suspended Sediment Concentration ◽

Time Series Data ◽

Water Discharge ◽

Sediment Concentration ◽

Series Data ◽

Generalized Regression Neural Network ◽

Event Based

Artificial neural networks are one of the advanced technologies employed in hydrology modelling. This paper investigates the potential of two algorithm networks, the feed forward backpropagation (BP) and generalized regression neural network (GRNN) in comparison with the classical regression for modelling the event-based suspended sediment concentration at Jiasian diversion weir in Southern Taiwan. For this study, the hourly time series data comprised of water discharge, turbidity and suspended sediment concentration during the storm events in the year of 2002 are taken into account in the models. The statistical performances comparison showed that both BP and GRNN are superior to the classical regression in the weir sediment modelling. Additionally, the turbidity was found to be a dominant input variable over the water discharge for suspended sediment concentration estimation. Statistically, both neural network models can be successfully applied for the event-based suspended sediment concentration modelling in the weir studied herein when few data are available.

Download Full-text

Squeak and rattle noise classification using radial basis function neural networks

Noise Control Engineering Journal ◽

10.3397/1/376824 ◽

2020 ◽

Vol 68 (4) ◽

pp. 283-293

Author(s):

Oleksandr Pogorilyi ◽

Mohammad Fard ◽

John Davy ◽

Mechanical and Automotive Engineering, School ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

High Accuracy ◽

Training Method ◽

Vehicle Interior ◽

Trained Classifier ◽

Different Types ◽

Noise Classification ◽

Automatic Tool ◽

Multi Class Classification

In this article, an artificial neural network is proposed to classify short audio sequences of squeak and rattle (S&R) noises. The aim of the classification is to see how accurately the trained classifier can recognize different types of S&R sounds. Having a high accuracy model that can recognize audible S&R noises could help to build an automatic tool able to identify unpleasant vehicle interior sounds in a matter of seconds from a short audio recording of the sounds. In this article, the training method of the classifier is proposed, and the results show that the trained model can identify various classes of S&R noises: simple (binary clas- sification) and complex ones (multi class classification).

Download Full-text