Classification of Fermi-LAT sources with deep learning using energy and time spectra

2021 · Vol 507 (3) · pp. 4061-4073
Author(s): Thorben Finke, Michael Krämer, Silvia Manconi

ABSTRACT Despite the growing number of gamma-ray sources detected by the Fermi Large Area Telescope (LAT), about one-third of the sources in each survey remain of uncertain type. We present a new deep neural network approach for the classification of unidentified or unassociated gamma-ray sources in the last release of the Fermi-LAT catalogue (4FGL-DR2), obtained with 10 yr of data. In contrast to previous work, our method directly uses the measurements of the photon energy spectrum and time series as input for the classification, instead of specific, human-crafted features. Dense neural networks and, for the first time in the context of gamma-ray source classification, recurrent neural networks are studied in depth. We focus on the separation between extragalactic sources, i.e. active galactic nuclei, and Galactic pulsars, and on the further classification of pulsars into young and millisecond pulsars. Our neural network architectures provide powerful classifiers, with a performance comparable to previous analyses based on human-crafted features. Our benchmark neural network predicts that, of the sources of uncertain type in the 4FGL-DR2 catalogue, 1050 are active galactic nuclei and 78 are Galactic pulsars, with both classes following the expected sky distribution and the clustering in the variability–curvature plane. We investigate the problem of sample selection bias by testing our architectures against a cross-matched test data set built from an older catalogue, and propose a feature selection algorithm using autoencoders. Our list of high-confidence candidate sources labelled by the neural networks provides a set of targets for further multiwavelength observations aimed at identifying their nature. The deep neural network architectures we develop can easily be extended to include specific features, as well as multiwavelength data on the source photon energy and time spectra coming from different instruments.
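The dense-network branch of such a classifier can be sketched in a few lines. This is an illustrative toy only, not the authors' benchmark architecture: the 7-bin input mirrors the energy-band fluxes reported in 4FGL-style catalogues, but the layer sizes are arbitrary and the weights are random rather than trained.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sketch: a tiny dense classifier over a binned photon energy
# spectrum. Layer sizes are arbitrary (not the paper's network) and the
# weights are untrained random values.
def init_params(n_in=7, n_hidden=16, n_out=2):
    return {
        "W1": rng.normal(0.0, 0.1, (n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_out)),
        "b2": np.zeros(n_out),
    }

def forward(params, x):
    h = np.maximum(0.0, x @ params["W1"] + params["b1"])   # ReLU hidden layer
    logits = h @ params["W2"] + params["b2"]
    e = np.exp(logits - logits.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)               # softmax: {AGN, pulsar}

params = init_params()
spectrum = rng.random((1, 7))      # one source's normalized band fluxes
probs = forward(params, spectrum)  # class probabilities for that source
```

The recurrent branch the abstract mentions would replace the first dense layer with an RNN cell consuming the time series bin by bin; the softmax head is unchanged.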

2022 · Vol 4 (4) · pp. 1-22
Author(s): Valentina Candiani, Matteo Santacesaria

ABSTRACT We consider the problem of the detection of brain hemorrhages from three-dimensional (3D) electrical impedance tomography (EIT) measurements. This is a condition requiring urgent treatment, for which EIT might provide a portable and quick diagnosis. We employ two neural network architectures, a fully connected and a convolutional one, for the classification of hemorrhagic and ischemic strokes. The networks are trained on a dataset of 40,000 samples of synthetic electrode measurements generated with the complete electrode model on realistic heads with a three-layer structure. We consider changes in head anatomy and layers, electrode position, measurement noise and conductivity values. We then test the networks on several datasets of unseen EIT data, with more complex stroke modelling (different shapes and volumes), higher levels of noise and different amounts of electrode misplacement. On most test datasets we achieve ≥90% average accuracy with fully connected neural networks, while the convolutional ones display an average accuracy ≥80%. Despite the use of simple neural network architectures, the results obtained are very promising and motivate the application of EIT-based classification methods on real phantoms and ultimately on human patients.
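The underlying classification task can be illustrated with a much simpler stand-in than the paper's networks: hemorrhage raises tissue conductivity while ischemia lowers it, so (in a heavily idealized linear caricature, not the complete electrode model) the stroke type is a sign detectable from the electrode measurement vector. A logistic-regression classifier trained by gradient descent already separates such synthetic data; the measurement count and data model below are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy stand-in for the stroke-classification task: labels are a linear
# function of a synthetic "measurement" vector. 32 measurements and the
# Gaussian data model are arbitrary choices, not the paper's setup.
n_meas, n_samples = 32, 400
w_true = rng.normal(size=n_meas)
X = rng.normal(size=(n_samples, n_meas))
y = (X @ w_true > 0).astype(float)   # 1 = "hemorrhagic", 0 = "ischemic"

# Logistic regression trained by full-batch gradient descent.
w = np.zeros(n_meas)
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(X @ w)))        # predicted probabilities
    w -= 0.1 * X.T @ (p - y) / n_samples      # gradient step on log-loss

acc = ((1.0 / (1.0 + np.exp(-(X @ w))) > 0.5) == y).mean()
```

The paper's fully connected and convolutional networks replace the single linear layer here with learned nonlinear features, which is what lets them cope with anatomy changes, noise and electrode misplacement.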


1991 · Vol 01 (04) · pp. 317-326
Author(s): Hans Henrik Thodberg

A technique for constructing neural network architectures with better ability to generalize is presented under the name Ockham's Razor: several networks are trained and then pruned by removing connections one by one and retraining. The networks which achieve fewest connections generalize best. The method is tested on a classification of bit strings (the contiguity problem): the optimal architecture emerges, resulting in perfect generalization. The internal representation of the network changes substantially during the retraining, and this distinguishes the method from previous pruning studies.
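The prune-and-retrain loop can be sketched with a linear model in place of a neural network. This is a simplification on two counts: the paper prunes a network on the contiguity problem, and its criterion for which connection to remove may differ from the weight-magnitude rule used here, which is just a common stand-in.

```python
import numpy as np

rng = np.random.default_rng(2)

# Data generated from a sparse linear rule: only 2 of 8 inputs matter.
X = rng.normal(size=(200, 8))
w_sparse = np.array([1.5, -2.0, 0, 0, 0, 0, 0, 0])
y = X @ w_sparse + 0.01 * rng.normal(size=200)

w = np.zeros(8)
mask = np.ones(8, dtype=bool)   # True = connection still present

def retrain(w, mask, steps=300, lr=0.05):
    """Gradient descent on squared error, updating only unpruned weights."""
    for _ in range(steps):
        grad = X.T @ (X @ (w * mask) - y) / len(y)
        w = w - lr * grad * mask
    return w

w = retrain(w, mask)
for _ in range(6):                                  # prune 6 of 8 connections
    alive = np.flatnonzero(mask)
    weakest = alive[np.argmin(np.abs(w[alive]))]
    mask[weakest] = False                           # remove one connection...
    w = retrain(w, mask)                            # ...then retrain the rest
```

After six rounds only the two genuinely relevant connections survive, mirroring the abstract's point that the sparsest retrained network recovers the optimal architecture.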


2021 · Vol 22 (1)
Author(s): Anand Ramachandran, Steven S. Lumetta, Eric W. Klee, Deming Chen

Abstract
Background: Modern next-generation and third-generation sequencing methods, such as the Illumina and PacBio Circular Consensus Sequencing platforms, provide accurate sequencing data. Parallel developments in deep learning have enabled the application of deep neural networks to variant calling, surpassing the accuracy of classical approaches in many settings. DeepVariant, arguably the most popular among such methods, transforms the problem of variant calling into one of image recognition, where a deep neural network analyzes sequencing data formatted as images, achieving high accuracy. In this paper, we explore an alternative approach to designing deep neural networks for variant calling: we use meticulously designed network architectures and customized variant inference functions that account for the underlying nature of sequencing data, instead of converting the problem to one of image recognition.
Results: Results from 27 whole-genome variant calling experiments spanning Illumina, PacBio and hybrid Illumina-PacBio settings suggest that our method allows vastly smaller deep neural networks to outperform the Inception-v3 architecture used in DeepVariant for indel and substitution-type variant calls. For example, our method reduces the number of indel call errors by up to 18%, 55% and 65% for Illumina, PacBio and hybrid Illumina-PacBio variant calling, respectively, compared to a similarly trained DeepVariant pipeline. In these cases, our models are between 7 and 14 times smaller.
Conclusions: We believe that the improved accuracy and problem-specific customization of our models will enable more accurate pipelines and further method development in the field. HELLO is available at https://github.com/anands-repo/hello


2021 · Vol 55 (1) · pp. 68-76
Author(s): Marco Serafini

Graph Neural Networks (GNNs) are a new and increasingly popular family of deep neural network architectures to perform learning on graphs. Training them efficiently is challenging due to the irregular nature of graph data. The problem becomes even more challenging when scaling to large graphs that exceed the capacity of single devices. Standard approaches to distributed DNN training, like data and model parallelism, do not directly apply to GNNs. Instead, two different approaches have emerged in the literature: whole-graph and sample-based training. In this paper, we review and compare the two approaches. Scalability is challenging with both approaches, but we make a case that research should focus on sample-based training since it is a more promising approach. Finally, we review recent systems supporting sample-based training.
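The key mechanism behind sample-based training is neighbour sampling: each minibatch expands its seed nodes through a bounded number of neighbours per layer, so memory scales with the fan-outs rather than the full graph. The sketch below (in the style of GraphSAGE-type loaders; the toy graph and fan-outs are invented) shows that expansion step.

```python
import random

random.seed(0)

# A tiny undirected graph as an adjacency list (illustrative only).
graph = {0: [1, 2, 3], 1: [0, 2], 2: [0, 1, 3], 3: [0, 2], 4: [3]}

def sample_blocks(seeds, fanouts):
    """For each GNN layer, keep at most `fanout` sampled neighbours per
    node, so one minibatch touches a bounded subgraph, not the whole graph."""
    blocks, frontier = [], list(seeds)
    for fanout in fanouts:
        edges = []
        for v in frontier:
            nbrs = graph.get(v, [])
            picked = random.sample(nbrs, min(fanout, len(nbrs)))
            edges += [(u, v) for u in picked]        # message edges u -> v
        blocks.append(edges)
        frontier = sorted({u for u, _ in edges})     # next layer's inputs
    return blocks

# A 2-layer GNN minibatch seeded at node 0, fan-out 2 per layer.
blocks = sample_blocks(seeds=[0], fanouts=[2, 2])
```

Whole-graph training, by contrast, propagates over every edge at once, which is what makes it hard to scale past a single device's memory.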


2021
Author(s): Luke Gundry, Gareth Kennedy, Alan Bond, Jie Zhang

The use of Deep Neural Networks (DNNs) for the classification of electrochemical mechanisms, based on training with simulations of the initial cycle of potential, has been reported. In this paper,...


2021
Author(s): Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham

This paper provides an analysis of state-of-the-art activation functions with respect to the supervised classification performance of deep neural networks. The activation functions considered are the Rectified Linear Unit (ReLU), Exponential Linear Unit (ELU), Scaled Exponential Linear Unit (SELU), Gaussian Error Linear Unit (GELU), and Inverse Square Root Linear Unit (ISRLU). To evaluate them, experiments are conducted over two deep learning architectures that integrate these activation functions. The first model, based on a Multilayer Perceptron (MLP), is evaluated on the MNIST dataset. The second, a VGGish-based architecture, is applied to Acoustic Scene Classification (ASC) Task 1A of the DCASE 2018 challenge, to evaluate whether these activation functions work well across different datasets as well as different network architectures.
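The five activations being compared have short closed forms, shown below with their standard published constants; GELU is given via its common tanh approximation rather than the exact erf form, and the α values are the usual defaults, not parameters reported by this paper.

```python
import numpy as np

# The five activations compared in the paper, with standard constants.
def relu(x):
    return np.maximum(0.0, x)

def elu(x, a=1.0):
    return np.where(x > 0, x, a * (np.exp(x) - 1))

def selu(x, a=1.6732632423543772, l=1.0507009873554805):
    # Self-normalizing variant of ELU, scaled by lambda.
    return l * np.where(x > 0, x, a * (np.exp(x) - 1))

def gelu(x):
    # Common tanh approximation of x * Phi(x).
    return 0.5 * x * (1 + np.tanh(np.sqrt(2 / np.pi) * (x + 0.044715 * x**3)))

def isrlu(x, a=1.0):
    # Inverse square root linear unit: smooth ELU-like negative branch.
    return np.where(x >= 0, x, x / np.sqrt(1 + a * x**2))

x = np.array([-2.0, -0.5, 0.0, 1.0])
outs = {f.__name__: f(x) for f in (relu, elu, selu, gelu, isrlu)}
```

All five are identity-like for large positive inputs; they differ in how (and how smoothly) they saturate or pass gradient for negative inputs, which is what the paper's MLP and VGGish experiments probe.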

