Improving Generalization of Neural Networks Through Pruning

A technique for constructing neural network architectures that generalize better is presented under the name Ockham's Razor: several networks are trained and then pruned by removing connections one by one and retraining. The networks that end up with the fewest connections generalize best. The method is tested on a bit-string classification task (the contiguity problem): the optimal architecture emerges, resulting in perfect generalization. The internal representation of the network changes substantially during retraining, which distinguishes the method from previous pruning studies.
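The prune-and-retrain loop described in this abstract can be sketched on a toy problem. The sketch below is not the authors' setup: it assumes a single logistic unit instead of a multi-layer network, a task where the label is simply the first bit (not the contiguity problem), smallest-magnitude selection of the connection to remove, and a stopping rule of "training accuracy drops below 100%". All function names and constants are hypothetical.

```python
import itertools
import numpy as np

def train(X, y, w, b, mask, steps=300, lr=0.5):
    """Full-batch gradient descent on the logistic loss; pruned weights stay at zero."""
    w = w * mask
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        err = p - y
        w = w - lr * (X.T @ err) / len(y) * mask
        b = b - lr * err.mean()
    return w, b

def accuracy(X, y, w, b):
    """Fraction of samples whose thresholded output matches the label."""
    return float(np.mean(((X @ w + b) > 0) == (y == 1)))

def prune_one_by_one(X, y, w, b):
    """Remove one connection at a time, retrain, and keep the removal only if
    the training set is still classified perfectly (assumed stopping rule)."""
    mask = np.ones_like(w)
    w, b = train(X, y, w, b, mask)
    while mask.sum() > 0:
        active = np.flatnonzero(mask)
        # Assumed selection rule: drop the active weight of smallest magnitude.
        drop = active[np.argmin(np.abs(w[active]))]
        trial = mask.copy()
        trial[drop] = 0.0
        w_try, b_try = train(X, y, w, b, trial)
        if accuracy(X, y, w_try, b_try) < 1.0:
            break  # this connection is needed; keep the previous network
        mask, w, b = trial, w_try, b_try
    return mask, w, b

# Toy data: all 6-bit strings in +/-1 encoding; the label equals the first bit.
X = np.array(list(itertools.product([-1.0, 1.0], repeat=6)))
y = (X[:, 0] > 0).astype(float)
w_init = np.array([0.1, 0.3, -0.2, 0.25, -0.15, 0.05])  # arbitrary fixed start

mask, w, b = prune_one_by_one(X, y, w_init, 0.0)
print("connections kept:", int(mask.sum()), "accuracy:", accuracy(X, y, w, b))
```

On this toy task the loop strips every connection except the one from the first input, which is the minimal architecture for the assumed labeling rule. A real experiment would evaluate the pruned network on held-out strings to measure generalization.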

Neural networks for classification of strokes in electrical impedance tomography on a 3D head model

Mathematics in Engineering ◽

10.3934/mine.2022029 ◽

2022 ◽

Vol 4 (4) ◽

pp. 1-22

Author(s):

Valentina Candiani ◽

Matteo Santacesaria ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Electrical Impedance Tomography ◽

Electrical Impedance ◽

Network Architectures ◽

Impedance Tomography ◽

Average Accuracy ◽

Neural Network Architectures ◽

Fully Connected

We consider the problem of the detection of brain hemorrhages from three-dimensional (3D) electrical impedance tomography (EIT) measurements. This is a condition requiring urgent treatment for which EIT might provide a portable and quick diagnosis. We employ two neural network architectures - a fully connected and a convolutional one - for the classification of hemorrhagic and ischemic strokes. The networks are trained on a dataset with 40,000 samples of synthetic electrode measurements generated with the complete electrode model on realistic heads with a 3-layer structure. We consider changes in head anatomy and layers, electrode position, measurement noise and conductivity values. We then test the networks on several datasets of unseen EIT data, with more complex stroke modeling (different shapes and volumes), higher levels of noise and different amounts of electrode misplacement. On most test datasets we achieve ≥ 90% average accuracy with fully connected neural networks, while the convolutional ones display an average accuracy ≥ 80%. Despite the use of simple neural network architectures, the results obtained are very promising and motivate the applications of EIT-based classification methods on real phantoms and ultimately on human patients.

Classification of Fermi-LAT sources with deep learning using energy and time spectra

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/stab2389 ◽

2021 ◽

Vol 507 (3) ◽

pp. 4061-4073

Author(s):

Thorben Finke ◽

Michael Krämer ◽

Silvia Manconi

Keyword(s):

Neural Network ◽

Neural Networks ◽

Active Galactic Nuclei ◽

Photon Energy ◽

Deep Neural Network ◽

Gamma Ray ◽

Galactic Nuclei ◽

Network Architectures ◽

Neural Network Architectures

Despite the growing number of gamma-ray sources detected by the Fermi-Large Area Telescope (LAT), about one-third of the sources in each survey remains of uncertain type. We present a new deep neural network approach for the classification of unidentified or unassociated gamma-ray sources in the last release of the Fermi-LAT catalogue (4FGL-DR2) obtained with 10 yr of data. In contrast to previous work, our method directly uses the measurements of the photon energy spectrum and time series as input for the classification, instead of specific, human-crafted features. Dense neural networks, and for the first time in the context of gamma-ray source classification recurrent neural networks, are studied in depth. We focus on the separation between extragalactic sources, i.e. active galactic nuclei, and Galactic pulsars, and on the further classification of pulsars into young and millisecond pulsars. Our neural network architectures provide powerful classifiers, with a performance that is comparable to previous analyses based on human-crafted features. Our benchmark neural network predicts that of the sources of uncertain type in the 4FGL-DR2 catalogue, 1050 are active galactic nuclei and 78 are Galactic pulsars, with both classes following the expected sky distribution and the clustering in the variability–curvature plane. We investigate the problem of sample selection bias by testing our architectures against a cross-match test data set using an older catalogue, and propose a feature selection algorithm using autoencoders. Our list of high-confidence candidate sources labelled by the neural networks provides a set of targets for further multiwavelength observations addressed to identify their nature. The deep neural network architectures we develop can be easily extended to include specific features, as well as multiwavelength data on the source photon energy and time spectra coming from different instruments.

Convolutional Neural Network Architectures for Texture Classification of Pulmonary Nodules

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-030-13469-3_91 ◽

2019 ◽

pp. 783-791

Author(s):

Carlos A. Ferreira ◽

António Cunha ◽

Ana Maria Mendonça ◽

Aurélio Campilho

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Texture Classification ◽

Pulmonary Nodules ◽

Network Architectures ◽

Neural Network Architectures

A comparison of neural network architectures for the classification of three types of infant cry vocalizations

Proceedings of 17th International Conference of the Engineering in Medicine and Biology Society ◽

10.1109/iembs.1995.575380 ◽

2002 ◽

Cited By ~ 22

Author(s):

M. Petroni ◽

A.S. Malowany ◽

C.C. Johnston ◽

B.J. Stevens

Keyword(s):

Neural Network ◽

Network Architectures ◽

Infant Cry ◽

Neural Network Architectures

Evaluation of Statistical and Neural Network Architectures for the Classification of Paddy Kernels Using Morphological Features

International Journal of Food Properties ◽

10.1080/10942912.2015.1071839 ◽

2015 ◽

Vol 19 (6) ◽

pp. 1227-1241

Author(s):

Javd Khazaei ◽

Iman Golpour ◽

Parviz Ahmadi Moghaddam

Keyword(s):

Neural Network ◽

Morphological Features ◽

Network Architectures ◽

Neural Network Architectures

Identification and classification of dental implant systems using various deep learning‐based convolutional neural network architectures

Clinical Oral Implants Research ◽

10.1111/clr.175_13509 ◽

2019 ◽

Vol 30 (S19) ◽

pp. 217-217

Author(s):

Lee Jae‐Hong

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Dental Implant ◽

Network Architectures ◽

Neural Network Architectures

Swarm-Based Nature-Inspired Metaheuristics for Neural Network Optimization

Advances in Computational Intelligence and Robotics - Handbook of Research on Modeling, Analysis, and Application of Nature-Inspired Metaheuristic Algorithms ◽

10.4018/978-1-5225-2857-9.ch002 ◽

2018 ◽

pp. 23-53

Author(s):

Swathi Jamjala Narayanan ◽

Boominathan Perumal ◽

Jayant G. Rohra

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Optimization ◽

Network Architectures ◽

Neural Network Optimization ◽

Local Optima ◽

Gradient Based ◽

Neural Network Architectures ◽

Nature Inspired Algorithms ◽

Nature Inspired Metaheuristics

Nature-inspired algorithms have been productively applied to train neural network architectures. Other mechanisms, such as gradient descent, second-order methods and Levenberg-Marquardt methods, also exist for optimizing the parameters of neural networks. Compared to gradient-based methods, nature-inspired algorithms are found to be less sensitive to the initial weights and less likely to become trapped in local optima. Despite these benefits, some nature-inspired algorithms also suffer from stagnation when applied to neural networks. The other challenge in applying nature-inspired techniques to neural networks is handling a high-dimensional and correlated weight space. Hence, there is a need for scalable nature-inspired algorithms for high-dimensional neural network optimization. In this chapter, the characteristics of nature-inspired techniques for optimizing neural network architectures, along with their applicability, advantages and limitations/challenges, are studied.
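As an illustration of the kind of gradient-free training this abstract refers to, the following is a minimal sketch of particle swarm optimization (one common swarm-based method) applied to the weights of a tiny network. It is not taken from the chapter: the 2-2-1 network, the XOR data, the swarm size and the coefficients `inertia`, `c1` and `c2` are all assumptions chosen for brevity.

```python
import numpy as np

def mlp_loss(theta, X, y):
    """Mean squared error of a 2-2-1 network whose 9 parameters are packed in theta."""
    W1 = theta[:4].reshape(2, 2)
    b1 = theta[4:6]
    W2 = theta[6:8]
    b2 = theta[8]
    h = np.tanh(X @ W1 + b1)
    z = h @ W2 + b2
    out = 0.5 * (1.0 + np.tanh(0.5 * z))  # numerically stable sigmoid
    return float(np.mean((out - y) ** 2))

def pso(loss, dim, n_particles=20, iters=200, inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Basic global-best particle swarm; returns the best position and the
    best loss recorded after each iteration."""
    rng = np.random.default_rng(seed)
    pos = rng.uniform(-1.0, 1.0, size=(n_particles, dim))
    vel = np.zeros_like(pos)
    pbest = pos.copy()
    pbest_val = np.array([loss(p) for p in pos])
    g = int(np.argmin(pbest_val))
    gbest, gbest_val = pbest[g].copy(), float(pbest_val[g])
    history = [gbest_val]
    for _ in range(iters):
        r1 = rng.random(pos.shape)
        r2 = rng.random(pos.shape)
        # Velocity blends momentum, pull to each particle's own best, and pull to the swarm best.
        vel = inertia * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        pos = pos + vel
        vals = np.array([loss(p) for p in pos])
        improved = vals < pbest_val
        pbest[improved] = pos[improved]
        pbest_val[improved] = vals[improved]
        g = int(np.argmin(pbest_val))
        if pbest_val[g] < gbest_val:
            gbest, gbest_val = pbest[g].copy(), float(pbest_val[g])
        history.append(gbest_val)
    return gbest, history

# Toy data (assumed): the XOR truth table.
X_xor = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
y_xor = np.array([0.0, 1.0, 1.0, 0.0])

best, history = pso(lambda t: mlp_loss(t, X_xor, y_xor), dim=9)
print("initial best loss:", history[0], "final best loss:", history[-1])
```

No gradients are computed anywhere in the loop, which is why such methods do not depend on the loss surface being differentiable. The recorded best loss can only stay flat or decrease, so a long flat stretch in `history` is one simple way to observe the stagnation the abstract mentions.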
