Further improvements in the generalization capability of the BRAINNE technique for extracting symbolic knowledge from neural networks

Author(s):  
T. Hossain ◽  
T.S. Dillon


Author(s):  
Ramesh Adhikari ◽  
Suresh Pokharel

Data augmentation is widely used in image processing and pattern recognition to increase the diversity of available data. It is commonly used to improve image classification accuracy when the available datasets are limited. Deep learning approaches have achieved immense breakthroughs in medical diagnostics over the last decade, but effective training of deep neural networks requires large datasets. Appropriate use of data augmentation techniques prevents the model from over-fitting and thus increases the generalization capability of the network when it is later tested on unseen data. However, obtaining such a large dataset for rare diseases remains a major challenge in the medical field. This study presents a synthetic data augmentation technique using Generative Adversarial Networks to use existing data more effectively and evaluate the generalization capability of neural networks. In this research, a convolutional neural network (CNN) model is used to classify X-ray images of the human chest as normal or pneumonia; synthetic X-ray images are then generated from the available dataset using a deep convolutional generative adversarial network (DCGAN) model. Finally, the CNN model is retrained on the original dataset together with the augmented data generated by the DCGAN model. The classification performance of the CNN model improves by 3.2% when the augmented data are used along with the originally available dataset.
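The augmentation workflow described in the abstract (generate synthetic X-rays with a trained DCGAN, then retrain the CNN on the combined set) can be sketched as follows. Everything here is illustrative: the image size, dataset sizes, labels, and the `dcgan_generate` mapping are assumptions standing in for a DCGAN generator actually trained on the real data.

```python
import numpy as np

rng = np.random.default_rng(0)

# Original (limited) chest X-ray training set: 100 grayscale 64x64 images.
# Shapes and sizes are illustrative, not taken from the paper.
X_real = rng.random((100, 64, 64))
y_real = rng.integers(0, 2, size=100)  # 0 = normal, 1 = pneumonia

def dcgan_generate(n_samples, latent_dim=100):
    """Stand-in for a trained DCGAN generator: maps latent noise
    vectors to synthetic 64x64 images in [-1, 1]. A real DCGAN would
    use transposed convolutions learned adversarially against a
    discriminator on X_real; this placeholder only keeps the
    pipeline runnable end to end."""
    z = rng.standard_normal((n_samples, latent_dim))
    proj = rng.standard_normal((latent_dim, 64 * 64))
    return np.tanh(z @ proj).reshape(n_samples, 64, 64)

# Generate synthetic images and merge them with the original data,
# mirroring the paper's final retraining step.
n_synth = 50
X_synth = dcgan_generate(n_synth)
y_synth = rng.integers(0, 2, size=n_synth)  # hypothetical synthetic labels

X_aug = np.concatenate([X_real, X_synth])
y_aug = np.concatenate([y_real, y_synth])
print(X_aug.shape)  # the CNN would be retrained on this enlarged set
```

The design point is only that the classifier sees the original and synthetic images as one pooled training set; how the synthetic labels are assigned depends on whether the generator is trained per class.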


1997 ◽  
Vol 9 (1) ◽  
pp. 1-42 ◽  
Author(s):  
Sepp Hochreiter ◽  
Jürgen Schmidhuber

We present a new algorithm for finding low-complexity neural networks with high generalization capability. The algorithm searches for a “flat” minimum of the error function. A flat minimum is a large connected region in weight space where the error remains approximately constant. An MDL-based, Bayesian argument suggests that flat minima correspond to “simple” networks and low expected overfitting. The argument is based on a Gibbs algorithm variant and a novel way of splitting generalization error into underfitting and overfitting error. Unlike many previous approaches, ours does not require Gaussian assumptions and does not depend on a “good” weight prior. Instead we have a prior over input-output functions, thus taking into account net architecture and training set. Although our algorithm requires the computation of second-order derivatives, it has backpropagation's order of complexity. It automatically and effectively prunes units, weights, and input lines. Various experiments with feedforward and recurrent nets are described. In an application to stock market prediction, flat minimum search outperforms conventional backprop, weight decay, and “optimal brain surgeon/optimal brain damage.”
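One way to make the notion of a “flat” minimum concrete is to measure how far the weights can be perturbed before the error leaves a tolerance band. The toy error surfaces and the `flatness_radius` helper below are hypothetical illustrations of that idea, not the paper's second-order flat minimum search algorithm.

```python
import numpy as np

# Toy 1-D "error surfaces" whose minima (both at w = 0) differ in
# flatness: flat_err has a wide basin, sharp_err a narrow one.
flat_err = lambda w: (w / 10.0) ** 2   # flat minimum
sharp_err = lambda w: (10.0 * w) ** 2  # sharp minimum

def flatness_radius(err, w_star=0.0, tol=0.1, step=0.01):
    """Largest perturbation radius around w_star within which the
    error stays below tol: a crude proxy for the 'large connected
    region in weight space where the error remains approximately
    constant' that flat minimum search looks for."""
    r = 0.0
    while err(w_star + r + step) < tol and err(w_star - r - step) < tol:
        r += step
    return r

r_flat = flatness_radius(flat_err)
r_sharp = flatness_radius(sharp_err)
# The flat minimum tolerates far larger weight perturbations, which
# is why it corresponds to a lower-complexity (more robust) network.
assert r_flat > r_sharp
```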


2013 ◽  
Vol 2 (2) ◽  
pp. 66-79 ◽  
Author(s):  
Onsy A. Abdel Alim ◽  
Amin Shoukry ◽  
Neamat A. Elboughdadly ◽  
Gehan Abouelseoud

In this paper, a pattern recognition module that makes use of 3-D images of objects is presented. The proposed module takes advantage of both the generalization capability of neural networks and the possibility of manipulating 3-D images to generate views of the object to be recognized at different poses. This allows the construction of a robust 3-D object recognition module that can find use in various applications, including military, biomedical, and mine-detection tasks. The paper proposes an efficient training procedure and decision-making strategy for the suggested neural network. Sample results of testing the module on 3-D images of several objects are also included, along with an insightful discussion of the implications of the results.
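The idea of manipulating 3-D data to generate views at different poses can be sketched with plain rotation matrices. The point cloud, `rotation_z`, and `generate_views` below are hypothetical illustrations of that pose-augmentation step, not the module's actual pipeline.

```python
import numpy as np

def rotation_z(theta):
    """Rotation matrix about the z-axis."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0],
                     [s,  c, 0.0],
                     [0.0, 0.0, 1.0]])

def generate_views(points, n_views=8):
    """Rotate a 3-D point cloud (N x 3) to n_views evenly spaced
    poses, mimicking the manipulation of 3-D images to obtain
    training views of an object at different orientations."""
    return [points @ rotation_z(2 * np.pi * k / n_views).T
            for k in range(n_views)]

# Hypothetical object as a few 3-D points; real input would come
# from 3-D imaging data.
obj = np.array([[1.0, 0.0, 0.0],
                [0.0, 1.0, 0.5],
                [0.0, 0.0, 1.0]])
views = generate_views(obj)

# Rotations preserve each point's distance from the origin, so every
# view is the same object seen at a different pose.
assert all(np.allclose(np.linalg.norm(v, axis=1),
                       np.linalg.norm(obj, axis=1)) for v in views)
```

Feeding such generated views to the classifier during training is what lets the network generalize across object orientations.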


2021 ◽  
Vol 11 (10) ◽  
pp. 2546-2557
Author(s):  
Sudha Subramaniam ◽  
K. B. Jayanthi ◽  
C. Rajasekaran ◽  
C. Sunder

Intima-media thickness (IMT) of the carotid artery is an important marker of cardiovascular disease. Automated measurement of IMT requires segmentation of the intima-media complex (IMC). Traditional methods, which use shape, color, and texture for classification, have poor generalization capability. This paper proposes two models: a pipeline model and an end-to-end model, using a Convolutional Neural Network (CNN) and an autoencoder–decoder network, respectively. The CNN architecture is implemented and tested by varying the number of convolutional layers, the kernel size, and the number of kernels. The autoencoder–decoder performs pixel-wise classification using two interconnected pathways that identify the lumen–intima (LI) and media–adventitia (MA) boundaries. This allows reconstruction of the segmented region for measurement of IMT. Both methods are tested on a dataset of 550 subjects. The results clearly indicate that the end-to-end model has an edge over the pipeline model, exhibiting less deviation between the automated measurement and the measurement made by the radiologist. The pipeline model, however, has better segmentation accuracy when the images used for training are small. The convolutional neural network with autoencoder–decoder proves robust through sparse representation and faster learning with better generalization. The experimental setup is also analyzed by connecting the TensorFlow-simulated results to a Raspberry Pi, and the outcomes are analyzed.
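The encoder–decoder's pixel-wise classification can be illustrated with a minimal downsample/upsample sketch. The pooling-based `encode`, `decode`, and `segment` functions below are assumptions standing in for the learned convolutional pathways; they only show how a per-pixel label map at the input resolution enables reconstruction of the segmented region.

```python
import numpy as np

def encode(img, factor=2):
    """Encoder stage: 2x2 max pooling (downsampling), the kind of
    spatial compression a convolutional encoder performs."""
    h, w = img.shape
    return img.reshape(h // factor, factor,
                       w // factor, factor).max(axis=(1, 3))

def decode(feat, factor=2):
    """Decoder stage: nearest-neighbour upsampling back to the input
    resolution, so a decision can be made for every input pixel."""
    return np.repeat(np.repeat(feat, factor, axis=0), factor, axis=1)

def segment(img, threshold=0.5):
    """End-to-end pixel-wise classification: each pixel receives a
    binary label (e.g. IMC vs background), so the segmented region
    can be reconstructed and then measured."""
    return (decode(encode(img)) > threshold).astype(np.uint8)

img = np.random.default_rng(2).random((64, 64))  # stand-in image frame
mask = segment(img)
assert mask.shape == img.shape  # one label per input pixel
```

A trained network would replace the fixed pooling and upsampling with learned filters, but the shape contract (label map matching the input resolution) is the same.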

