neural network training Latest Research Papers

Impact of Asymmetric Weight Update on Neural Network Training With Tiki-Taka Algorithm

Frontiers in Neuroscience ◽

10.3389/fnins.2021.767953 ◽

2022 ◽

Vol 15 ◽

Author(s):

Chaeun Lee ◽

Kyungmi Noh ◽

Wonjae Ji ◽

Tayfun Gokmen ◽

Seyoung Kim

Keyword(s):

Neural Network ◽

Power Efficiency ◽

Calibration Method ◽

Neural Network Training ◽

Systematic Analysis ◽

Training Performance ◽

Actual Weight ◽

Network Training ◽

The Neural Network ◽

The Impact

Recent progress in novel non-volatile memory-based synaptic device technologies and their feasibility for matrix-vector multiplication (MVM) has ignited active research on implementing analog neural network training accelerators with resistive crosspoint arrays. While significant performance boost as well as area- and power-efficiency is theoretically predicted, the realization of such analog accelerators is largely limited by non-ideal switching characteristics of crosspoint elements. One of the most performance-limiting non-idealities is the conductance update asymmetry which is known to distort the actual weight change values away from the calculation by error back-propagation and, therefore, significantly deteriorates the neural network training performance. To address this issue by an algorithmic remedy, Tiki-Taka algorithm was proposed and shown to be effective for neural network training with asymmetric devices. However, a systematic analysis to reveal the required asymmetry specification to guarantee the neural network performance has been unexplored. Here, we quantitatively analyze the impact of update asymmetry on the neural network training performance when trained with Tiki-Taka algorithm by exploring the space of asymmetry and hyper-parameters and measuring the classification accuracy. We discover that the update asymmetry level of the auxiliary array affects the way the optimizer takes the importance of previous gradients, whereas that of main array affects the frequency of accepting those gradients. We propose a novel calibration method to find the optimal operating point in terms of device and network parameters. By searching over the hyper-parameter space of Tiki-Taka algorithm using interpolation and Gaussian filtering, we find the optimal hyper-parameters efficiently and reveal the optimal range of asymmetry, namely the asymmetry specification. Finally, we show that the analysis and calibration method be applicable to spiking neural networks.

Polarization-Based Haze Removal Using Self-Supervised Network

Frontiers in Physics ◽

10.3389/fphy.2021.789232 ◽

2022 ◽

Vol 9 ◽

Author(s):

Yingjie Shi ◽

Enlai Guo ◽

Lianfa Bai ◽

Jing Han

Keyword(s):

Neural Network ◽

Gradient Descent ◽

Transmission Model ◽

Neural Network Training ◽

Original Image ◽

Atmospheric Transmission ◽

Atmospheric Scattering ◽

Haze Removal ◽

Network Training ◽

The Neural Network

Atmospheric scattering caused by suspended particles in the air severely degrades the scene radiance. This paper proposes a method to remove haze by using a neural network that combines scene polarization information. The neural network is self-supervised and online globally optimization can be achieved by using the atmospheric transmission model and gradient descent. Therefore, the proposed method does not require any haze-free image as the constraint for neural network training. The proposed approach is far superior to supervised algorithms in the performance of dehazing and is highly robust to the scene. It is proved that this method can significantly improve the contrast of the original image, and the detailed information of the scene can be effectively enhanced.

Evaluation of Parameter Settings for Training Neural Networks Using Backpropagation Algorithms

10.4018/978-1-6684-2408-7.ch009 ◽

2022 ◽

pp. 202-226

Author(s):

Leema N. ◽

Khanna H. Nehemiah ◽

Elgin Christo V. R. ◽

Kannan A.

Keyword(s):

Neural Network ◽

Neural Networks ◽

Activation Function ◽

Neural Network Training ◽

Network Parameter ◽

Network Parameters ◽

Network Training ◽

Rate Minimum ◽

Hidden Layer ◽

Function Number

Artificial neural networks (ANN) are widely used for classification, and the training algorithm commonly used is the backpropagation (BP) algorithm. The major bottleneck faced in the backpropagation neural network training is in fixing the appropriate values for network parameters. The network parameters are initial weights, biases, activation function, number of hidden layers and the number of neurons per hidden layer, number of training epochs, learning rate, minimum error, and momentum term for the classification task. The objective of this work is to investigate the performance of 12 different BP algorithms with the impact of variations in network parameter values for the neural network training. The algorithms were evaluated with different training and testing samples taken from the three benchmark clinical datasets, namely, Pima Indian Diabetes (PID), Hepatitis, and Wisconsin Breast Cancer (WBC) dataset obtained from the University of California Irvine (UCI) machine learning repository.

Histo-fetch – On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training

Journal of Pathology Informatics ◽

10.4103/jpi.jpi_59_20 ◽

2022 ◽

Vol 13 (1) ◽

pp. 7

Author(s):

Pinaki Sarder ◽

Brendon Lutnick ◽

LeemaKrishna Murali ◽

Brandon Ginley ◽

AviZ Rosenberg

Keyword(s):

Neural Network ◽

Neural Network Training ◽

Network Training ◽

Whole Slide Images

Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images

Informatics in Medicine Unlocked ◽

10.1016/j.imu.2022.100850 ◽

2022 ◽

pp. 100850

Author(s):

Angus Lang Sun Lee ◽

Curtis Chun Kit To ◽

Alfred Lok Hang Lee ◽

Joshua Jing Xi Li ◽

Ronald Cheong Kin Chan

Keyword(s):

Neural Network ◽

Cancer Detection ◽

Small Cell ◽

Neural Network Training ◽

Small Cell Lung ◽

Tile Size ◽

Network Training ◽

Selection For ◽

Lung Cancer Detection ◽

Whole Slide Images

Artificial Neural Network training using metaheuristics for medical data classification: An experimental study

Expert Systems with Applications ◽

10.1016/j.eswa.2021.116423 ◽

2022 ◽

pp. 116423

Author(s):

Tapas Si ◽

Jayri Bagchi ◽

Péricles B.C. Miranda

Keyword(s):

Neural Network ◽

Experimental Study ◽

Artificial Neural Network ◽

Data Classification ◽

Medical Data ◽

Neural Network Training ◽

Network Training ◽

Medical Data Classification ◽

Artificial Neural ◽

Artificial Neural Network Training

Implementation experiments on convolutional neural network training using synthetic images for 3D pose estimation of an excavator on real images

Automation in Construction ◽

10.1016/j.autcon.2021.103996 ◽

2022 ◽

Vol 133 ◽

pp. 103996

Author(s):

Bilawal Mahmood ◽

SangUk Han ◽

Jongwon Seo

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Pose Estimation ◽

Neural Network Training ◽

3D Pose Estimation ◽

Network Training ◽

Synthetic Images

TSUNAMI: Triple Sparsity-Aware Ultra Energy-Efficient Neural Network Training Accelerator With Multi-Modal Iterative Pruning

IEEE Transactions on Circuits and Systems I Regular Papers ◽

10.1109/tcsi.2021.3138092 ◽

2022 ◽

pp. 1-13

Author(s):

Sangyeob Kim ◽

Juhyoung Lee ◽

Sanghoon Kang ◽

Donghyeon Han ◽

Wooyoung Jo ◽

...

Keyword(s):

Neural Network ◽

Energy Efficient ◽

Neural Network Training ◽

Network Training

MaxwellNet: Physics-driven deep neural network training based on Maxwell’s equations

APL Photonics ◽

10.1063/5.0071616 ◽

2022 ◽

Vol 7 (1) ◽

pp. 011301

Author(s):

Joowon Lim ◽

Demetri Psaltis

Keyword(s):

Neural Network ◽

Maxwell’S Equations ◽

Deep Neural Network ◽

Maxwell's Equations ◽

Neural Network Training ◽

Network Training

Comparative Evaluation of Predicting Energy Consumption of Absorption Heat Pump with Multilayer Shallow Neural Network Training Algorithms

Buildings ◽

10.3390/buildings12010013 ◽

2021 ◽

Vol 12 (1) ◽

pp. 13

Author(s):

Jee-Heon Kim ◽

Nam-Chul Seong ◽

Won-Chang Choi

Keyword(s):

Neural Network ◽

Energy Consumption ◽

Conjugate Gradient ◽

Air Conditioning ◽

Gradient Descent ◽

Predictive Performance ◽

Neural Network Training ◽

Network Algorithms ◽

Network Training ◽

Performance Evaluation Index

The performance of various multilayer neural network algorithms to predict the energy consumption of an absorption chiller in an air conditioning system under the same conditions was compared and evaluated in this study. Each prediction model was created using 12 representative multilayer shallow neural network algorithms. As training data, about a month of actual operation data during the heating period was used, and the predictive performance of 12 algorithms according to the training size was evaluated. The prediction results indicate that the error rates using the measured values are 0.09% minimum, 5.76% maximum, and 1.94 standard deviation (SD) for the Levenberg–Marquardt backpropagation model and 0.41% minimum, 5.05% maximum, and 1.68 SD for the Bayesian regularization backpropagation model. The conjugate gradient with Polak–Ribiére updates backpropagation model yielded lower values than the other two models, with 0.31% minimum, 5.73% maximum, and 1.76 SD. Based on the results for the predictive performance evaluation index, CvRMSE, all other models (conjugate gradient with Fletcher–Reeves updates backpropagation, one-step secant backpropagation, gradient descent with momentum and adaptive learning rate backpropagation, gradient descent with momentum backpropagation) except for the gradient descent backpropagation model yielded results that satisfy ASHRAE (American Society of Heating, Refrigerating and Air-Conditioning Engineers) Guideline 14. The results of this study confirm that the prediction performance may differ for each multilayer neural network training algorithm. Therefore, selecting the appropriate model to fit the characteristics of a specific project is essential.

neural network training
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Impact of Asymmetric Weight Update on Neural Network Training With Tiki-Taka Algorithm

Polarization-Based Haze Removal Using Self-Supervised Network

Evaluation of Parameter Settings for Training Neural Networks Using Backpropagation Algorithms

Histo-fetch – On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training

Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images

Artificial Neural Network training using metaheuristics for medical data classification: An experimental study

Implementation experiments on convolutional neural network training using synthetic images for 3D pose estimation of an excavator on real images

TSUNAMI: Triple Sparsity-Aware Ultra Energy-Efficient Neural Network Training Accelerator With Multi-Modal Iterative Pruning

MaxwellNet: Physics-driven deep neural network training based on Maxwell’s equations

Comparative Evaluation of Predicting Energy Consumption of Absorption Heat Pump with Multilayer Shallow Neural Network Training Algorithms

Export Citation Format

neural network trainingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Impact of Asymmetric Weight Update on Neural Network Training With Tiki-Taka Algorithm

Polarization-Based Haze Removal Using Self-Supervised Network

Evaluation of Parameter Settings for Training Neural Networks Using Backpropagation Algorithms

Histo-fetch – On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training

Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images

Artificial Neural Network training using metaheuristics for medical data classification: An experimental study

Implementation experiments on convolutional neural network training using synthetic images for 3D pose estimation of an excavator on real images

TSUNAMI: Triple Sparsity-Aware Ultra Energy-Efficient Neural Network Training Accelerator With Multi-Modal Iterative Pruning

MaxwellNet: Physics-driven deep neural network training based on Maxwell’s equations

Comparative Evaluation of Predicting Energy Consumption of Absorption Heat Pump with Multilayer Shallow Neural Network Training Algorithms

neural network training
Recently Published Documents