DuFeNet: Improve the Accuracy and Increase Shape Bias of Neural Network Models

Signal Image and Video Processing ◽

10.1007/s11760-021-02065-3 ◽

2022 ◽

Author(s):

Zecong Ye ◽

Zhiqiang Gao ◽

Xiaolong Cui ◽

Yaojie Wang ◽

Nanliang Shan

Keyword(s):

Neural Network ◽

Image Classification ◽

Network Structure ◽

Network Performance ◽

Human Visual Perception ◽

Neural Network Models ◽

Bias Effect ◽

Shape Information ◽

Shape Bias ◽

The Neural Network

AbstractIn image classification field, existing work tends to modify the network structure to obtain higher accuracy or faster speed. However, some studies have found that the neural network usually has texture bias effect, which means that the neural network is more sensitive to the texture information than the shape information. Based on such phenomenon, we propose a new way to improve network performance by making full use of gradient information. The dual features network (DuFeNet) is proposed in this paper. In DuFeNet, one sub-network is used to learn the information of gradient features, and the other is a traditional neural network with texture bias. The structure of DuFeNet is easy to implement in the original neural network structure. The experimental results clearly show that DuFeNet can achieve better accuracy in image classification and detection. It can increase the shape bias of the network adapted to human visual perception. Besides, DuFeNet can be used without modifying the structure of the original network at lower additional parameters cost.

Download Full-text

Smooth Non-increasing Square Spatial Extents of Filters in Convolutional Layers of CNNs for Image Classification Problems

Applied Computer Systems ◽

10.2478/acss-2018-0007 ◽

2018 ◽

Vol 23 (1) ◽

pp. 52-62

Author(s):

Vadim V. Romanuke

Keyword(s):

Neural Network ◽

Image Classification ◽

Network Architecture ◽

Network Performance ◽

Medium Size ◽

Classification Problems ◽

Layered Architecture ◽

The Neural Network ◽

Neural Network Classifiers ◽

Secondary Condition

Abstract The present paper considers an open problem of setting hyperparameters for convolutional neural networks aimed at image classification. Since selecting filter spatial extents for convolutional layers is a topical problem, it is approximately solved by accumulating statistics of the neural network performance. The network architecture is taken on the basis of the MNIST database experience. The eight-layered architecture having four convolutional layers is nearly best suitable for classifying small and medium size images. Image databases are formed of grayscale images whose size range is 28 × 28 to 64 × 64 by step 2. Except for the filter spatial extents, the rest of those eight layer hyperparameters are unalterable, and they are chosen scrupulously based on rules of thumb. A sequence of possible filter spatial extents is generated for each size. Then sets of four filter spatial extents producing the best performance are extracted. The rule of this extraction that allows selecting the best filter spatial extents is formalized with two conditions. Mainly, difference between maximal and minimal extents must be as minimal as possible. No unit filter spatial extent is recommended. The secondary condition is that the filter spatial extents should constitute a non-increasing set. Validation on MNIST and CIFAR- 10 databases justifies such a solution, which can be extended for building convolutional neural network classifiers of colour and larger images.

Download Full-text

Digital twin of equipment as a basis for the consumer in digital production

Automation. Modern Techologies ◽

10.36652/0869-4931-2020-74-9-394-402 ◽

2020 ◽

Keyword(s):

Neural Network ◽

Tool Wear ◽

Chip Formation ◽

Network Models ◽

Machining Accuracy ◽

Neural Network Models ◽

Digital Twin ◽

The Neural Network ◽

Digital Production ◽

Cyberphysical System

The neural network models series used in the development of an aggregated digital twin of equipment as a cyber-physical system are presented. The twins of machining accuracy, chip formation and tool wear are examined in detail. On their basis, systems for stabilization of the chip formation process during cutting and diagnose of the cutting too wear are developed. Keywords cyberphysical system; neural network model of equipment; big data, digital twin of the chip formation; digital twin of the tool wear; digital twin of nanostructured coating choice

Download Full-text

Paired Augmentation for Improved Image Classification using Neural Network Models

2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE) ◽

10.1109/csde50874.2020.9411553 ◽

2020 ◽

Author(s):

Shikar Rajcomar ◽

Anban W. Pillay ◽

Edgar Jembere

Keyword(s):

Neural Network ◽

Image Classification ◽

Network Models ◽

Neural Network Models

Download Full-text

Spatial Variability Aware Deep Neural Networks (SVANN): A General Approach

ACM Transactions on Intelligent Systems and Technology ◽

10.1145/3466688 ◽

2021 ◽

Vol 12 (6) ◽

pp. 1-21

Author(s):

Jayant Gupta ◽

Carl Molnar ◽

Yiqun Xie ◽

Joe Knight ◽

Shashi Shekhar

Keyword(s):

Neural Network ◽

Spatial Variability ◽

Network Architecture ◽

Network Models ◽

Neural Network Architecture ◽

Neural Network Models ◽

Climatic Zones ◽

The Neural Network ◽

Plant Hardiness ◽

Interpretation Model

Spatial variability is a prominent feature of various geographic phenomena such as climatic zones, USDA plant hardiness zones, and terrestrial habitat types (e.g., forest, grasslands, wetlands, and deserts). However, current deep learning methods follow a spatial-one-size-fits-all (OSFA) approach to train single deep neural network models that do not account for spatial variability. Quantification of spatial variability can be challenging due to the influence of many geophysical factors. In preliminary work, we proposed a spatial variability aware neural network (SVANN-I, formerly called SVANN ) approach where weights are a function of location but the neural network architecture is location independent. In this work, we explore a more flexible SVANN-E approach where neural network architecture varies across geographic locations. In addition, we provide a taxonomy of SVANN types and a physics inspired interpretation model. Experiments with aerial imagery based wetland mapping show that SVANN-I outperforms OSFA and SVANN-E performs the best of all.

Download Full-text

Towards Enhanced Performance of Neural-Network-Based Fault Detection Using an Sequential D-Optimum Experimental Design

Applied Sciences ◽

10.3390/app8081290 ◽

2018 ◽

Vol 8 (8) ◽

pp. 1290 ◽

Cited By ~ 2

Author(s):

Beata Mrugalska

Keyword(s):

Neural Network ◽

Experimental Design ◽

Fault Detection ◽

Network Models ◽

Neural Model ◽

Neural Network Models ◽

Optimum Experimental Design ◽

The Neural Network ◽

Adaptive Thresholds ◽

Linear Neural Network

Increasing expectations of industrial system reliability require development of more effective and robust fault diagnosis methods. The paper presents a framework for quality improvement on the neural model applied for fault detection purposes. In particular, the proposed approach starts with an adaptation of the modified quasi-outer-bounding algorithm towards non-linear neural network models. Subsequently, its convergence is proven using quadratic boundedness paradigm. The obtained algorithm is then equipped with the sequential D-optimum experimental design mechanism allowing gradual reduction of the neural model uncertainty. Finally, an emerging robust fault detection framework on the basis of the neural network uncertainty description as the adaptive thresholds is proposed.

Download Full-text

Stock Market Prediction Using Artificial Neural Networks

Advanced Engineering Forum ◽

10.4028/www.scientific.net/aef.6-7.1055 ◽

2012 ◽

Vol 6-7 ◽

pp. 1055-1060 ◽

Cited By ~ 4

Author(s):

Yang Bing ◽

Jian Kun Hao ◽

Si Chang Zhang

Keyword(s):

Neural Network ◽

Prediction Models ◽

Learning Algorithm ◽

Stock Exchange ◽

Back Propagation ◽

Composite Index ◽

Network Models ◽

Back Propagation Neural Network ◽

Neural Network Models ◽

The Neural Network

In this study we apply back propagation Neural Network models to predict the daily Shanghai Stock Exchange Composite Index. The learning algorithm and gradient search technique are constructed in the models. We evaluate the prediction models and conclude that the Shanghai Stock Exchange Composite Index is predictable in the short term. Empirical study shows that the Neural Network models is successfully applied to predict the daily highest, lowest, and closing value of the Shanghai Stock Exchange Composite Index, but it can not predict the return rate of the Shanghai Stock Exchange Composite Index in short terms.

Download Full-text

Constructive Learning of Deep Neural Networks for Bigdata Analysis

International Journal of Computer Applications Technology and Research ◽

10.7753/ijcatr0912.1001 ◽

2020 ◽

Vol 9 (12) ◽

pp. 311-322

Author(s):

Soha Abd Mohamed El-Moamen ◽

Marghany Hassan Mohamed ◽

Mohammed F. Farghally

Keyword(s):

Neural Network ◽

Lung Cancer ◽

Binary Classification ◽

Network Models ◽

Classification Model ◽

Neural Network Models ◽

Constructive Learning ◽

The Neural Network ◽

Rapid Pace ◽

Better Than

The need for tracking and evaluation of patients in real-time has contributed to an increase in knowing people’s actions to enhance care facilities. Deep learning is good at both a rapid pace in collecting frameworks of big data healthcare and good predictions for detection the lung cancer early. In this paper, we proposed a constructive deep neural network with Apache Spark to classify images and levels of lung cancer. We developed a binary classification model using threshold technique classifying nodules to benign or malignant. At the proposed framework, the neural network models training, defined using the Keras API, is performed using BigDL in a distributed Spark clusters. The proposed algorithm has metrics AUC-0.9810, a misclassifying rate from which it has been shown that our suggested classifiers perform better than other classifiers.

Download Full-text

Robotic grasp detection using a novel two-stage approach

ASP Transactions on Internet of Things ◽

10.52810/tiot.2021.100031 ◽

2021 ◽

Vol 1 (1) ◽

pp. 19-29

Author(s):

Zhe Chu ◽

Mengkai Hu ◽

Xiangyu Chen

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Models ◽

Particle Swarm Optimizer ◽

Neural Network Models ◽

Two Stage ◽

The Neural Network ◽

End To End ◽

Small Change ◽

Robotic Grasp

Recently, deep learning has been successfully applied to robotic grasp detection. Based on convolutional neural networks (CNNs), there have been lots of end-to-end detection approaches. But end-to-end approaches have strict requirements for the dataset used for training the neural network models and it’s hard to achieve in practical use. Therefore, we proposed a two-stage approach using particle swarm optimizer (PSO) candidate estimator and CNN to detect the most likely grasp. Our approach achieved an accuracy of 92.8% on the Cornell Grasp Dataset, which leaped into the front ranks of the existing approaches and is able to run at real-time speeds. After a small change of the approach, we can predict multiple grasps per object in the meantime so that an object can be grasped in a variety of ways.

Download Full-text

Neural network models of the tactile system develop first-order units with spatially complex receptive fields

10.1101/164954 ◽

2017 ◽

Cited By ~ 2

Author(s):

Charlie W. Zhao ◽

Mark J. Daley ◽

J. Andrew Pruszynski

Keyword(s):

Neural Network ◽

Network Performance ◽

Receptive Fields ◽

Network Models ◽

Network Architectures ◽

Learning Tools ◽

Neural Network Models ◽

First Order ◽

Wide Range ◽

Tactile System

AbstractFirst-order tactile neurons have spatially complex receptive fields. Here we use machine learning tools to show that such complexity arises for a wide range of training sets and network architectures, and benefits network performance, especially on more difficult tasks and in the presence of noise. Our work suggests that spatially complex receptive fields are normatively good given the biological constraints of the tactile periphery.

Download Full-text

Analysis of Fin-Tube Evaporator Performance With Limited Experimental Data Using Artificial Neural Networks

10.1115/imece2000-1466 ◽

2000 ◽

Author(s):

Arturo Pacheco-Vega ◽

Mihir Sen ◽

Rodney L. McClain

Keyword(s):

Neural Network ◽

Heat Rate ◽

Network Models ◽

Activation Function ◽

Operating Conditions ◽

Training Data ◽

Neural Network Models ◽

The Neural Network ◽

Artificial Neural ◽

Fin Tube

Abstract In the current study we consider the problem of accuracy in heat rate estimations from artificial neural network models of heat exchangers used for refrigeration applications. The network configuration is of the feedforward type with a sigmoid activation function and a backpropagation algorithm. Limited experimental measurements from a manufacturer are used to show the capability of the neural network technique in modeling the heat transfer in these systems. Results from this exercise show that a well-trained network correlates the data with errors of the same order as the uncertainty of the measurements. It is also shown that the number and distribution of the training data are linked to the performance of the network when estimating the heat rates under different operating conditions, and that networks trained from few tests may give large errors. A methodology based on the cross-validation technique is presented to find regions where not enough data are available to construct a reliable neural network. The results from three tests show that the proposed methodology gives an upper bound of the estimated error in the heat rates.

Download Full-text