The Compact Support Neural Network

Neural networks are popular and useful in many fields, but they have the problem of giving high confidence responses for examples that are away from the training data. This makes the neural networks very confident in their prediction while making gross mistakes, thus limiting their reliability for safety-critical applications such as autonomous driving and space exploration, etc. This paper introduces a novel neuron generalization that has the standard dot-product-based neuron and the radial basis function (RBF) neuron as two extreme cases of a shape parameter. Using a rectified linear unit (ReLU) as the activation function results in a novel neuron that has compact support, which means its output is zero outside a bounded domain. To address the difficulties in training the proposed neural network, it introduces a novel training method that takes a pretrained standard neural network that is fine-tuned while gradually increasing the shape parameter to the desired value. The theoretical findings of the paper are bound on the gradient of the proposed neuron and proof that a neural network with such neurons has the universal approximation property. This means that the network can approximate any continuous and integrable function with an arbitrary degree of accuracy. The experimental findings on standard benchmark datasets show that the proposed approach has smaller test errors than the state-of-the-art competing methods and outperforms the competing methods in detecting out-of-distribution samples on two out of three datasets.

Download Full-text

Enhancing protein backbone angle prediction by using simpler models of deep neural networks

Scientific Reports ◽

10.1038/s41598-020-76317-6 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Fereshteh Mataeimoghadam ◽

M. A. Hakim Newton ◽

Abdollah Dehzangi ◽

Abdul Karim ◽

B. Jayaram ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Structure Prediction ◽

Protein Structures ◽

Absolute Error ◽

Grand Challenge ◽

Protein Backbone ◽

The Neural Network ◽

Benchmark Datasets ◽

The Neural Networks

Abstract Protein structure prediction is a grand challenge. Prediction of protein structures via the representations using backbone dihedral angles has recently achieved significant progress along with the on-going surge of deep neural network (DNN) research in general. However, we observe that in the protein backbone angle prediction research, there is an overall trend to employ more and more complex neural networks and then to throw more and more features to the neural networks. While more features might add more predictive power to the neural network, we argue that redundant features could rather clutter the scenario and more complex neural networks then just could counterbalance the noise. From artificial intelligence and machine learning perspectives, problem representations and solution approaches do mutually interact and thus affect performance. We also argue that comparatively simpler predictors can more easily be reconstructed than the more complex ones. With these arguments in mind, we present a deep learning method named Simpler Angle Predictor (SAP) to train simpler DNN models that enhance protein backbone angle prediction. We then empirically show that SAP can significantly outperform existing state-of-the-art methods on well-known benchmark datasets: for some types of angles, the differences are 6–8 in terms of mean absolute error (MAE). The SAP program along with its data is available from the website https://gitlab.com/mahnewton/sap.

Download Full-text

Applying Fuzzy Logic and Neural Network to Rheumatism Treatment in Oriental Medicine

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2007.p0004 ◽

2007 ◽

Vol 11 (1) ◽

pp. 4-10 ◽

Cited By ~ 2

Author(s):

Cao Thang ◽

◽

Eric W. Cooper ◽

Yukinobu Hoshino ◽

Katsuari Kamei ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Fuzzy Logic ◽

Decision Support ◽

Decision Support System ◽

Support System ◽

Fuzzy Inference ◽

Training Data ◽

Oriental Medicine ◽

The Neural Networks

In this paper, we present an application of soft computing into a decision support system RETS: Rheumatic Evaluation and Treatment System in Oriental Medicine (OM). Inputs of the system are severities of observed symptoms on patients and outputs are a diagnosis of rheumatic states, its explanations and herbal prescriptions. First, an outline of the proposed decision support system is described after considering rheumatic diagnoses and prescriptions by OM doctors. Next, diagnosis by fuzzy inference and prescription by neural networks are described. By fuzzy inference, RETS diagnoses the most appropriate rheumatic state in which the patient appears to be infected, then it gives a prescription written in suitable herbs with reasonable amounts based on neural networks. Training data for the neural networks is collected from experienced OM physicians and OM text books. Finally, we describe evaluations and restrictions of RETS.

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

Accurate and Transferable Multitask Prediction of Chemical Properties with an Atoms-in-Molecule Neural Network

10.26434/chemrxiv.7151435.v2 ◽

2018 ◽

Author(s):

Roman Zubatyuk ◽

Justin S. Smith ◽

Jerzy Leszczynski ◽

Olexandr Isayev

Keyword(s):

Neural Network ◽

Molecular System ◽

Computational Cost ◽

Chemical Properties ◽

The State ◽

Molecular Properties ◽

Training Data ◽

Dft Methods ◽

Benchmark Datasets ◽

Quantum Phenomena

<p>Atomic and molecular properties could be evaluated from the fundamental Schrodinger’s equation and therefore represent different modalities of the same quantum phenomena. Here we present AIMNet, a modular and chemically inspired deep neural network potential. We used AIMNet with multitarget training to learn multiple modalities of the state of the atom in a molecular system. The resulting model shows on several benchmark datasets the state-of-the-art accuracy, comparable to the results of orders of magnitude more expensive DFT methods. It can simultaneously predict several atomic and molecular properties without an increase in computational cost. With AIMNet we show a new dimension of transferability: the ability to learn new targets utilizing multimodal information from previous training. The model can learn implicit solvation energy (like SMD) utilizing only a fraction of original training data, and archive MAD error of 1.1 kcal/mol compared to experimental solvation free energies in MNSol database.</p>

Download Full-text

Operational Determination of the Activated Sludge Process Using Neural Networks

Water Science & Technology ◽

10.2166/wst.1992.0762 ◽

1992 ◽

Vol 26 (9-11) ◽

pp. 2461-2464 ◽

Cited By ~ 2

Author(s):

R. D. Tyagi ◽

Y. G. Du

Keyword(s):

Neural Network ◽

Neural Networks ◽

Steady State ◽

Activated Sludge ◽

Feedforward Neural Network ◽

Training Data ◽

Activated Sludge Process

A steady-statemathematical model of an activated sludgeprocess with a secondary settler was developed. With a limited number of training data samples obtained from the simulation at steady state, a feedforward neural network was established which exhibits an excellent capability for the operational prediction and determination.

Download Full-text

Tax Fraud Detection through Neural Networks: An Application Using a Sample of Personal Income Taxpayers

Future Internet ◽

10.3390/fi11040086 ◽

2019 ◽

Vol 11 (4) ◽

pp. 86 ◽

Cited By ~ 2

Author(s):

César Pérez López ◽

María Delgado Rodríguez ◽

Sonia de Lucas Santos

Keyword(s):

Neural Network ◽

Neural Networks ◽

Income Tax ◽

Fraud Detection ◽

Personal Income ◽

Personal Income Tax ◽

Tax Returns ◽

The Neural Networks ◽

Tax Fraud ◽

Efficiency Rate

The goal of the present research is to contribute to the detection of tax fraud concerning personal income tax returns (IRPF, in Spanish) filed in Spain, through the use of Machine Learning advanced predictive tools, by applying Multilayer Perceptron neural network (MLP) models. The possibilities springing from these techniques have been applied to a broad range of personal income return data supplied by the Institute of Fiscal Studies (IEF). The use of the neural networks enabled taxpayer segmentation as well as calculation of the probability concerning an individual taxpayer’s propensity to attempt to evade taxes. The results showed that the selected model has an efficiency rate of 84.3%, implying an improvement in relation to other models utilized in tax fraud detection. The proposal can be generalized to quantify an individual’s propensity to commit fraud with regards to other kinds of taxes. These models will support tax offices to help them arrive at the best decisions regarding action plans to combat tax fraud.

Download Full-text

An efficient pruning scheme of deep neural networks for Internet of Things applications

EURASIP Journal on Advances in Signal Processing ◽

10.1186/s13634-021-00744-4 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Chen Qi ◽

Shibo Shen ◽

Rongpeng Li ◽

Zhifeng Zhao ◽

Qing Liu ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Internet Of Things ◽

Deep Neural Networks ◽

Computational Cost ◽

Superior Performance ◽

Compact Structure ◽

Resource Limited ◽

Benchmark Datasets ◽

Iot Devices

AbstractNowadays, deep neural networks (DNNs) have been rapidly deployed to realize a number of functionalities like sensing, imaging, classification, recognition, etc. However, the computational-intensive requirement of DNNs makes it difficult to be applicable for resource-limited Internet of Things (IoT) devices. In this paper, we propose a novel pruning-based paradigm that aims to reduce the computational cost of DNNs, by uncovering a more compact structure and learning the effective weights therein, on the basis of not compromising the expressive capability of DNNs. In particular, our algorithm can achieve efficient end-to-end training that transfers a redundant neural network to a compact one with a specifically targeted compression rate directly. We comprehensively evaluate our approach on various representative benchmark datasets and compared with typical advanced convolutional neural network (CNN) architectures. The experimental results verify the superior performance and robust effectiveness of our scheme. For example, when pruning VGG on CIFAR-10, our proposed scheme is able to significantly reduce its FLOPs (floating-point operations) and number of parameters with a proportion of 76.2% and 94.1%, respectively, while still maintaining a satisfactory accuracy. To sum up, our scheme could facilitate the integration of DNNs into the common machine-learning-based IoT framework and establish distributed training of neural networks in both cloud and edge.

Download Full-text

Localization of Scattering Objects Using Neural Networks

Sensors ◽

10.3390/s21010011 ◽

2020 ◽

Vol 21 (1) ◽

pp. 11

Author(s):

Domonkos Haffner ◽

Ferenc Izsák

Keyword(s):

Neural Networks ◽

Plane Waves ◽

Measurement Data ◽

Scattered Wave ◽

Training Data ◽

Data Set ◽

Main Compound ◽

Incident Plane ◽

The Neural Networks ◽

Simulation Package

The localization of multiple scattering objects is performed while using scattered waves. An up-to-date approach: neural networks are used to estimate the corresponding locations. In the scattering phenomenon under investigation, we assume known incident plane waves, fully reflecting balls with known diameters and measurement data of the scattered wave on one fixed segment. The training data are constructed while using the simulation package μ-diff in Matlab. The structure of the neural networks, which are widely used for similar purposes, is further developed. A complex locally connected layer is the main compound of the proposed setup. With this and an appropriate preprocessing of the training data set, the number of parameters can be kept at a relatively low level. As a result, using a relatively large training data set, the unknown locations of the objects can be estimated effectively.

Download Full-text

An Efficient Neural Network-Based Method for Diagnosing Faults of PV Array

Sustainability ◽

10.3390/su13116194 ◽

2021 ◽

Vol 13 (11) ◽

pp. 6194

Author(s):

Selma Tchoketch_Kebir ◽

Nawal Cheggaga ◽

Adrian Ilinca ◽

Sabri Boulouma

Keyword(s):

Neural Network ◽

Neural Networks ◽

Modeling Process ◽

Photovoltaic Arrays ◽

Mode Of Operation ◽

Different Types ◽

The Neural Networks ◽

Operational Modes ◽

Types Of Faults ◽

Electrical Data

This paper presents an efficient neural network-based method for fault diagnosis in photovoltaic arrays. The proposed method was elaborated on three main steps: the data-feeding step, the fault-modeling step, and the decision step. The first step consists of feeding the real meteorological and electrical data to the neural networks, namely solar irradiance, panel temperature, photovoltaic-current, and photovoltaic-voltage. The second step consists of modeling a healthy mode of operation and five additional faulty operational modes; the modeling process is carried out using two networks of artificial neural networks. From this step, six classes are obtained, where each class corresponds to a predefined model, namely, the faultless scenario and five faulty scenarios. The third step involves the diagnosis decision about the system’s state. Based on the results from the above step, two probabilistic neural networks will classify each generated data according to the six classes. The obtained results show that the developed method can effectively detect different types of faults and classify them. Besides, this method still achieves high performances even in the presence of noises. It provides a diagnosis even in the presence of data injected at reduced real-time, which proves its robustness.

Download Full-text

k-Nearest Neighbor Learning with Graph Neural Networks

Mathematics ◽

10.3390/math9080830 ◽

2021 ◽

Vol 9 (8) ◽

pp. 830

Author(s):

Seokho Kang

Keyword(s):

Neural Network ◽

Nearest Neighbor ◽

Learning Algorithm ◽

Weighting Function ◽

High Sensitivity ◽

Training Data ◽

K Nearest Neighbor ◽

Main Challenge ◽

Benchmark Datasets ◽

Graph Neural Networks

k-nearest neighbor (kNN) is a widely used learning algorithm for supervised learning tasks. In practice, the main challenge when using kNN is its high sensitivity to its hyperparameter setting, including the number of nearest neighbors k, the distance function, and the weighting function. To improve the robustness to hyperparameters, this study presents a novel kNN learning method based on a graph neural network, named kNNGNN. Given training data, the method learns a task-specific kNN rule in an end-to-end fashion by means of a graph neural network that takes the kNN graph of an instance to predict the label of the instance. The distance and weighting functions are implicitly embedded within the graph neural network. For a query instance, the prediction is obtained by performing a kNN search from the training data to create a kNN graph and passing it through the graph neural network. The effectiveness of the proposed method is demonstrated using various benchmark datasets for classification and regression tasks.

Download Full-text