Aggregated Learning: A Vector-Quantization Approach to Learning Neural Network Classifiers

Masoumeh Soflaei; Hongyu Guo; Ali Al-Bashabsheh; Yongyi Mao; Richong Zhang

doi:10.1609/aaai.v34i04.6038

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6038 ◽

2020 ◽

Vol 34 (04) ◽

pp. 5810-5817

Author(s):

Masoumeh Soflaei ◽

Hongyu Guo ◽

Ali Al-Bashabsheh ◽

Yongyi Mao ◽

Richong Zhang

Keyword(s):

Neural Network ◽

Vector Quantization ◽

Rate Distortion ◽

Network Models ◽

Representation Learning ◽

Classification Problem ◽

Neural Network Models ◽

Rate Distortion Theory ◽

Learning Framework ◽

Neural Network Classifiers

We consider the problem of learning a neural network classifier. Under the information bottleneck (IB) principle, we associate with this classification problem a representation learning problem, which we call “IB learning”. We show that IB learning is, in fact, equivalent to a special class of the quantization problem. The classical results in rate-distortion theory then suggest that IB learning can benefit from a “vector quantization” approach, namely, simultaneously learning the representations of multiple input objects. Such an approach, assisted with some variational techniques, results in a novel learning framework, “Aggregated Learning”, for classification with neural network models. In this framework, several objects are jointly classified by a single neural network. The effectiveness of this framework is verified through extensive experiments on standard image recognition and text classification tasks.
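The abstract above says that several objects are jointly classified by a single network, but it does not say how the joint input is built. The sketch below shows one possible reading, under the assumption that n feature vectors are simply concatenated and their n labels kept together as one target. The function name `aggregate` and the parameter `n` are hypothetical and do not come from the paper.

```python
from typing import List, Sequence, Tuple


def aggregate(
    inputs: Sequence[Sequence[float]],
    labels: Sequence[int],
    n: int,
) -> List[Tuple[List[float], Tuple[int, ...]]]:
    """Group consecutive examples into blocks of n objects.

    Assumption (not stated in the abstract): each block becomes one
    aggregated example whose input is the concatenation of the n feature
    vectors and whose target is the tuple of the n labels.
    Leftover examples that do not fill a whole block are dropped.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    if len(inputs) != len(labels):
        raise ValueError("inputs and labels must have the same length")

    blocks = []
    for start in range(0, len(inputs) - n + 1, n):
        joint_input: List[float] = []
        for x in inputs[start:start + n]:
            joint_input.extend(x)  # concatenate the n feature vectors
        joint_labels = tuple(labels[start:start + n])
        blocks.append((joint_input, joint_labels))
    return blocks


# Five toy examples grouped two at a time; the fifth example is dropped.
xs = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.7, 0.8], [0.9, 1.0]]
ys = [0, 1, 1, 0, 1]
blocks = aggregate(xs, ys, n=2)
```

A model trained on such blocks would take the concatenated input and emit n label predictions at once. How the paper actually combines inputs and defines the training loss should be checked against the full text.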

Ensemble Deep Learning Models for Heart Disease Classification: A Case Study from Mexico

Information ◽

10.3390/info11040207 ◽

2020 ◽

Vol 11 (4) ◽

pp. 207

Author(s):

Asma Baccouche ◽

Begonya Garcia-Zapirain ◽

Cristian Castillo Olea ◽

Adel Elmaghraby

Keyword(s):

Neural Network ◽

Heart Disease ◽

Ensemble Learning ◽

Heart Diseases ◽

Hypertensive Heart Disease ◽

Network Models ◽

Features Selection ◽

Neural Network Models ◽

Learning Framework ◽

Different Types

Heart diseases rank among the leading causes of mortality in the world. They have various types, including vascular, ischemic, and hypertensive heart disease. A large number of medical features are reported for patients in Electronic Health Records (EHR), allowing physicians to diagnose and monitor heart disease. We collected a dataset from Medica Norte Hospital in Mexico that includes 800 records and 141 indicators such as age, weight, glucose, blood pressure rate, and clinical symptoms. The distribution of the collected records is highly unbalanced across the different types of heart disease: 17% of records have hypertensive heart disease, 16% have ischemic heart disease, 7% have mixed heart disease, and 8% have valvular heart disease. Herein, we propose an ensemble-learning framework of different neural network models, and a method of aggregating random under-sampling. To improve the performance of the classification algorithms, we implement a data preprocessing step with feature selection. Experiments were conducted with unidirectional and bidirectional neural network models, and the results showed that an ensemble classifier combining a BiLSTM or BiGRU model with a CNN model had the best classification performance, with accuracy and F1-score between 91% and 96% for the different types of heart disease. These results are competitive and promising for the heart disease dataset. We showed that an ensemble-learning framework based on deep models could overcome the problem of classifying an unbalanced heart disease dataset. Our proposed framework can lead to highly accurate models that are adapted for real clinical data and diagnostic use.
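The abstract above mentions aggregating random under-sampling with an ensemble, without giving the procedure. A minimal sketch of one common way to do this follows: draw several balanced subsets by randomly under-sampling every class to the size of the rarest one, train one model per subset, and combine their predictions by majority vote. The names `undersampled_subsets` and `majority_vote` are hypothetical, and the class labels in the example are illustrative, not the authors' data.

```python
import random
from collections import Counter, defaultdict
from typing import Dict, Hashable, List, Sequence


def undersampled_subsets(
    labels: Sequence[Hashable], n_subsets: int, seed: int = 0
) -> List[List[int]]:
    """Return n_subsets lists of record indices, each balanced by class.

    Every class is randomly under-sampled (without replacement) to the
    size of the rarest class. This is an assumed procedure, not the one
    confirmed by the paper.
    """
    if not labels:
        raise ValueError("labels must not be empty")
    rng = random.Random(seed)
    by_class: Dict[Hashable, List[int]] = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    smallest = min(len(indices) for indices in by_class.values())

    subsets = []
    for _ in range(n_subsets):
        chosen: List[int] = []
        for indices in by_class.values():
            chosen.extend(rng.sample(indices, smallest))
        subsets.append(sorted(chosen))
    return subsets


def majority_vote(votes: Sequence[Hashable]) -> Hashable:
    """Combine per-model predictions; ties go to the label seen first."""
    return Counter(votes).most_common(1)[0][0]


# Toy, unbalanced label list (illustrative values only).
labels = ["none"] * 6 + ["ischemic"] * 2 + ["hypertensive"] * 3
subsets = undersampled_subsets(labels, n_subsets=3, seed=42)
final = majority_vote(["ischemic", "none", "ischemic"])
```

Each index list in `subsets` would be used to train one ensemble member, and `majority_vote` would merge the members' predictions for a new record.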

A STRUCTURAL DISTANCE-BASED CROSSOVER FOR NEURAL NETWORK CLASSIFIERS

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001412500127 ◽

2012 ◽

Vol 26 (06) ◽

pp. 1250012

Author(s):

MANUEL MORENO ◽

PEDRO ANTONIO GUTIÉRREZ ◽

CÉSAR HERVÁS-MARTÍNEZ

Keyword(s):

Neural Network ◽

Network Models ◽

Crossover Operator ◽

Classification Problems ◽

Neural Network Models ◽

Similarity Parameter ◽

Fitness Evaluation ◽

Machine Learning Classification ◽

Structural Distance ◽

Neural Network Classifiers

This paper presents a structural distance-based crossover for neural network classifiers, which is applied as part of a Memetic Algorithm (MA) for simultaneously evolving the structure and weights of neural network models applied to multiclass problems. Previous researchers have shown that this simultaneous evolution is a way to avoid noisy fitness evaluation. The MA incorporates a crossover operator that proves useful for ameliorating the permutation problem of the network representation (i.e. different genotypes can be used to represent the same neural network phenotype), increasing the structural diversity of the individuals and improving the accuracy of the results. Instead of a recombination probability, the crossover operator considers a similarity parameter (the minimum structural distance), which makes it possible to maintain a trade-off between global and local search. The neural network models selected in this work are product-unit neural networks (PUNNs), due to their increasing relevance in classification problems that show a high-order relationship between the input variables. The proposed MA is intended to reduce the overtraining problems that can arise in some datasets for this kind of model. The evolutionary system is applied to eight classification benchmarks, and the results of an analysis of variance (ANOVA) contrast show the effectiveness of the structural-based crossover operator and the capacity of our algorithm to obtain evolved PUNNs with a higher classification accuracy than those obtained using other evolutionary techniques. The results are also compared with those of popular machine learning classification methods and show competitive performance.

Vector Quantization using Artificial Neural Network Models

Information Technology: Transmission, Processing and Storage - Signal Processing in Telecommunications ◽

10.1007/978-1-4471-1013-2_27 ◽

1996 ◽

pp. 346-357

Author(s):

Aristides S. Galanopoulos ◽

James E. Fowler ◽

Stanley C. Ahalt

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Vector Quantization ◽

Network Models ◽

Neural Network Models ◽

Artificial Neural ◽

Artificial Neural Network Models

Toxic Comments Classification using Neural Network

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.g1005.0597s20 ◽

2020 ◽

Vol 9 (7S) ◽

pp. 12-15

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Natural Language Processing ◽

Language Processing ◽

Network Models ◽

Classification Problem ◽

The Internet ◽

Neural Network Models ◽

Feature Extraction Technique ◽

Large Corpus

Humans have developed many ways of expressing their thoughts through a variety of media. The internet has not only become a credible method for expressing one's thoughts, but is also rapidly becoming the single largest means of doing so. In this context, one area of focus is the study of negative online behavior, such as toxic comments containing threats, obscenity, insults, and abuse. The task of identifying and removing toxic communication from public forums is critical, yet analyzing a large corpus of comments is infeasible for human moderators. Our approach is to use Natural Language Processing (NLP) techniques to provide an efficient and accurate tool to detect online toxicity. We apply the TF-IDF feature extraction technique and neural network models to tackle a toxic comment classification problem with a labeled dataset from Wikipedia Talk pages.
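As an illustration of the TF-IDF feature extraction named in the abstract above, here is a minimal sketch. It assumes one textbook variant: term frequency is the count divided by the document length, and inverse document frequency is log(N / df) with no smoothing. The abstract does not say which variant or library the authors used, and the function name `tf_idf` and the toy comments are hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List


def tf_idf(documents: List[List[str]]) -> List[Dict[str, float]]:
    """Compute a TF-IDF weight for every term of every tokenized document.

    tf  = (occurrences of term in document) / (tokens in document)
    idf = log(number of documents / number of documents containing term)
    """
    n_docs = len(documents)
    doc_freq: Counter = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))  # count each term once per document

    weights = []
    for tokens in documents:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Toy comments, already lowercased and split into tokens.
docs = [
    ["you", "are", "kind"],
    ["you", "are", "awful"],
    ["thanks", "you"],
]
weights = tf_idf(docs)
```

In this toy corpus the word "you" appears in every comment, so its weight is zero, while a rarer word such as "awful" receives a higher weight. The resulting vectors would then be fed to a classifier.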

Intrinsic Plasticity for Natural Competition in Koniocortex-Like Neural Networks

International Journal of Neural Systems ◽

10.1142/s0129065716500404 ◽

2016 ◽

Vol 26 (05) ◽

pp. 1650040 ◽

Cited By ~ 2

Author(s):

Francisco Javier Ropero Peláez ◽

Mariana Antonia Aguiar-Furucho ◽

Diego Andina

Keyword(s):

Neural Network ◽

Vector Quantization ◽

Network Models ◽

Activation Function ◽

Inhibitory Interneurons ◽

Neural Network Models ◽

The Third ◽

Initial Network ◽

Sensory Cortices ◽

Intrinsic Plasticity

In this paper, we use the neural property known as intrinsic plasticity to develop neural network models that resemble the koniocortex, the fourth layer of sensory cortices. These models evolved from a very basic two-layered neural network to a complex associative koniocortex network. In the initial network, intrinsic and synaptic plasticity govern the shifting of the activation function, and the modification of synaptic weights, respectively. In this first version, competition is forced, so that the most activated neuron is arbitrarily set to one and the others to zero, while in the second, competition occurs naturally due to inhibition between second layer neurons. In the third version of the network, whose architecture is similar to the koniocortex, competition also occurs naturally owing to the interplay between inhibitory interneurons and synaptic and intrinsic plasticity. A more complex associative neural network was developed based on this basic koniocortex-like neural network, capable of dealing with incomplete patterns and ideally suited to operating similarly to a learning vector quantization network. We also discuss the biological plausibility of the networks and their role in a more complex thalamocortical model.
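The forced competition described for the first network version (the most activated neuron is set to one and the others to zero) can be sketched as a winner-take-all step. This is only an illustration of that single sentence. The function name `winner_take_all` and the tie-breaking rule (the first maximal neuron wins) are assumptions, and the sketch does not model intrinsic or synaptic plasticity.

```python
from typing import List, Sequence


def winner_take_all(activations: Sequence[float]) -> List[float]:
    """Set the most activated neuron to 1.0 and all others to 0.0.

    Ties are broken in favor of the lowest index (an assumption).
    """
    if not activations:
        return []
    winner = max(range(len(activations)), key=lambda i: activations[i])
    return [1.0 if i == winner else 0.0 for i in range(len(activations))]


outputs = winner_take_all([0.2, 0.9, 0.4])
```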

Multinomial logistic regression and product unit neural network models: Application of a new hybrid methodology for solving a classification problem in the livestock sector

Expert Systems with Applications ◽

10.1016/j.eswa.2009.04.070 ◽

2009 ◽

Vol 36 (10) ◽

pp. 12225-12235 ◽

Cited By ~ 9

Author(s):

Mercedes Torres ◽

Cesar Hervás ◽

Carlos García

Keyword(s):

Neural Network ◽

Logistic Regression ◽

Multinomial Logistic Regression ◽

Network Models ◽

Classification Problem ◽

Neural Network Models ◽

Livestock Sector ◽

Hybrid Methodology

Method of complex copper-zinc ore typification using neural network models

MINING INFORMATIONAL AND ANALYTICAL BULLETIN ◽

10.25018/0236-1493-2020-5-0-140-147 ◽

2020 ◽

Vol 5 ◽

pp. 140-147 ◽

Cited By ~ 1

Author(s):

T.N. Aleksandrova ◽

E.K. Ushakov ◽

A.V. Orlova ◽

...

Keyword(s):

Neural Network ◽

Network Models ◽

Neural Network Models ◽

Copper Zinc ◽

Complex Copper

Digital twin of equipment as a basis for the consumer in digital production

Automation. Modern Technologies ◽

10.36652/0869-4931-2020-74-9-394-402 ◽

2020 ◽

Keyword(s):

Neural Network ◽

Tool Wear ◽

Chip Formation ◽

Network Models ◽

Machining Accuracy ◽

Neural Network Models ◽

Digital Twin ◽

The Neural Network ◽

Digital Production ◽

Cyberphysical System

A series of neural network models used in the development of an aggregated digital twin of equipment as a cyber-physical system is presented. The twins of machining accuracy, chip formation, and tool wear are examined in detail. On their basis, systems for stabilizing the chip formation process during cutting and for diagnosing cutting tool wear are developed. Keywords: cyberphysical system; neural network model of equipment; big data; digital twin of the chip formation; digital twin of the tool wear; digital twin of nanostructured coating choice

Universal approximation with error bounds for dynamic artificial neural network models: A tutorial and some new results

2011 IEEE International Symposium on Computer-Aided Control System Design (CACSD) ◽

10.1109/cacsd.2011.6044542 ◽

2011 ◽

Cited By ~ 4

Author(s):

Kwang Ki Kevin Kim ◽

Ernesto Rios Patron ◽

Richard D. Braatz

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Error Bounds ◽

Network Models ◽

Universal Approximation ◽

Neural Network Models ◽

Artificial Neural ◽

Artificial Neural Network Models

Bridging the Analytical and Artificial Neural Network Models for Keyhole Formation with Experimental Verification in Laser-melting Deposition: A Novel Approach

Results in Physics ◽

10.1016/j.rinp.2021.104440 ◽

2021 ◽

pp. 104440

Author(s):

Muhammad Arif Mahmood ◽

Andrei C. Popescu ◽

Mihai Oane ◽

Asma Channa ◽

Sabin Mihai ◽

...

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Experimental Verification ◽

Network Models ◽

Laser Melting ◽

Neural Network Models ◽

Novel Approach ◽

Laser Melting Deposition ◽

Artificial Neural ◽

Artificial Neural Network Models
