ensembles of classifiers
Recently Published Documents


TOTAL DOCUMENTS

86
(FIVE YEARS 21)

H-INDEX

16
(FIVE YEARS 1)

2021 ◽  
Author(s):  
Muhammad Furqan Afzal ◽  
Christian David Márton ◽  
Erin L. Rich ◽  
Kanaka Rajan

Neuroscience has seen a dramatic increase in the types of recording modalities and the complexity of neural time-series data collected from them. The brain is a highly recurrent system producing rich, complex dynamics that result in different behaviors. Correctly distinguishing such nonlinear neural time series in real time, especially those with non-obvious links to behavior, could be useful for a wide variety of applications. These include detecting anomalous clinical events such as seizures in epilepsy, and identifying optimal control spaces for brain-machine interfaces. It remains challenging to correctly distinguish nonlinear time-series patterns because of the high intrinsic dimensionality of such data, making accurate inference of state changes (for intervention or control) difficult. Simple distance metrics, which can be computed quickly, do not yield accurate classifications. On the other end of the spectrum of classification methods, ensembles of classifiers or deep supervised tools offer higher accuracy but are slow, data-intensive, and computationally expensive. We introduce a reservoir-based tool, state tracker (TRAKR), which offers the high accuracy of ensembles or deep supervised methods while preserving the computational benefits of simple distance metrics. After one-shot training, TRAKR can accurately, and in real time, detect deviations in test patterns. By forcing the weighted dynamics of the reservoir to fit a desired pattern directly, we avoid many rounds of expensive optimization. Then, keeping the output weights frozen, we use the error signal generated by the reservoir in response to a particular test pattern as a classification boundary. We show that, using this approach, TRAKR accurately detects changes in synthetic time series. We then compare our tool to several others, showing that it achieves the highest classification performance on a benchmark dataset, sequential MNIST, even when corrupted by noise.
Additionally, we apply TRAKR to electrocorticography (ECoG) data from the macaque orbitofrontal cortex (OFC), a higher-order brain region involved in encoding the value of expected outcomes. We show that TRAKR can classify different behaviorally relevant epochs in the neural time series more accurately and efficiently than conventional approaches. Therefore, TRAKR can be used as a fast and accurate tool to distinguish patterns in complex nonlinear time-series data, such as neural recordings.
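The mechanics described in this abstract can be sketched with a minimal echo-state reservoir in plain numpy. This is an illustrative stand-in, not the published TRAKR implementation: TRAKR fits the reservoir's weighted dynamics to the pattern directly, whereas this sketch uses a one-shot ridge-regression readout, and all names and parameters below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_reservoir(n=100, spectral_radius=0.9):
    # random recurrent weights, rescaled for stable (echo-state) dynamics
    W = rng.standard_normal((n, n))
    W *= spectral_radius / np.max(np.abs(np.linalg.eigvals(W)))
    w_in = rng.standard_normal(n)
    return W, w_in

def run_reservoir(W, w_in, signal):
    # drive the reservoir with a 1-D signal and record its state trajectory
    x = np.zeros(W.shape[0])
    states = []
    for u in signal:
        x = np.tanh(W @ x + w_in * u)
        states.append(x.copy())
    return np.array(states)

def fit_readout(states, target, ridge=1e-3):
    # one-shot fit: ridge regression of reservoir states onto the pattern
    n = states.shape[1]
    return np.linalg.solve(states.T @ states + ridge * np.eye(n), states.T @ target)

def pattern_error(W, w_in, w_out, signal):
    # readout stays frozen; the reconstruction error is the classification signal
    states = run_reservoir(W, w_in, signal)
    return float(np.mean((states @ w_out - signal) ** 2))

t = np.linspace(0, 4 * np.pi, 200)
reference = np.sin(t)
W, w_in = make_reservoir()
w_out = fit_readout(run_reservoir(W, w_in, reference), reference)

err_same = pattern_error(W, w_in, w_out, np.sin(t))           # matches training pattern
err_diff = pattern_error(W, w_in, w_out, np.sign(np.sin(t)))  # square wave: a deviation
```

Because the readout stays frozen, scoring a new pattern costs only a single forward pass through the reservoir, which is what makes the error signal cheap enough to use in real time.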


Author(s):  
Zinaida Seleznyova

Researchers have been improving credit scoring models for decades, as an increase in the predictive ability of scoring even by a small amount can allow financial institutions to avoid significant losses. Many researchers believe that ensembles of classifiers or aggregated scorings are the most effective. However, ensembles outperform base classifiers by thousandths of a percent on unbalanced samples. This article proposes an aggregated scoring model. In contrast to previous models, its base classifiers are focused on identifying different types of borrowers. We illustrate the effectiveness of such scoring aggregation on real unbalanced data. As the effectiveness indicator, we use the area under the ROC curve (AUC). The DeLong, DeLong, and Clarke-Pearson test is used to assess the statistical significance of the difference between two or more areas. In addition, we apply a logistic model of defaults (logistic regression) to company financial statement data. This model is usually used to identify default borrowers. To obtain a scoring aimed at non-default borrowers, we employ a modified Kemeny median, which was initially developed to rank companies with credit ratings. Both scores are aggregated by logistic regression. Our data comprise Russian banks that existed or defaulted between July 1, 2010, and July 1, 2015. This sample of banks is highly unbalanced, with a concentration of defaults of about 5%. The aggregation was carried out for banks with several ratings. We show that aggregated classifiers based on different types of information significantly improve the discriminatory power of scoring even on an unbalanced sample. Moreover, the absolute value of this improvement surpasses all the values previously obtained from unbalanced samples.
The aggregated scoring and the approach to its construction can be applied by financial institutions to credit risk assessment and as an auxiliary tool in the decision-making process thanks to the relatively high interpretability of the scores.
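A minimal numpy sketch of the aggregation step described above, assuming synthetic stand-ins for the two base scores (the modified Kemeny median itself is not reproduced here). The roughly 5% default rate mirrors the sample described in the abstract; all names and effect sizes are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

def auc(y, s):
    # area under the ROC curve via the Mann-Whitney statistic
    pos, neg = s[y == 1], s[y == 0]
    diff = pos[:, None] - neg[None, :]
    return float(np.mean(diff > 0) + 0.5 * np.mean(diff == 0))

def fit_logistic(X, y, lr=0.5, steps=3000):
    # plain gradient descent on the logistic loss (intercept included in X)
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-X @ w))
        w -= lr * X.T @ (p - y) / len(y)
    return w

# Unbalanced synthetic sample: ~5% defaults, echoing the banking data above
n = 2000
y = (rng.random(n) < 0.05).astype(float)

# Two base scores aimed at different borrower types (hypothetical stand-ins)
score_default = y * 1.0 + rng.standard_normal(n)     # better at spotting defaulters
score_nondefault = y * 0.7 + rng.standard_normal(n)  # weaker, complementary signal

# Aggregate the base scores with logistic regression
X = np.column_stack([np.ones(n), score_default, score_nondefault])
w = fit_logistic(X, y)
score_agg = X @ w  # monotone in the predicted default probability

auc_a = auc(y, score_default)
auc_b = auc(y, score_nondefault)
auc_agg = auc(y, score_agg)
```

On this synthetic sample the aggregated score dominates each base score in AUC, which is the qualitative effect the article reports; the DeLong, DeLong, and Clarke-Pearson significance test is not reimplemented here.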


Author(s):  
V. A. Golov ◽  
D. A. Petrusevich

In this paper, the Sloan Digital Sky Survey DR14 dataset is investigated. It contains statistical information about many astronomical objects, obtained within the framework of the Sloan Digital Sky Survey project. The project's telescopes are located on the Earth's surface, in Earth orbit, and at the Lagrange points of certain systems (Earth–Moon, Sun–Earth), and they collect data in different frequency ranges. The large quantity of statistical information creates a demand for analytical algorithms and systems capable of performing classification. This information is labelled well enough to build machine learning classification systems. The paper presents the results of a number of classifiers. The data contain measurements of the three types of astronomical objects in the Sloan Digital Sky Survey DR14 dataset (stars, quasars, galaxies). The CART decision tree, logistic regression, and naïve Bayes classifiers, as well as ensembles of classifiers (random forest, gradient boosting), were implemented. Conclusions about the specific features of each machine learning classifier trained to solve this task are drawn at the end of the paper. In some cases, a classifier's structure can be explained physically. The accuracy of the classifiers built in this research is more than 90% (the F1, precision, and recall metrics are used because the classes are unbalanced). Given these values, the classification task can be considered successfully solved. At the same time, the structure of the classifiers and the importance of features can serve as a physical explanation of the solution.
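Because the three object classes are unbalanced, per-class precision, recall, and F1 are the appropriate metrics, as the abstract notes. A short sketch of how they are computed from predictions; the label counts below are hypothetical, not the actual SDSS DR14 results.

```python
import numpy as np

def per_class_metrics(y_true, y_pred, classes):
    # precision, recall and F1 per class, suitable for unbalanced data
    out = {}
    for c in classes:
        tp = np.sum((y_pred == c) & (y_true == c))
        fp = np.sum((y_pred == c) & (y_true != c))
        fn = np.sum((y_pred != c) & (y_true == c))
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        out[c] = (precision, recall, f1)
    return out

# Hypothetical predictions over the three SDSS object types
y_true = np.array(["galaxy"] * 50 + ["star"] * 40 + ["quasar"] * 10)
y_pred = y_true.copy()
y_pred[:5] = "star"  # a few galaxies misclassified as stars
metrics = per_class_metrics(y_true, y_pred, ["galaxy", "star", "quasar"])
```

Reporting the three metrics per class, rather than a single accuracy, keeps the small quasar class from being swamped by the majority classes.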


2021 ◽  
Vol 11 (13) ◽  
pp. 5796
Author(s):  
Loris Nanni ◽  
Gianluca Maguolo ◽  
Sheryl Brahnam ◽  
Michelangelo Paci

Research in sound classification and recognition is rapidly advancing in the field of pattern recognition. One important area in this field is environmental sound recognition, whether it concerns the identification of endangered species in different habitats or the type of interfering noise in urban environments. Since environmental audio datasets are often limited in size, a robust model able to perform well across different datasets is of strong research interest. In this paper, we combine ensembles of classifiers that exploit six data augmentation techniques and four signal representations for retraining five pre-trained convolutional neural networks (CNNs); these ensembles are tested on three freely available environmental audio benchmark datasets: (i) bird calls, (ii) cat sounds, and (iii) the Environmental Sound Classification (ESC-50) database for identifying sources of noise in environments. To the best of our knowledge, this is the most extensive study investigating ensembles of CNNs for audio classification. The best-performing ensembles are compared and shown to either outperform or perform comparably to the best methods reported in the literature on these datasets, including on the challenging ESC-50 dataset. We obtained 97% accuracy on the bird dataset, 90.51% on the cat dataset, and 88.65% on ESC-50 using different approaches. In addition, the same ensemble model trained on the three datasets managed to reach the same results on the bird and cat datasets while losing only 0.1% on ESC-50. Thus, we have managed to create an off-the-shelf ensemble that can be trained on different datasets and reach performance competitive with the state of the art.
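One common way to combine such ensemble members is sum-rule fusion: average the class probabilities produced by the individual CNNs and take the argmax. The sketch below uses hypothetical softmax outputs; the paper's actual fusion rule and scores are not reproduced here.

```python
import numpy as np

def fuse(prob_list):
    # sum-rule fusion: average the class probabilities of the ensemble members
    return np.mean(prob_list, axis=0)

# Hypothetical softmax outputs of three CNNs (e.g. retrained with different
# augmentations / signal representations) for two clips over three classes
p1 = np.array([[0.6, 0.3, 0.1],
               [0.2, 0.5, 0.3]])
p2 = np.array([[0.5, 0.4, 0.1],
               [0.1, 0.3, 0.6]])
p3 = np.array([[0.7, 0.2, 0.1],
               [0.3, 0.3, 0.4]])

fused = fuse([p1, p2, p3])
preds = fused.argmax(axis=1)  # one predicted class per clip
```

For the second clip the members disagree, but the averaged probabilities settle on the third class; this smoothing over individual members' errors is what the fusion step buys.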


2021 ◽  
Vol 2021 ◽  
pp. 1-28
Author(s):  
Khalid M. Al-Gethami ◽  
Mousa T. Al-Akhras ◽  
Mohammed Alawairdhi

Optimizing the detection of intrusions is becoming more crucial due to the continuously rising rates and ferocity of cyber threats and attacks. One of the popular methods to optimize the accuracy of intrusion detection systems (IDSs) is to employ machine learning (ML) techniques. However, many factors affect the accuracy of ML-based IDSs. One of these factors is noise, which can take the form of mislabelled instances, outliers, or extreme values. Determining the extent of the effect of noise helps in designing and building more robust ML-based IDSs. This paper empirically examines the extent of the effect of noise on the accuracy of ML-based IDSs by conducting a wide set of different experiments. The ML algorithms used are decision tree (DT), random forest (RF), support vector machine (SVM), artificial neural networks (ANNs), and naïve Bayes (NB). In addition, the experiments are conducted on two widely used intrusion datasets, NSL-KDD and UNSW-NB15. Moreover, the paper also investigates the use of these ML algorithms as base classifiers with two ensemble learning methods: bagging and boosting. The detailed results and findings are illustrated and discussed in this paper.
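A minimal sketch of one noise type studied in this kind of experiment, mislabelled instances, injected at a controlled rate into binary intrusion labels. The labels and rate below are hypothetical stand-ins, not the NSL-KDD or UNSW-NB15 protocols.

```python
import numpy as np

rng = np.random.default_rng(7)

def flip_labels(y, rate, rng):
    # simulate mislabelled instances: flip a fixed fraction of binary labels
    y_noisy = y.copy()
    idx = rng.choice(len(y), size=int(rate * len(y)), replace=False)
    y_noisy[idx] = 1 - y_noisy[idx]
    return y_noisy

# Hypothetical binary intrusion labels (1 = attack, 0 = normal)
y = (np.arange(1000) % 5 == 0).astype(int)  # 20% attacks
y_noisy = flip_labels(y, 0.10, rng)
noise_level = float(np.mean(y_noisy != y))
```

Sweeping `rate` and retraining each classifier (or each bagging/boosting ensemble) on the noisy labels is the basic shape of the robustness experiments the abstract describes.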


Measurement ◽  
2021 ◽  
Vol 168 ◽  
pp. 108328
Author(s):  
Jose F. Diez-Pastor ◽  
Alain Gil Del Val ◽  
Fernando Veiga ◽  
Andres Bustillo
