Noise tolerance of Multiple Classifier Systems in data integration-based gene function prediction

Summary The availability of various high-throughput experimental and computational methods developed in the last decade has allowed molecular biologists to investigate the functions of genes at the system level, opening unprecedented research opportunities. Although the automated prediction of gene functions is among the most difficult problems in bioinformatics, several recently published works have shown that consistent improvements in prediction performance can be obtained by integrating heterogeneous data sources. Nevertheless, very few works have investigated the impact of noisy data on the prediction performance achievable with data integration approaches. In this contribution we investigated the tolerance of multiple classifier systems (MCS) to noisy data in gene function prediction experiments based on data integration methods. The experimental results show that the performance of MCS does not decay significantly when noisy data sets are added. In addition, we show that in this task MCS are competitive with kernel fusion, one of the most widely applied techniques for data integration in gene function prediction problems.

Multi-sensor fusion based on multiple classifier systems for human activity identification

Human-centric Computing and Information Sciences ◽

10.1186/s13673-019-0194-5 ◽

2019 ◽

Vol 9 (1) ◽

Cited By ~ 7

Author(s):

Henry Friday Nweke ◽

Ying Wah Teh ◽

Ghulam Mujtaba ◽

Uzoma Rita Alo ◽

Mohammed Ali Al-garadi

Keyword(s):

Feature Selection ◽

Sensor Fusion ◽

Human Activity ◽

Activity Detection ◽

Multiple Classifier Systems ◽

Classifier Systems ◽

Multiple Classifier ◽

Performance Results ◽

Human Activity Detection ◽

The Impact

Abstract Multimodal sensors in healthcare applications have been increasingly researched because they facilitate automatic and comprehensive monitoring of human behaviors, high-intensity sports management, energy expenditure estimation, and postural detection. Recent studies have shown the importance of multi-sensor fusion for achieving robustness and high-performance generalization, providing diversity, and tackling challenging issues that may be difficult to address with single-sensor values. The aim of this study is to propose an innovative multi-sensor fusion framework that improves human activity detection performance and reduces the misrecognition rate. The study proposes a multi-view ensemble algorithm to integrate the predicted values of different motion sensors. To this end, computationally efficient classification algorithms such as decision tree, logistic regression and k-Nearest Neighbors were used to implement diverse, flexible and dynamic human activity detection systems. To provide a compact feature vector representation, we studied a hybrid bio-inspired evolutionary search algorithm and a correlation-based feature selection method and evaluated their impact on the feature vectors extracted from each sensor modality. Furthermore, we utilized the Synthetic Minority Over-sampling Technique (SMOTE) algorithm to reduce the impact of class imbalance and improve performance results. With the above methods, this paper provides a unified framework to resolve major challenges in human activity identification. The performance results obtained using two publicly available datasets showed significant improvement over baseline methods in the detection of specific activity details and a reduced error rate. The performance results of our evaluation showed a 3% to 24% improvement in accuracy, recall, precision, F-measure and detection ability (AUC) compared to single sensors and feature-level fusion.
The benefit of the proposed multi-sensor fusion is the ability to utilize the distinct feature characteristics of individual sensors and multiple classifier systems to improve recognition accuracy. In addition, the study suggests a promising potential for the hybrid feature selection approach and diversity-based multiple classifier systems to improve mobile and wearable sensor-based human activity detection and health monitoring systems.
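
The abstract above describes integrating the predicted values of different motion sensors. As a minimal sketch of one common way to do this, the Python below fuses per-sensor labels by plurality vote. The voting rule, the tie-breaking rule, the function names and the sensor names are all assumptions made for illustration; the abstract does not specify the combination rule used in the paper.

```python
from collections import Counter


def majority_vote(predictions):
    """Fuse the labels predicted for one time window by plurality vote.

    Ties go to the tied label that appears first in `predictions`
    (an arbitrary rule chosen for this sketch).
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def fuse_windows(per_sensor_predictions):
    """Fuse predictions window by window.

    per_sensor_predictions maps a sensor name to a list of labels,
    one label per time window; all lists are assumed equally long.
    """
    sequences = list(per_sensor_predictions.values())
    return [majority_vote(list(labels)) for labels in zip(*sequences)]


# Hypothetical outputs of three per-sensor classifiers over three windows.
example = {
    "accelerometer": ["walk", "walk", "sit"],
    "gyroscope": ["walk", "run", "sit"],
    "magnetometer": ["run", "run", "stand"],
}
fused = fuse_windows(example)
print(fused)
```

In this sketch each sensor contributes one vote per window, so a single sensor that misclassifies a window can be outvoted by the others.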

Tackling the problem of classification with noisy data using Multiple Classifier Systems: Analysis of the performance and robustness

Information Sciences ◽

10.1016/j.ins.2013.06.002 ◽

2013 ◽

Vol 247 ◽

pp. 1-20 ◽

Cited By ~ 45

Author(s):

José A. Sáez ◽

Mikel Galar ◽

Julián Luengo ◽

Francisco Herrera

Keyword(s):

Systems Analysis ◽

Noisy Data ◽

Multiple Classifier Systems ◽

Classifier Systems ◽

Multiple Classifier

Faculty Opinions recommendation of The art of gene function prediction.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1056759.508687 ◽

2006 ◽

Author(s):

Martin Noble

Keyword(s):

Gene Function ◽

Function Prediction ◽

Gene Function Prediction

Faculty Opinions recommendation of Network-Based Gene Function Prediction in Mouse and Other Model Vertebrates Using MouseNet Server.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.727562216.793535246 ◽

2017 ◽

Author(s):

John Hancock

Keyword(s):

Gene Function ◽

Function Prediction ◽

Gene Function Prediction

Selection of classifiers for the construction of multiple classifier systems

Eighth International Conference on Document Analysis and Recognition (ICDAR'05) ◽

10.1109/icdar.2005.213 ◽

2005 ◽

Cited By ~ 1

Author(s):

Hee-Joong Kang ◽

D. Doermann

Keyword(s):

Multiple Classifier Systems ◽

Classifier Systems ◽

Multiple Classifier ◽

Selection Of

Graph-based sequence annotation using a data integration approach

Journal of Integrative Bioinformatics ◽

10.1515/jib-2008-94 ◽

2008 ◽

Vol 5 (2) ◽

Cited By ~ 1

Author(s):

Robert Pesch ◽

Artem Lysenko ◽

Matthew Hindle ◽

Keywan Hassani-Pak ◽

Ralf Thiele ◽

...

Keyword(s):

Data Integration ◽

Gene Function ◽

High Throughput Sequencing ◽

Prediction Method ◽

Function Prediction ◽

Function Annotation ◽

Automated Annotation ◽

Data Integration System ◽

Reference Databases ◽

Function Assignment

Summary The automated annotation of data from high throughput sequencing and genomics experiments is a significant challenge for bioinformatics. Most current approaches rely on sequential pipelines of gene finding and gene function prediction methods that annotate a gene with information from different reference data sources. Each function prediction method contributes evidence supporting a functional assignment. Such approaches generally ignore the links between the information in the reference datasets. These links, however, are valuable for assessing the plausibility of a function assignment and can be used to evaluate the confidence in a prediction. We are working towards a novel annotation system that uses the network of information supporting the function assignment to enrich the annotation process for use by expert curators and predicting the function of previously unannotated genes. In this paper we describe our success in the first stages of this development. We present the data integration steps that are needed to create the core database of integrated reference databases (UniProt, PFAM, PDB, GO and the pathway database AraCyc) which has been established in the ONDEX data integration system. We also present a comparison between different methods for integration of GO terms as part of the function assignment pipeline and discuss the consequences of this analysis for improving the accuracy of gene function annotation. The methods and algorithms presented in this publication are an integral part of the ONDEX system which is freely available from http://ondex.sf.net/.

HANDWRITTEN WORD RECOGNITION USING CLASSIFIER ENSEMBLES GENERATED FROM MULTIPLE PROTOTYPES

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001404003496 ◽

2004 ◽

Vol 18 (05) ◽

pp. 957-974 ◽

Cited By ~ 4

Author(s):

SIMON GÜNTER ◽

HORST BUNKE

Keyword(s):

Handwriting Recognition ◽

Recognition Rate ◽

Classifier Ensembles ◽

Multiple Classifier Systems ◽

Classification Problems ◽

Classifier Systems ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Multiple Classifier ◽

System 1

Handwritten text recognition is one of the most difficult problems in the field of pattern recognition. In this paper, we describe our efforts towards improving the performance of state-of-the-art handwriting recognition systems through the use of classifier ensembles. There are many examples of classification problems in the literature where multiple classifier systems increase the performance over single classifiers. Normally one of the two following approaches is used to create a multiple classifier system. (1) Several classifiers are developed completely independent of each other and combined in a last step. (2) Several classifiers are created out of one prototype classifier by using so-called classifier ensemble creation methods. In this paper an algorithm which combines both approaches is introduced and it is used to increase the recognition rate of a hidden Markov model (HMM) based handwritten word recognizer.

Gene Function Prediction from Functional Association Networks Using Kernel Partial Least Squares Regression

PLoS ONE ◽

10.1371/journal.pone.0134668 ◽

2015 ◽

Vol 10 (8) ◽

pp. e0134668 ◽

Cited By ~ 12

Author(s):

Sonja Lehtinen ◽

Jon Lees ◽

Jürg Bähler ◽

John Shawe-Taylor ◽

Christine Orengo

Keyword(s):

Least Squares ◽

Partial Least Squares ◽

Gene Function ◽

Partial Least Squares Regression ◽

Function Prediction ◽

Least Squares Regression ◽

Gene Function Prediction ◽

Functional Association ◽

Kernel Partial Least Squares

INTEGRATION OF n-GRAM LANGUAGE MODELS IN MULTIPLE CLASSIFIER SYSTEMS FOR OFFLINE HANDWRITTEN TEXT LINE RECOGNITION

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001408006855 ◽

2008 ◽

Vol 22 (07) ◽

pp. 1301-1321 ◽

Cited By ~ 2

Author(s):

ROMAN BERTOLAMI ◽

HORST BUNKE

Keyword(s):

Language Model ◽

Language Models ◽

Combination Method ◽

Text Line ◽

Multiple Classifier Systems ◽

Classifier Systems ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Multiple Classifier ◽

N Gram

Current multiple classifier systems for unconstrained handwritten text recognition do not provide a straightforward way to utilize language model information. In this paper, we describe a generic method to integrate a statistical n-gram language model into the combination of multiple offline handwritten text line recognizers. The proposed method first builds a word transition network and then rescores this network with an n-gram language model. Experimental evaluation conducted on a large dataset of offline handwritten text lines shows that the proposed approach improves the recognition accuracy over a reference system as well as over the original combination method that does not include a language model.
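
The rescoring step described in the abstract above can be illustrated with a small sketch. The Python below runs a Viterbi search over a simplified word transition network, combining recognizer scores with a bigram language model. The network layout (one dictionary of candidate words per position), the `lm_weight` and `unseen_logp` parameters, and all scores are hypothetical; the paper's actual network construction and scoring are not given in the abstract.

```python
import math


def rescore(network, bigram_logp, lm_weight=1.0, unseen_logp=-10.0):
    """Return the best word sequence through a word transition network.

    network: list of slots; each slot maps a candidate word to its
        recognizer log score.
    bigram_logp: maps (previous word, word) to a log probability;
        "<s>" marks the start of the line.
    Unseen bigrams get the fixed penalty `unseen_logp` (an assumption).
    """
    # best[word] = (total score, best path ending in that word)
    best = {"<s>": (0.0, [])}
    for slot in network:
        new_best = {}
        for word, rec_score in slot.items():
            candidates = []
            for prev, (score, path) in best.items():
                lm = bigram_logp.get((prev, word), unseen_logp)
                total = score + rec_score + lm_weight * lm
                candidates.append((total, path + [word]))
            new_best[word] = max(candidates, key=lambda c: c[0])
        best = new_best
    return max(best.values(), key=lambda c: c[0])[1]


# Hypothetical two-slot network and bigram model.
network = [
    {"the": math.log(0.9), "he": math.log(0.1)},
    {"cat": math.log(0.4), "cut": math.log(0.6)},
]
bigrams = {
    ("<s>", "the"): math.log(0.5),
    ("<s>", "he"): math.log(0.2),
    ("the", "cat"): math.log(0.3),
    ("the", "cut"): math.log(0.01),
    ("he", "cut"): math.log(0.05),
}
with_lm = rescore(network, bigrams)
without_lm = rescore(network, bigrams, lm_weight=0.0)
print(with_lm, without_lm)
```

With `lm_weight=0.0` the search follows the recognizer scores alone and picks "the cut"; with the bigram model switched on, the low probability of "the cut" moves the result to "the cat".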

Evaluation of the information-theoretic construction of multiple classifier systems

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings. ◽

10.1109/icdar.2003.1227770 ◽

2005 ◽

Cited By ~ 2

Author(s):

Hee-Joong Kang ◽

D. Doermann

Keyword(s):

Multiple Classifier Systems ◽

Classifier Systems ◽

Information Theoretic ◽

Multiple Classifier
