On the optimistic performance evaluation of newly introduced bioinformatic methods

Author(s):  
Stefan Buchka ◽  
Alexander Hapfelmeier ◽  
Paul P. Gardner ◽  
Rory Wilson ◽  
Anne-Laure Boulesteix

Most research articles presenting new data analysis methods claim that “the new method performs better than existing methods”, but the veracity of such statements is questionable. Our manuscript discusses and illustrates consequences of the optimistic bias occurring during the evaluation of novel data analysis methods, that is, all biases resulting from, for example, selection of datasets or competing methods; better ability to fix bugs in a preferred method; and selective reporting of method variants. We quantitatively investigate this bias using a topical example from epigenetic analysis: normalization methods for data generated by the Illumina HumanMethylation450K BeadChip microarray.

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Stefan Buchka ◽  
Alexander Hapfelmeier ◽  
Paul P. Gardner ◽  
Rory Wilson ◽  
Anne-Laure Boulesteix

Abstract. Most research articles presenting new data analysis methods claim that “the new method performs better than existing methods,” but the veracity of such statements is questionable. Our manuscript discusses and illustrates consequences of the optimistic bias occurring during the evaluation of novel data analysis methods, that is, all biases resulting from, for example, selection of datasets or competing methods, better ability to fix bugs in a preferred method, and selective reporting of method variants. We quantitatively investigate this bias using an example from epigenetic analysis: normalization methods for data generated by the Illumina HumanMethylation450K BeadChip microarray.


2011 ◽  
Vol 29 (3) ◽  
pp. 467-491 ◽  
Author(s):  
H. Vanhamäki ◽  
O. Amm

Abstract. We present a review of selected data-analysis methods that are frequently applied in studies of ionospheric electrodynamics and magnetosphere-ionosphere coupling using ground-based and space-based data sets. Our focus is on methods that are data driven (not simulations or statistical models) and can be used in mesoscale studies, where the analysis area is typically some hundreds or thousands of km across. The selection of reviewed methods is such that most combinations of measured input data (electric field, conductances, magnetic field and currents) that occur in practical applications are covered. The techniques are used to solve the unmeasured parameters from Ohm's law and Maxwell's equations, possibly with help of some simplifying assumptions. In addition to reviewing existing data-analysis methods, we also briefly discuss possible extensions that may be used for upcoming data sets.


2017 ◽  
Vol 9 (33) ◽  
pp. 4783-4789 ◽  
Author(s):  
Samuel Mabbott ◽  
Yun Xu ◽  
Royston Goodacre

Reproducibility of SERS signal acquired from thin films developed in-house and commercially has been assessed using seven data analysis methods.

