ranking aggregation
Recently Published Documents


TOTAL DOCUMENTS

34
(FIVE YEARS 12)

H-INDEX

8
(FIVE YEARS 1)

2022 ◽  
Author(s):  
Bo Wang ◽  
Andy Law ◽  
Tim Regan ◽  
Nicholas Parkinson ◽  
Joby Cole ◽  
...  

A common experimental output in biomedical science is a list of genes implicated in a given biological process or disease. The results of a group of studies answering the same, or similar, questions can be combined by meta-analysis to find a consensus or a more reliable answer. Ranking aggregation methods can be used to combine gene lists from various sources in meta-analyses. Evaluating a ranking aggregation method on a specific type of dataset before using it is required to support the reliability of the result since the property of a dataset can influence the performance of an algorithm. Evaluation of aggregation methods is usually based on a simulated database especially for the algorithms designed for gene lists because of the lack of a known truth for real data. However, simulated datasets tend to be too small compared to experimental data and neglect key features, including heterogeneity of quality, relevance and the inclusion of unranked lists. In this study, a group of existing methods and their variations which are suitable for meta-analysis of gene lists are compared using simulated and real data. Simulated data was used to explore the performance of the aggregation methods as a function of emulating the common scenarios of real genomics data, with various heterogeneity of quality, noise level, and a mix of unranked and ranked data using 20000 possible entities. In addition to the evaluation with simulated data, a comparison using real genomic data on the SARS-CoV-2 virus, cancer (NSCLC), and bacteria (macrophage apoptosis) was performed. We summarise our evaluation results in terms of a simple flowchart to select a ranking aggregation method for genomics data.


2021 ◽  
Vol 12 (3) ◽  
Author(s):  
Camila R. Lopes ◽  
Lúcio F. D. Santos ◽  
Daniel L. Jasbick ◽  
Daniel De Oliveira ◽  
Marcos Bedo

A diversified similarity search retrieves elements that are simultaneously similar to a query object and akin to the different collections within the explored data. While several methods in information retrieval, data clustering, and similarity searching have tackled the problem of adding diversity into result sets, the experimental comparison of their performances is still an open issue mainly because the quality metrics are “borrowed” from those different research areas, bringing their biases alongside. In this manuscript, we investigate a series of such metrics and experimentally discuss their trends and limitations. We conclude diversity is better addressed by a set of measures rather than a single quality index and introduce the concept of Diversity Features Model (DFM), which combines the viewpoints of biased metrics into a multidimensional representation. Experimental evaluations indicate (i) DFM enables comparing different result diversification algorithms by considering multiple criteria, and (ii) the most suitable searching methods for a particular dataset are spotted by combining DFM with ranking aggregation and parallel coordinates maps.


2021 ◽  
Author(s):  
Yeawon Yoo ◽  
Adolfo R. Escobedo

Rank aggregation is widely used in group decision making and many other applications, where it is of interest to consolidate heterogeneous ordered lists. Oftentimes, these rankings may involve a large number of alternatives, contain ties, and/or be incomplete, all of which complicate the use of robust aggregation methods. In particular, these characteristics have limited the applicability of the aggregation framework based on the Kemeny-Snell distance, which satisfies key social choice properties that have been shown to engender improved decisions. This work introduces a binary programming formulation for the generalized Kemeny rank aggregation problem—whose ranking inputs may be complete and incomplete, with and without ties. Moreover, it leverages the equivalence of two ranking aggregation problems, namely, that of minimizing the Kemeny-Snell distance and of maximizing the Kendall-τ correlation, to compare the newly introduced binary programming formulation to a modified version of an existing integer programming formulation associated with the Kendall-τ distance. The new formulation has fewer variables and constraints, which leads to faster solution times. Moreover, we develop a new social choice property, the nonstrict extended Condorcet criterion, which can be regarded as a natural extension of the well-known Condorcet criterion and the Extended Condorcet criterion. Unlike its parent properties, the new property is adequate for handling complete rankings with ties. The property is leveraged to develop a structural decomposition algorithm, through which certain large instances of the NP-hard Kemeny rank aggregation problem can be solved exactly in a practical amount of time. To test the practical implications of the new formulation and social choice property, we work with instances constructed from a probabilistic distribution and with benchmark instances from PrefLib, a library of preference data.


2021 ◽  
Author(s):  
Yijin Zhang ◽  
Jie Huang ◽  
Zongbing Lin

Abstract For emergencies, the reliability of information can not be guaranteed. At the same time, due to the lack of information and knowledge, neither the criteria itself nor the credibility can be given a precise evaluation by decision-makers(DMs). Therefore, we combine intuitionistic fuzzy set and Z-number to get a new class of fuzzy set, complete Z-intuitionistic fuzzy set(CZIFS), and its degenerate form, A-type Z-intuitionistic fuzzy set(AZIFS) and B-type Z-intuitionistic fuzzy set(BZIFS). CZIFS can serve as a reliable tool to depict the hesitant degree both on the ambiguity and reliability of uncertain information. In addition, we introduce the score and accuracy functions and distance measure of complete Z-intuitionistic fuzzy number(CZIFN), with which we have considered both reliability information and DMs' preference on it. Then, we improve traditional MULTIMOORA by developing reference point(RP) model to consider both the risk and profile of alternatives and integrating analytic hierarchy process(AHP) in the process of ranking aggregation method to take into account the preference of DMs on three subordinate rankings. Besides, to solve multicriteria group decision making(MCGDM) problem, we develop improved MULTIMOORA method to the environment of CZIFN. Finally, to illustrate the proposed method, we give a numerical example, solving site selecting of Fangcang shelter hospital for COVID-19.


Mathematics ◽  
2021 ◽  
Vol 9 (12) ◽  
pp. 1380
Author(s):  
Noelia Rico ◽  
Camino R. Vela ◽  
Raúl Pérez-Fernández ◽  
Irene Díaz

Preference aggregation and in particular ranking aggregation are mainly studied by the field of social choice theory but extensively applied in a variety of contexts. Among the most prominent methods for ranking aggregation, the Kemeny method has been proved to be the only one that satisfies some desirable properties such as neutrality, consistency and the Condorcet condition at the same time. Unfortunately, the problem of finding a Kemeny ranking is NP-hard, which prevents practitioners from using it in real-life problems. The state of the art of exact algorithms for the computation of the Kemeny ranking experienced a major boost last year with the presentation of an algorithm that provides searching time guarantee up to 13 alternatives. In this work, we propose an enhanced version of this algorithm based on pruning the search space when some Condorcet properties hold. This enhanced version greatly improves the performance in terms of runtime consumption.


2021 ◽  
Author(s):  
Noelia Rico ◽  
Camino R. Vela ◽  
Raúl Pérez-Fernández ◽  
Irene Díaz

2020 ◽  
Author(s):  
Camila L. Lopes ◽  
Daniel L. Jasbick ◽  
Marcos Bedo ◽  
Lúcio F.D. Santos

Diversity-oriented searches retrieve objects not only similar to a reference element but also related to the different types of collections within the queried dataset. While such characterization is flexible enough to include methods originally from information retrieval, data clustering, and similarity searching under the same umbrella, diversity metrics are expected to be much less paradigm-biased in order to discriminate which approaches are more suitable and when they should be applied. Accordingly, we extend and implement a broad set of quality metrics from those distinct realms and experimentally discuss their trends and limitations. In particular, we evaluate the suitability of data clustering indexes, and similarity-driven measures regarding their adherence to diversified similarity searching. Experiments in real-world datasets indicate such measures are capable of distinguishing diversity methods from different paradigms, but they heavily favor the approaches of the same group – especially cluster indexes. As an alternative, we argue diversity is better addressed by a set of measures rather than a single quality value. Therefore, we propose the Diversity Features Model (DFM) that combines the perspectives of the competing approaches into a multidimensional point whose features are calculated based on the distance distribution within both retrieved and queried datasets. Empirical evaluations showed DFM compares different diversity searching approaches by considering multiple criteria, whereas overall winners can be found by ranking aggregation or visualized through parallel coordinates maps.


Sign in / Sign up

Export Citation Format

Share Document