rank aggregation
Recently Published Documents


TOTAL DOCUMENTS

334
(FIVE YEARS 118)

H-INDEX

26
(FIVE YEARS 4)

Author(s):  
Xi Chen ◽  
Yunxiao Chen ◽  
Xiaoou Li

A sequential design problem for rank aggregation is commonly encountered in psychology, politics, marketing, sports, etc. In this problem, a decision maker is responsible for ranking K items by sequentially collecting noisy pairwise comparisons from judges. The decision maker needs to choose a pair of items for comparison in each step, decide when to stop data collection, and make a final decision after stopping based on a sequential flow of information. Because of the complex ranking structure, existing sequential analysis methods are not suitable. In this paper, we formulate the problem under a Bayesian decision framework and propose sequential procedures that are asymptotically optimal. These procedures achieve asymptotic optimality by seeking a balance between exploration (i.e., finding the most indistinguishable pair of items) and exploitation (i.e., comparing the most indistinguishable pair based on the current information). New analytical tools are developed for proving the asymptotic results, combining advanced change of measure techniques for handling the level crossing of likelihood ratios and classic large deviation results for martingales, which are of separate theoretical interest in solving complex sequential design problems. A mirror-descent algorithm is developed for the computation of the proposed sequential procedures.


2022 ◽  
Author(s):  
Zhigang Wang ◽  
Ye Deng ◽  
Petter Holme ◽  
Zengru Di ◽  
Linyuan Lu ◽  
...  

Abstract We live in a hyperconnected world---connectivity that can sometimes be detrimental. Finding an optimal subset of nodes or links to disintegrate harmful networks is a fundamental problem in network science, with potential applications to anti-terrorism, epidemic control, and many other fields of study. The challenge of the network disintegration problem is to balance the effectiveness and efficiency of strategies. In this paper, we propose a cost-effective targeted enumeration method for network disintegration. The proposed approach includes two stages: searching candidate objects and identifying an optimal solution. In the first stage, we use rank aggregation to generate a comprehensive node importance ranking, upon which we identify a small-scale candidate set of nodes to remove. In the second stage, we use an enumeration method to find an optimal combination among the candidate nodes. Extensive experimental results on synthetic and real-world networks demonstrate that the proposed method achieves a satisfying trade-off between effectiveness and efficiency. The introduced two-stage targeted enumeration framework can also be applied to other computationally intractable combinational optimization problems, from team assembly, via portfolio investment, to drug design.


2021 ◽  
Author(s):  
Wei Lin ◽  
Chen Jiang ◽  
Hangxing Yu ◽  
Jiaqi Li ◽  
Xinyue Liu ◽  
...  

Abstract Background: Diabetic nephropathy (DN) is one of the common complications of diabetes, it can cause a disproportionate burden. Leeches are widely used to treat DN in China, and hirudin is the main component of leeches. However, its pharmacological mechanisms and molecular targets are unclear. This work aimed to explore new biomarkers of DN and reveal the mechanism of hirudin in DN. Methods: Expression microarray datas between kidney tissues of DN and control individuals were obtained from the GEO database, and differentially expressed genes were identified as DN-related targets using the robust rank aggregation analysis. The SEA, GeneCards and SwissTargetPrediction databases were used to predict targets of hirudin. A protein-protein interaction network of DN-hirudin was conducted by Cytoscape software. GO and KEGG pathway enrichment analyses were carried out to explore the involved pharmacological mechanism of hirudin in treatment of DN. And molecular docking was adopted to predict the hub targets of hirudin. Results: A total of 30 significant DEGs (16 up- and 14 down-regulated) were identified. ATF3, SLC22A8 and TGF-Β1 are likely as new biomarkers of DN. The 42 candidate targets were identifyed in PPI network. GO analysis revealed that these targets were enriched with ubiquitin protein ligase, transcription factor binding and nuclear transport. The KEGG pathways were enriched, including AGE-RAGE signaling pathway in diabetic complications, PI3K-Akt signaling pathway, Focal adhesion. The docking results showed that hirudin could easily dock with ITGA4, EGFR and ESR1. Conclusion: Our study demonstrates that hirudin is involved in DN therapy through a multi-targeted, multi-pathway approach. It provides a basis for further research on its mechanism of action.


2021 ◽  
Vol 8 ◽  
Author(s):  
Xiao Ma ◽  
Changhua Mo ◽  
Liangzhao Huang ◽  
Peidong Cao ◽  
Louyi Shen ◽  
...  

Objective: Dilated cardiomyopathy (DCM) is a heart disease with high mortality characterized by progressive cardiac dilation and myocardial contractility reduction. The molecular signature of dilated cardiomyopathy remains to be defined. Hence, seeking potential biomarkers and therapeutic of DCM is urgent and necessary.Methods: In this study, we utilized the Robust Rank Aggregation (RRA) method to integrate four eligible DCM microarray datasets from the GEO and identified a set of significant differentially expressed genes (DEGs) between dilated cardiomyopathy and non-heart failure. Moreover, LASSO analysis was carried out to clarify the diagnostic and DCM clinical features of these genes and identify dilated cardiomyopathy derived diagnostic signatures (DCMDDS).Results: A total of 117 DEGs were identified across the four microarrays. Furthermore, GO analysis demonstrated that these DEGs were mainly enriched in the regulation of inflammatory response, the humoral immune response, the regulation of blood pressure and collagen–containing extracellular matrix. In addition, KEGG analysis revealed that DEGs were mainly enriched in diverse infected signaling pathways. Moreover, Gene set enrichment analysis revealed that immune and inflammatory biological processes such as adaptive immune response, cellular response to interferon and cardiac muscle contraction, dilated cardiomyopathy are significantly enriched in DCM. Moreover, Least absolute shrinkage and selection operator (LASSO) analyses of the 18 DCM-related genes developed a 7-gene signature predictive of DCM. This signature included ANKRD1, COL1A1, MYH6, PERELP, PRKACA, CDKN1A, and OMD. Interestingly, five of these seven genes have a correlation with left ventricular ejection fraction (LVEF) in DCM patients.Conclusion: Our present study demonstrated that the signatures could be robust tools for predicting DCM in clinical practice. And may also be potential treatment targets for clinical implication in the future.


2021 ◽  
Author(s):  
Anushri Umesh ◽  
Praveen Kumar Guttula ◽  
Mukesh Kumar Gupta

Bovine mastitis causes significant economic loss to the dairy industry by affecting milk quality and quantity. E.coli and S.aureus are the two common mastitis-causing bacteria among the consortia of mastitis pathogens, wherein E.coli is an opportunistic environmental pathogen, and S.aureus is a contagious pathogen. This study was designed to predict molecular markers of bovine mastitis by meta-analysis of differentially expressed genes (DEG) in E.coli or S.aureus infected mammary epithelial cells (MECs) using p-value combination and robust rank aggregation (RRA) methods. High throughput transcriptome of bovine (MECs, infected with E.coli or S.aureus, were analyzed, and correlation of z-scores were computed for the expression datasets to identify the lineage profile and functional ontology of DEGs. Key pathways enriched in infected MECs were deciphered by Gene Set Enrichment Analysis (GSEA), following which combined p-value and RRA were used to perform DEG meta-analysis to limit type I error in the analysis. The miRNA-Gene networks were then built to uncover potential molecular markers of mastitis. Lineage profiling of MECs showed that the gene expression levels were associated with mammary tissue lineage. The up-regulated genes were enriched in immune-related pathways whereas down-regulated genes influenced the cellular processes. GSEA analysis of DEGs deciphered the involvement of Toll-like receptor (TLR), and NF- Kappa B signalling pathway during infection. Comparison after meta-analysis yielded with genes ZC3H12A, RND1 and MAP3K8 having significant expression levels in both E.coli and S.aureus dataset and on evaluating miRNA-Gene network 7 pairs were common to both sets identifying them as potential molecular markers.


2021 ◽  
Vol 2021 ◽  
pp. 1-19
Author(s):  
Abdullateef O. Balogun ◽  
Shuib Basri ◽  
Saipunidzam Mahamad ◽  
Luiz Fernando Capretz ◽  
Abdullahi Abubakar Imam ◽  
...  

The high dimensionality of software metric features has long been noted as a data quality problem that affects the performance of software defect prediction (SDP) models. This drawback makes it necessary to apply feature selection (FS) algorithm(s) in SDP processes. FS approaches can be categorized into three types, namely, filter FS (FFS), wrapper FS (WFS), and hybrid FS (HFS). HFS has been established as superior because it combines the strength of both FFS and WFS methods. However, selecting the most appropriate FFS (filter rank selection problem) for HFS is a challenge because the performance of FFS methods depends on the choice of datasets and classifiers. In addition, the local optima stagnation and high computational costs of WFS due to large search spaces are inherited by the HFS method. Therefore, as a solution, this study proposes a novel rank aggregation-based hybrid multifilter wrapper feature selection (RAHMFWFS) method for the selection of relevant and irredundant features from software defect datasets. The proposed RAHMFWFS is divided into two stepwise stages. The first stage involves a rank aggregation-based multifilter feature selection (RMFFS) method that addresses the filter rank selection problem by aggregating individual rank lists from multiple filter methods, using a novel rank aggregation method to generate a single, robust, and non-disjoint rank list. In the second stage, the aggregated ranked features are further preprocessed by an enhanced wrapper feature selection (EWFS) method based on a dynamic reranking strategy that is used to guide the feature subset selection process of the HFS method. This, in turn, reduces the number of evaluation cycles while amplifying or maintaining its prediction performance. The feasibility of the proposed RAHMFWFS was demonstrated on benchmarked software defect datasets with Naïve Bayes and Decision Tree classifiers, based on accuracy, the area under the curve (AUC), and F-measure values. The experimental results showed the effectiveness of RAHMFWFS in addressing filter rank selection and local optima stagnation problems in HFS, as well as the ability to select optimal features from SDP datasets while maintaining or enhancing the performance of SDP models. To conclude, the proposed RAHMFWFS achieved good performance by improving the prediction performances of SDP models across the selected datasets, compared to existing state-of-the-arts HFS methods.


2021 ◽  
Author(s):  
Sepehr Golriz Khatami ◽  
Yasamin Salimi ◽  
Martin Hofmann-Apitius ◽  
Neil P Oxtoby ◽  
Colin Birkenbihl

Background: Previous models of Alzheimer's disease (AD) progression were primarily hypothetical or based on data originating from single cohort studies. However, cohort datasets are subject to specific inclusion and exclusion criteria that influence the signals observed in their collected data. Furthermore, each study measures only a subset of AD-relevant variables. To gain a comprehensive understanding of AD progression, the heterogeneity and robustness of estimated progression patterns must be understood, and complementary information contained in cohort datasets be leveraged. Methods: We compared ten event-based models that we fit to ten independent AD cohort datasets. Additionally, we designed and applied a novel rank aggregation algorithm that combines partially overlapping, individual event sequences into a meta-sequence containing the complementary information from each cohort. Results: We observed overall consistency across the ten event-based model sequences, despite variance in the positioning of mainly imaging variables. The changes described in the aggregated meta-sequence are broadly consistent with current understanding of AD progression, starting with cerebrospinal fluid amyloid beta, followed by memory impairment, tauopathy, FDG-PET, and ultimately brain deterioration and impairment of visual memory. Conclusion: Overall, the event-based models demonstrated similar and robust disease cascades across independent AD cohorts. Aggregation of data-driven results can combine complementary strengths and information of patient-level datasets. Accordingly, the derived meta-sequence draws a more complete picture of AD pathology compared to models relying on single cohorts.


2021 ◽  
Author(s):  
Fu Feng ◽  
Yu-Xiang Zhong ◽  
Jian-Hua Huang ◽  
Fu-Xiang Lin ◽  
Peng-Peng Zhao ◽  
...  

Abstract Background Bladder cancer (BC) is among the most frequent cancers globally. Although substantial efforts have been put to understand its pathogenesis, its underlying molecular mechanisms have not been fully elucidated. Methods The Robust Rank Aggregation (RRA) approach was adopted to integrate four eligible bladder urothelial carcinoma (BLCA) microarray datasets from the GEO. Differentially expressed genes (DEGs) sets were identified between tumor samples and equivalent healthy samples. We constructed gene co-expression networks using WGCNA to explore the alleged relationship between BC clinical characteristics and gene sets, as well as to identify hub genes. We also incorporated the WGCNA and RRA to screen DEGs. Results CDH11, COL6A3, EDNRA and SERPINF1 were selected from the key module and validated. Based on the results, significant downregulation of the hub genes occurred during the early stages of BC. Moreover, Receiver operating characteristics (ROC) curves and Kaplan-Meier (KM) plots showed that the genes exhibited favorable diagnostic and prognostic value for BC. Based on GSEA for single hub gene, all the genes were closely linked to BC cell proliferation. Conclusions These results offer unique insight into the pathogenesis of BC and recognize CDH11, COL6A3, EDNRA and SERPINF1 as potential biomarkers with diagnostic and prognostic roles in BC.


Sensors ◽  
2021 ◽  
Vol 21 (21) ◽  
pp. 7401
Author(s):  
Abdulaziz Alhumam

The automatic localization of software faults plays a critical role in assisting software professionals in fixing problems quickly. Despite various existing models for fault tolerance based on static features, localization is still challenging. By considering the dynamic features, the capabilities of the fault recognition models will be significantly enhanced. The current study proposes a model that effectively ranks static and dynamic parameters through Aggregation-Based Neural Ranking (ABNR). The proposed model includes rank lists produced by self-attention layers using rank aggregation mechanisms to merge them into one aggregated rank list. The rank list would yield the suspicious code statements in descending order of the rank. The performance of ABNR is evaluated against the open-source dataset for fault prediction. ABNR model has exhibited noticeable performance in fault localization. The proposed model is evaluated with other existing models like Ochiai, Fault localization technique based on complex network theory, Tarantula, Jaccard, and software-network centrality measure concerning metrics like assertions evaluated, Wilcoxon signed-rank test, and Top-N.


Author(s):  
Jingbo Huang ◽  
Feng Yao ◽  
Tao Wang ◽  
Jiting Li ◽  
Dayong Shen ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document