similarity information
Recently Published Documents


TOTAL DOCUMENTS

68
(FIVE YEARS 31)

H-INDEX

10
(FIVE YEARS 3)

2022 ◽  
Vol 12 ◽  
Author(s):  
Chu-Qiao Gao ◽  
Yuan-Ke Zhou ◽  
Xiao-Hong Xin ◽  
Hui Min ◽  
Pu-Feng Du

Drug repositioning provides a promising and efficient strategy to discover potential associations between drugs and diseases. Many systematic computational drug-repositioning methods have been introduced, which are based on various similarities of drugs and diseases. In this work, we proposed a new computational model, DDA-SKF (drug–disease associations prediction using similarity kernels fusion), which can predict novel drug indications by utilizing similarity kernel fusion (SKF) and Laplacian regularized least squares (LapRLS) algorithms. DDA-SKF integrated multiple similarities of drugs and diseases. The prediction performances of DDA-SKF are better, or at least comparable, to all state-of-the-art methods. The DDA-SKF can work without sufficient similarity information between drug indications. This allows us to predict new purpose for orphan drugs. The source code and benchmarking datasets are deposited in a GitHub repository (https://github.com/GCQ2119216031/DDA-SKF).


2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Shiming Wang ◽  
Jie Li ◽  
Yadong Wang

Abstract Background Detecting pathogenic proteins is the origin way to understand the mechanism and resist the invasion of diseases, making pathogenic protein prediction develop into an urgent problem to be solved. Prediction for genome-wide proteins may be not necessarily conducive to rapidly cure diseases as developing new drugs specifically for the predicted pathogenic protein always need major expenditures on time and cost. In order to facilitate disease treatment, computational method to predict pathogenic proteins which are targeted by existing drugs should be exploited. Results In this study, we proposed a novel computational model to predict drug-targeted pathogenic proteins, named as M2PP. Three types of features were presented on our constructed heterogeneous network (including target proteins, diseases and drugs), which were based on the neighborhood similarity information, drug-inferred information and path information. Then, a random forest regression model was trained to score unconfirmed target-disease pairs. Five-fold cross-validation experiment was implemented to evaluate model’s prediction performance, where M2PP achieved advantageous results compared with other state-of-the-art methods. In addition, M2PP accurately predicted high ranked pathogenic proteins for common diseases with public biomedical literature as supporting evidence, indicating its excellent ability. Conclusions M2PP is an effective and accurate model to predict drug-targeted pathogenic proteins, which could provide convenience for the future biological researches.


2021 ◽  
Vol 12 ◽  
Author(s):  
Zhen-Wei Zhang ◽  
Zhen Gao ◽  
Chun-Hou Zheng ◽  
Lei Li ◽  
Su-Min Qi ◽  
...  

An increasing number of experiments had verified that miRNA expression is related to human diseases. The miRNA expression profile may be an indicator of clinical diagnosis and provides a new direction for the prevention and treatment of complex diseases. In this work, we present a weighted voting-based model for predicting miRNA–disease association (WVMDA). To reasonably build a network of similarity, we established credibility similarity based on the reliability of known associations and used it to improve the original incomplete similarity. To eliminate noise interference as much as possible while maintaining more reliable similarity information, we developed a filter. More importantly, to ensure the fairness and efficiency of weighted voting, we focus on the design of weighting. Finally, cross-validation experiments and case studies are undertaken to verify the efficacy of the proposed model. The results showed that WVMDA could efficiently identify miRNAs associated with the disease.


2021 ◽  
Author(s):  
Weiren Yu ◽  
Sima Iranmanesh ◽  
Aparajita Haldar ◽  
Maoyin Zhang ◽  
Hakan Ferhatosmanoglu

AbstractRoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties (i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.


2021 ◽  
Vol 14 (4) ◽  
pp. 1-14
Author(s):  
Rachele Sprugnoli ◽  
Marco Guerini ◽  
Giovanni Moretti ◽  
Sara Tonelli

Digital games have been used in the context of a cultural experience for several reasons, from learning to socialising and having fun. As a positive side effect, using digital games in a GLAM environment contributes to increasing the visitors’ engagement and making the collections more popular. Along this line, we present in this article an online game for museum environments that serves two goals: asking users to engage with the artworks in a collection in a playful environment, and collecting their feedback on artwork similarity, which may be used by curators to rethink the organisation of digital exhibitions and in general to better understand how visitors perceive artworks. The game is called PAGANS ( Playful Art: a GAme oN Similarity ), and is designed to collect similarity judgements about artworks. The software was implemented following some principles of gamification in order to quickly leverage similarity information in the cultural heritage domain while increasing user engagement and fun. The game, involving pairs of players, was used during two large public events to collect different data about the players’ behaviour and to investigate how these dimensions correlate with aesthetic perception. A thorough statistical analysis shows that age and (self-declared) gender correlates with the time and the number of moves needed to complete a session. These dimensions also link to the relevance of colour and shape in judging similarity. These findings suggest that, although artwork similarity is very subjective and may vary based on a person’s background, some trends can be identified when considering the subjects’ gender and age. This could have some practical implications; for example, it could be used to support art curators in creating digital exhibitions by grouping artworks in novel, user-centred ways.


2021 ◽  
Vol 11 (6) ◽  
pp. 1527-1532
Author(s):  
G. Shobana ◽  
S. Shankar

Prediction of the development risk of some diseases is an important area of Health Care Research. When exploring the personalized care of the patients, precise identification and classification of similarity in patients from their past report is an important process. Electronically stored health information EHRs that has been sampled unevenly as well as which has variable appointment durations, is considered to be unsuitable for measuring the similarity among patients directly, as there is no proper representation that are fitting. In addition, a technique is required that is efficient to evaluate similarities in patient. We propose two new similarities learning environments using deep learning that learn simultaneously the representations of the patients as well as measurement of similarity in pairs. A Convolutional Neural Network (CNN) is used to understand EHRs that contains crucial information which are local thereby providing scholastic illumination in the triplet loss otherwise entropy loss. When the training is completed, distances are calculated as well as similarities scores. Using this similarity information, disease predictions along with patient grouping is performed. Experimentally the results gives an idea that CNN can represent the EHR sequences in a better way and the schema offered are more efficient than the modern metric distance learning.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Aanchal Mongia ◽  
Sanjay Kr. Saha ◽  
Emilie Chouzenoux ◽  
Angshul Majumdar

AbstractThe year 2020 witnessed a heavy death toll due to COVID-19, calling for a global emergency. The continuous ongoing research and clinical trials paved the way for vaccines. But, the vaccine efficacy in the long run is still questionable due to the mutating coronavirus, which makes drug re-positioning a reasonable alternative. COVID-19 has hence fast-paced drug re-positioning for the treatment of COVID-19 and its symptoms. This work builds computational models using matrix completion techniques to predict drug-virus association for drug re-positioning. The aim is to assist clinicians with a tool for selecting prospective antiviral treatments. Since the virus is known to mutate fast, the tool is likely to help clinicians in selecting the right set of antivirals for the mutated isolate. The main contribution of this work is a manually curated database publicly shared, comprising of existing associations between viruses and their corresponding antivirals. The database gathers similarity information using the chemical structure of drugs and the genomic structure of viruses. Along with this database, we make available a set of state-of-the-art computational drug re-positioning tools based on matrix completion. The tools are first analysed on a standard set of experimental protocols for drug target interactions. The best performing ones are applied for the task of re-positioning antivirals for COVID-19. These tools select six drugs out of which four are currently under various stages of trial, namely Remdesivir (as a cure), Ribavarin (in combination with others for cure), Umifenovir (as a prophylactic and cure) and Sofosbuvir (as a cure). Another unanimous prediction is Tenofovir alafenamide, which is a novel Tenofovir prodrug developed in order to improve renal safety when compared to its original counterpart (older version) Tenofovir disoproxil. Both are under trail, the former as a cure and the latter as a prophylactic. These results establish that the computational methods are in sync with the state-of-practice. We also demonstrate how the drugs to be used against the virus would vary as SARS-Cov-2 mutates over time by predicting the drugs for the mutated strains, suggesting the importance of such a tool in drug prediction. We believe this work would open up possibilities for applying machine learning models to clinical research for drug-virus association prediction and other similar biological problems.


2021 ◽  
pp. 104265
Author(s):  
Gerben A. Bekker ◽  
Arnout R.H. Fischer ◽  
Hilde Tobi ◽  
Hans C.M. van Trijp

2021 ◽  
Author(s):  
Mengbing Li ◽  
Daniel E. Park ◽  
Maliha Aziz ◽  
Cindy M Liu ◽  
Lance B. Price ◽  
...  

SummaryThis paper is concerned with using multivariate binary observations to estimate the proportions of unobserved classes with scientific meanings. We focus on the setting where additional information about sample similarities is available and represented by a rooted weighted tree. Every leaf in the given tree contains multiple independent samples. Shorter distances over the tree between the leaves indicate higher similarity. We propose a novel data integrative extension to classical latent class models (LCMs) with tree-structured shrinkage. The proposed approach enables 1) borrowing of information across leaves, 2) estimating data-driven leaf groups with distinct vectors of class proportions, and 3) individual-level probabilistic class assignment given the observed multivariate binary measurements. We derive and implement a scalable posterior inference algorithm in a variational Bayes framework. Extensive simulations show more accurate estimation of class proportions than alternatives that suboptimally use the additional sample similarity information. A zoonotic infectious disease application is used to illustrate the proposed approach. The paper concludes by a brief discussion on model limitations and extensions.


Sign in / Sign up

Export Citation Format

Share Document