Natural and Pathogenic Protein Sequence Variation Affecting Prion-Like Domains Within and Across Human Proteomes
ABSTRACTProtein aggregation is involved in a variety of muscular and neurodegenerative disorders. For many of these disorders, current models suggest a prion-like molecular mechanism of disease, whereby proteins aggregate and spread to neighboring cells in an infectious manner. A variety of proteins with prion-like domains (PrLDs) have recently been linked to these disorders. The development of prion prediction algorithms has facilitated the large-scale identification of PrLDs among “reference” proteomes for various organisms. However, the degree to which intraspecies protein sequence diversity influences predicted aggregation propensity for PrLDs has not been systematically examined. Here, we explore protein sequence variation introduced at genetic, post-transcriptional, and post-translational levels, and its influence on predicted aggregation propensity for human PrLDs. We find that sequence variation is relatively common among PrLDs and in some cases can result in relatively large differences in predicted aggregation propensity. Analysis of a database of sequence variants associated with human disease reveals a number of mutations within PrLDs that are predicted to increase aggregation propensity. Our analyses expand the list of candidate human PrLDs, estimate the effects of sequence variation on the aggregation propensity of PrLDs, and suggest the involvement of prion-like mechanisms in additional human diseases.