A Machine Learning Approach to Identify Specific Small Molecule Inhibitors of Secondary Nucleation in alpha-Synuclein Aggregation
Drug development is an increasingly active area of machine learning application, due to the high attrition rates of conventional drug discovery pipelines. This issue is especially pressing for neurodegenerative diseases where very few disease modifying drugs have been approved, demonstrating a need for novel and efficient approaches to drug discovery in this area. However, whether or not machine learning methods can fulfil this role remains to be demonstrated. To explore this possibility, we describe a machine learning approach to identify specific inhibitors of the proliferation of alpha-synuclein aggregates through secondary nucleation, a process that has been implicated in Parkinson's disease and related synucleinopathies. We use a combination of docking simulations followed by machine learning to first identify initial hit compounds and then explore the chemical space around these compounds. Our results demonstrate that this approach leads to the identification of novel chemical matter with an improved hit rate and potency over conventional similarity search approaches.