Poster
A Physics-Augmented Deep Learning Framework for Classifying Single Molecule Force Spectroscopy Data
Cailong Hua · Sivaraman Rajaganapathy · Rebecca Slick · Joseph Vavra · Joseph Muretta · James Ervasti · Murti Salapaka
West Exhibition Hall B2-B3 #W-117
Understanding protein folding and unfolding under mechanical forces is important for understanding biological processes. Such insights hold the promise of developing treatments for serious conditions, including muscular disorders like Duchenne Muscular Dystrophy and neurodegenerative diseases such as Parkinson’s disease. Single molecule force spectroscopy (SMFS) is a powerful technique for investigating forces involved in protein folding and unfolding. However, real-world SMFS data often includes trials from multiple proteins, which can confound meaningful results. To interpret these data correctly, researchers need to isolate clean, single-molecule trials—a process that traditionally required manual inspection by experts and could take an entire day for just one experiment.In this work, we automated this filtering process by applying several state-of-the-art machine learning models and introducing a novel deep learning model that incorporates physical knowledge of proteins. We also provide a simulation engine to generate realistic datasets alongside extensive experimental data from a variety of proteins. Our model outperforms existing methods and reduces the time needed to extract meaningful results from a full day to under an hour. To support future research, we have made our datasets and Python-based toolbox publicly available.