Poster
Latent Imputation before Prediction: A New Computational Paradigm for De Novo Peptide Sequencing
Ye DU · Chen Yang · Nanxi Yu · Wanyu LIN · Qian Zhao · Shujun Wang
West Exhibition Hall B2-B3 #W-712
Peptides, the building blocks of proteins, are crucial for understanding biological processes and developing new therapies. De novo peptide sequencing is a computational technique that determines peptide sequences directly from mass spectrometry data, without relying on existing databases. However, missing data in spectra—caused by suboptimal experimental conditions—makes sequencing challenging. To address this, we developed LIPNovo, a novel method that compensates for missing information in spectral data before predicting peptide sequences. Instead of trying to recreate missing raw data, LIPNovo uses advanced machine learning techniques to fill in the gaps within the model's inherent representations, guided by theoretical knowledge of peptides. This approach improves the quality and reliability of peptide predictions. Our experiments show that LIPNovo significantly outperforms existing methods, making peptide sequencing more accurate. This advancement has the potential to accelerate discoveries in biology, biotechnology, and medicine.