Receive a weekly summary and discussion of the top papers of the week by leading researchers in the field.

Efficient Indexing of Peptides for Database Search Using Tide.

In Journal of proteome research

The first step in the analysis of protein tandem mass spectrometry data typically involves searching the observed spectra against a protein database. During database search, the search engine must digest the proteins in the database into peptides, subject to digestion rules that are under user control. The choice of these digestion parameters, as well as selection of post-translational modifications (PTMs), can dramatically affect the size of the search space and hence the statistical power of the search. The Tide search engine separates the creation of the peptide index from the database search step, thereby saving time by allowing a peptide index to be reused in multiple searches. Here we describe an improved implementation of the indexing component of Tide that consumes around four times less resources (CPU and RAM) than the previous version and can generate arbitrarily large peptide databases, limited by only the amount of available disk space. We use this improved implementation to explore the relationship between database size and the parameters controlling digestion and PTMs, as well as database size and statistical power. Our results can help guide practitioners in proper selection of these important parameters.

Nii Adoquaye Acquaye Frank Lawrence, Kertesz-Farkas Attila, Noble William Stafford

2023-Jan-12

database search, spectrum identification, tandem mass spectrometry

13 Jan 2023

Efficient Indexing of Peptides for Database Search Using Tide.

Weekly Summary