In Analytical biochemistry
X-ray crystallography is the major approach for atomic-level protein structure determination. Since not all proteins can be easily crystallized, accurate prediction of protein crystallization propensity is critical to guiding the experimental design and improving the success rate of X-ray crystallography experiments. In this work, we proposed a new deep learning pipeline, GCmapCrys, for multi-stage crystallization propensity prediction through integrating graph attention network with predicted protein contact map. Experimental results on CRYS_DS proteins with known crystallization records demonstrated that GCmapCrys increased the value of Matthew's correlation coefficient by 38.9% compared to the best of other state-of-the-art protein crystallization propensity predictors. Detailed analyses show that the major advantages of GCmapCrys lie in the efficiency of the graph attention network with predicted contact map, which effectively associates the residue-interaction knowledge with crystallization pattern. Meanwhile, the designed four sequence-based features can be complementary to further enhance crystallization prediction.
Wang Peng-Hao, Zhu Yi-Heng, Yang Xibei, Yu Dong-Jun
2022-Dec-12
Graph attention network, Protein crystallization prediction