Anal Chem. 2023 Sep 04.
The development of ion mobility-mass spectrometry (IM-MS) has revolutionized the analysis of small molecules, such as metabolomics, lipidomics, and exposome studies. The curation of comprehensive reference collision cross-section (CCS) databases plays a pivotal role in the successful application of IM-MS for small-molecule analysis. In this study, we presented AllCCS2, an enhanced version of AllCCS, designed for the universal prediction of the ion mobility CCS values of small molecules. AllCCS2 incorporated newly available experimental CCS data, including 10,384 records and 7713 unified values, as training data. By leveraging a neural network trained on diverse molecular representations encompassing mass spectrometry features, molecular descriptors, and graph features extracted using a graph convolutional network, AllCCS2 achieved exceptional prediction accuracy. AllCCS2 achieved median relative error (MedRE) values of 0.31, 0.72, and 1.64% in the training, validation, and testing sets, respectively, surpassing existing CCS prediction tools in terms of accuracy and coverage. Furthermore, AllCCS2 exhibited excellent compatibility with different instrument platforms (DTIMS, TWIMS, and TIMS). The prediction uncertainties in AllCCS2 from the training data and the prediction model were comprehensively investigated by using representative structure similarity and model prediction variation. Notably, small molecules with high structural similarities to the training set and lower model prediction variation exhibited improved accuracy and lower relative errors. In summary, AllCCS2 serves as a valuable resource to support applications of IM-MS technologies. The AllCCS2 database and tools are freely accessible at http://allccs.zhulab.cn/.