bioRxiv Preprint
The emergence of SARS-CoV-2 reawakened the need to rapidly understand the molecular etiologies, pandemic potential, and prospective treatments of infectious agents. The lack of existing data on SARS-CoV-2 hampered early attempts to treat severe forms of COVID-19 during the pandemic. This study coupled existing transcriptomic data from SARS-CoV-1 lung infection animal studies with crowdsourcing statistical approaches to derive temporal meta-signatures of host responses during early viral accumulation and subsequent clearance stages. Unsupervised and supervised machine learning approaches identified top dysregulated genes and potential biomarkers (e.g., CXCL10, BEX2, and ADM). Temporal meta-signatures revealed distinct gene expression programs with biological implications to a series of host responses underlying sustained Cxcl10 expression and Stat signaling. Cell cycle switched from G1/G0 phase genes, early in infection, to a G2/M gene signature during late infection that correlated with the enrichment of DNA Damage Response and Repair genes. The SARS-CoV-1 meta-signatures were shown to closely emulate human SARS-CoV-2 host responses from emerging RNAseq, single cell and proteomics data with early monocyte-macrophage activation followed by lymphocyte proliferation. The circulatory hormone adrenomedullin was observed as maximally elevated in elderly patients that died from COVID-19. Stage-specific correlations to compounds with potential to treat COVID-19 and future coronavirus infections were in part validated by a subset of twenty-four that are in clinical trials to treat COVID-19. This study represents a roadmap to leverage existing data in the public domain to derive novel molecular and biological insights and potential treatments to emerging human pathogens. The data from this study is available in an interactive portal (http://18.222.95.219:8047).
Flynn, J.; Ahmadi, M.; McFarland, C.; Kubal, M.; Taylor, M.; Cheng, Z.; Torchia, E. C.; Edwards, M.
2022-12-15