Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.07.04.547619v1?rss=1
Authors: Liu, L., Li, W., Wong, K.-c., Yang, F., Yao, J.
Abstract: Despite the recent advancements in single-cell proteome technology, it still has limitation on throughput, proteome depth and batch effect, and the cost is still high. Inspired by the translation procedure of both natural language processing (NLP) and the genetic central dogma, we propose a pre-trained large language model named scTranslator (single-cell translator), which is align-free and generates absent single-cell proteome by inferring from the transcriptome. scTranslator achieves a general knowledge of RNA-protein interactions by being pre-trained on substantial amounts of bulk and single-cell data. Systematic benchmarking confirms the accuracy, stability, and flexibility of scTranslator across various quantification techniques, cell types, and conditions. Furthermore, we apply scTranslator to various downstream analyses and applications, including interaction inference, gene pseudo-knockout, cell clustering, and cell origin recognition of pan-cancer data.
Copy rights belong to original authors. Visit the link for more info
Podcast created by Paper Player, LLC