Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.02.22.529627v1?rss=1
Authors: Yu, L., Liu, C., Yang, J. Y. H., Yang, P.
Abstract: Motivation: Recent advances in multimodal single-cell omics technologies enable multiple modalities of molecular attributes, such as gene expression, chromatin accessibility, and protein abundance, to be profiled simultaneously at a global level in individual cells. While the increasing availability of multiple data modalities is expected to provide a more accurate clustering and characterisation of cells, the development of computational methods that are capable of extracting information embedded across data modalities is still in its infancy. Results: We propose SnapCCESS for clustering cells by integrating data modalities in multimodal single-cell omics data using an unsupervised ensemble deep learning framework. By creating snapshots of embeddings of multimodality using variational autoencoders, SnapCCESS can be coupled with various clustering algorithms for generating consensus clustering of cells. We applied SnapCCESS with several clustering algorithms to various datasets generated from popular multimodal single-cell omics technologies. Our results demonstrate that SnapCCESS is effective and more efficient than conventional ensemble deep learning-based clustering methods and outperforms other state-of-the-art multimodal embedding generation methods in integrating data modalities for clustering cells. The improved clustering of cells from SnapCCESS will pave the way for more accurate characterisation of cell identity and types, an essential step for various downstream analyses of multimodal single-cell omics data. Availability and implementation: SnapCCESS is implemented as a Python package and is freely available from https://github.com/yulijia/SnapCCESS.
Copy rights belong to original authors. Visit the link for more info
Podcast created by Paper Player, LLC