25
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      BABEL enables cross-modality translation between multiomic profiles at single-cell resolution

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Significance

          Simultaneous measurement of the DNA, RNA, and proteins of single cells can lead to important new insights but is experimentally challenging. This work introduces a deep learning algorithm that flexibly translates between chromatin, RNA, and protein profiles in single cells. This makes it possible to computationally synthesize matched multiomic measurements when only one modality is experimentally available. This algorithm complements experimental advances to efficiently achieve single-cell multiomic insight.

          Abstract

          Simultaneous profiling of multiomic modalities within a single cell is a grand challenge for single-cell biology. While there have been impressive technical innovations demonstrating feasibility—for example, generating paired measurements of single-cell transcriptome (single-cell RNA sequencing [scRNA-seq]) and chromatin accessibility (single-cell assay for transposase-accessible chromatin using sequencing [scATAC-seq])—widespread application of joint profiling is challenging due to its experimental complexity, noise, and cost. Here, we introduce BABEL, a deep learning method that translates between the transcriptome and chromatin profiles of a single cell. Leveraging an interoperable neural network model, BABEL can predict single-cell expression directly from a cell’s scATAC-seq and vice versa after training on relevant data. This makes it possible to computationally synthesize paired multiomic measurements when only one modality is experimentally available. Across several paired single-cell ATAC and gene expression datasets in human and mouse, we validate that BABEL accurately translates between these modalities for individual cells. BABEL also generalizes well to cell types within new biological contexts not seen during training. Starting from scATAC-seq of patient-derived basal cell carcinoma (BCC), BABEL generated single-cell expression that enabled fine-grained classification of complex cell states, despite having never seen BCC data. These predictions are comparable to analyses of experimental BCC scRNA-seq data for diverse cell types related to BABEL’s training data. We further show that BABEL can incorporate additional single-cell data modalities, such as protein epitope profiling, thus enabling translation across chromatin, RNA, and protein. BABEL offers a powerful approach for data exploration and hypothesis generation.

          Related collections

          Most cited references57

          • Record: found
          • Abstract: not found
          • Article: not found

          Matplotlib: A 2D Graphics Environment

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Comprehensive Integration of Single-Cell Data

            Single-cell transcriptomics has transformed our ability to characterize cell states, but deep biological understanding requires more than a taxonomic listing of clusters. As new methods arise to measure distinct cellular modalities, a key analytical challenge is to integrate these datasets to better understand cellular identity and function. Here, we develop a strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities. After demonstrating improvement over existing methods for integrating scRNA-seq data, we anchor scRNA-seq experiments with scATAC-seq to explore chromatin differences in closely related interneuron subsets and project protein expression measurements onto a bone marrow atlas to characterize lymphocyte populations. Lastly, we harmonize in situ gene expression and scRNA-seq datasets, allowing transcriptome-wide imputation of spatial gene expression patterns. Our work presents a strategy for the assembly of harmonized references and transfer of information across datasets.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              SCANPY : large-scale single-cell gene expression data analysis

              Scanpy is a scalable toolkit for analyzing single-cell gene expression data. It includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks. Its Python-based implementation efficiently deals with data sets of more than one million cells (https://github.com/theislab/Scanpy). Along with Scanpy, we present AnnData, a generic class for handling annotated data matrices (https://github.com/theislab/anndata).
                Bookmark

                Author and article information

                Journal
                Proc Natl Acad Sci U S A
                Proc Natl Acad Sci U S A
                pnas
                pnas
                PNAS
                Proceedings of the National Academy of Sciences of the United States of America
                National Academy of Sciences
                0027-8424
                1091-6490
                13 April 2021
                07 April 2021
                07 April 2021
                : 118
                : 15
                : e2023070118
                Affiliations
                [1] aDepartment of Computer Science, Stanford University , Stanford, CA 94305;
                [2] bDepartment of Biomedical Data Science, Stanford University School of Medicine , Stanford, CA 94305;
                [3] cCenter for Personal and Dynamic Regulomes, Stanford University School of Medicine , Stanford, CA 94305;
                [4] dHHMI, Stanford University School of Medicine , Stanford, CA 94305
                Author notes
                1To whom correspondence may be addressed. Email: howchang@ 123456stanford.edu or jamesz@ 123456stanford.edu .

                This contribution is part of the special series of Inaugural Articles by members of the National Academy of Sciences elected in 2020.

                Contributed by Howard Y. Chang, November 4, 2020 (sent for review November 4, 2020; reviewed by Junhyong Kim and X. Shirley Liu)

                Author contributions: H.Y.C. and J.Z. designed research; K.E.W. performed research; K.E.Y. contributed new reagents/analytic tools; K.E.W., K.E.Y., H.Y.C., and J.Z. analyzed data; and K.E.W., K.E.Y., H.Y.C., and J.Z. wrote the paper.

                Reviewers: J.K., University of Pennsylvania; and X.S.L., Dana-Farber Cancer Institute.

                Author information
                https://orcid.org/0000-0002-4786-9796
                https://orcid.org/0000-0002-9459-4393
                Article
                202023070
                10.1073/pnas.2023070118
                8054007
                33827925
                8012d06d-c664-4979-8bf4-a61dd87345a7
                Copyright © 2021 the Author(s). Published by PNAS.

                This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY).

                History
                Page count
                Pages: 11
                Funding
                Funded by: HHS | NIH | National Human Genome Research Institute (NHGRI) 100000051
                Award ID: RM1-HG007735
                Award Recipient : Howard Y. Chang
                Funded by: HHS | NIH | National Cancer Institute (NCI) 100000054
                Award ID: R35-CA209919
                Award Recipient : Howard Y. Chang
                Funded by: Howard Hughes Medical Institute (HHMI) 100000011
                Award ID: Investigator
                Award Recipient : Howard Y. Chang
                Funded by: National Science Foundation (NSF) 100000001
                Award ID: CCF 1763191
                Award Recipient : James Zou
                Funded by: HHS | NIH | National Institute of Mental Health (NIMH) 100000025
                Award ID: U01-MH098953
                Award Recipient : James Zou
                Funded by: Silicon Valley Foundation
                Award ID: N/A
                Award Recipient : James Zou
                Funded by: Chan-Zuckerberg Initiative
                Award ID: N/A
                Award Recipient : James Zou
                Funded by: National Institute of Aging
                Award ID: P30-AG059307
                Award Recipient : James Zou
                Funded by: National Institute for Minority Health and Health Disparities
                Award ID: R21-MD012867
                Award Recipient : James Zou
                Categories
                1
                442
                419
                Biological Sciences
                Genetics
                Inaugural Article

                single-cell analysis,multiomics,deep learning,gene regulation

                Comments

                Comment on this article