27
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      Cloud computing for genomic data analysis and collaboration

      ,
      Nature Reviews Genetics
      Springer Nature

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          <p class="first" id="P2">DNA sequencing made huge strides in the last decade. Studies based on large sequencing datasets appear frequently, and public archives for raw sequencing data have been doubling in size every 18 months. Meanwhile, commercial and academic cloud computing have matured, leading to more providers, greater total capacity, and a larger variety of services. Here we describe how cloud computing is used for large-scale genomics collaborations and research and argue how cloud computing will likely be a basic underpinning for future large-scale genomics collaborations and for efforts to re-analyze archived data, including privacy-protected data. </p>

          Related collections

          Most cited references48

          • Record: found
          • Abstract: not found
          • Article: not found

          MapReduce

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Diversity and dynamics of the Drosophila transcriptome

            Animal transcriptomes are dynamic, each cell type, tissue and organ system expressing an ensemble of transcript isoforms that give rise to substantial diversity. We identified new genes, transcripts, and proteins using poly(A)+ RNA sequence from Drosophila melanogaster cultured cell lines, dissected organ systems, and environmental perturbations. We found a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing. The magnitudes of splicing changes are larger between tissues than between developmental stages, and most sex-specific splicing is gonad-specific. Gonads express hundreds of previously unknown coding and long noncoding RNAs (lncRNAs) some of which are antisense to protein-coding genes and produce short regulatory RNAs. Furthermore, previously identified pervasive intergenic transcription occurs primarily within newly identified introns. The fly transcriptome is substantially more complex than previously recognized arising from combinatorial usage of promoters, splice sites, and polyadenylation sites.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression.

              Many studies have used DNA microarrays to identify the gene expression signatures of human cancer, yet the critical features of these often unmanageably large signatures remain elusive. To address this, we developed a statistical method, comparative metaprofiling, which identifies and assesses the intersection of multiple gene expression signatures from a diverse collection of microarray data sets. We collected and analyzed 40 published cancer microarray data sets, comprising 38 million gene expression measurements from >3,700 cancer samples. From this, we characterized a common transcriptional profile that is universally activated in most cancer types relative to the normal tissues from which they arose, likely reflecting essential transcriptional features of neoplastic transformation. In addition, we characterized a transcriptional profile that is commonly activated in various types of undifferentiated cancer, suggesting common molecular mechanisms by which cancer cells progress and avoid differentiation. Finally, we validated these transcriptional profiles on independent data sets.
                Bookmark

                Author and article information

                Journal
                Nature Reviews Genetics
                Nat Rev Genet
                Springer Nature
                1471-0056
                1471-0064
                January 30 2018
                January 30 2018
                :
                :
                Article
                10.1038/nrg.2017.113
                6452449
                29379135
                3db254a3-0418-43b1-9a89-be5b368d873f
                © 2018
                History

                Comments

                Comment on this article