101
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Viral dark matter and virus–host interactions resolved from publicly available microbial genomes

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.

          DOI: http://dx.doi.org/10.7554/eLife.08490.001

          eLife digest

          Viruses are infectious particles that can only multiply inside the cells of microbes and other organisms. Little is known about the genetic differences between virus particles (so-called ‘genetic diversity’), especially compared to what we know about the diversity of bacteria, archaea, and other single-celled microbes. This lack of knowledge hampers our understanding of the role viruses play in the evolution of microbial communities and their associated ecosystems.

          Studying the genetics of the viruses in these communities is challenging. There is no single ‘marker’ gene that can be used to identify all viruses in environmental samples. Also, many of the fragments of viral genomes that have been identified have not yet been linked to their host microbes. Many viruses integrate their genome into the DNA of their host cell, and there are computational tools available that exploit this ability to identify viruses and link them to their host. However, other viruses can live and multiply inside cells without integrating their genome into the host's DNA.

          Earlier in 2015, researchers developed a new computational tool called VirSorter that can predict virus genome sequences within the DNA extracted from microbes. VirSorter identifies viral genome sequences based on the presence of ‘hallmark’ genes that encode for components found in many virus particles, together with a reference database of genomes from many viruses.

          Now, Roux et al.—including some of the researchers from the earlier work—use VirSorter to predict viral DNA from publicly available bacteria and archaea genome data. The study identifies over 12,000 viral genomes and links them to their microbial hosts. These data increase the number of viral genome sequences that are publically available by a factor of ten and identify the first viruses associated with 13 new types of bacteria, which include species that are abundant in particular environments.

          It is possible for several different viruses to infect a single cell at the same time. Some viruses are known to be able to exchange DNA, and if this happens frequently in other viruses, it could have a big impact on how viruses evolve. Roux et al.'s findings suggest that although it is common for several different viruses to infect the same cell, it is relatively rare for these viruses to exchange genetic material.

          Roux et al.'s findings demonstrate the value of searching publicly available microbial genome data for fragments of viral genomes. These new viral genomes will serve as a useful resource for researchers as they explore the communities of viruses and microbes in natural environments, the human body and in industrial processes.

          DOI: http://dx.doi.org/10.7554/eLife.08490.002

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: not found

          A molecular view of microbial diversity and the biosphere.

          N Pace (1997)
          Over three decades of molecular-phylogenetic studies, researchers have compiled an increasingly robust map of evolutionary diversification showing that the main diversity of life is microbial, distributed among three primary relatedness groups or domains: Archaea, Bacteria, and Eucarya. The general properties of representatives of the three domains indicate that the earliest life was based on inorganic nutrition and that photosynthesis and use of organic compounds for carbon and energy metabolism came comparatively later. The application of molecular-phylogenetic methods to study natural microbial ecosystems without the traditional requirement for cultivation has resulted in the discovery of many unexpected evolutionary lineages; members of some of these lineages are only distantly related to known organisms but are sufficiently abundant that they are likely to have impact on the chemistry of the biosphere.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            NCBI Reference Sequences: current status, policy and new initiatives

            NCBI's Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. RefSeq records integrate information from multiple sources and represent a current description of the sequence, the gene and sequence features. The database includes over 5300 organisms spanning prokaryotes, eukaryotes and viruses, with records for more than 5.5 × 106 proteins (RefSeq release 30). Feature annotation is applied by a combination of curation, collaboration, propagation from other sources and computation. We report here on the recent growth of the database, recent changes to feature annotations and record types for eukaryotic (primarily vertebrate) species and policies regarding species inclusion and genome annotation. In addition, we introduce RefSeqGene, a new initiative to support reporting variation data on a stable genomic coordinate system.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla.

              BD1-5, OP11, and OD1 bacteria have been widely detected in anaerobic environments, but their metabolisms remain unclear owing to lack of cultivated representatives and minimal genomic sampling. We uncovered metabolic characteristics for members of these phyla, and a new lineage, PER, via cultivation-independent recovery of 49 partial to near-complete genomes from an acetate-amended aquifer. All organisms were nonrespiring anaerobes predicted to ferment. Three augment fermentation with archaeal-like hybrid type II/III ribulose-1,5-bisphosphate carboxylase-oxygenase (RuBisCO) that couples adenosine monophosphate salvage with CO(2) fixation, a pathway not previously described in Bacteria. Members of OD1 reduce sulfur and may pump protons using archaeal-type hydrogenases. For six organisms, the UGA stop codon is translated as tryptophan. All bacteria studied here may play previously unrecognized roles in hydrogen production, sulfur cycling, and fermentation of refractory sedimentary carbon.
                Bookmark

                Author and article information

                Contributors
                Role: Reviewing editor
                Journal
                eLife
                eLife
                eLife
                eLife
                eLife Sciences Publications, Ltd
                2050-084X
                2050-084X
                22 July 2015
                2015
                : 4
                : e08490
                Affiliations
                [1 ]deptDepartment of Ecology and Evolutionary Biology , University of Arizona , Tucson, United States
                [2 ]deptDepartment of Microbiology and Immunology , University of British Columbia , Vancouver, Canada
                [3 ]deptGraduate Program in Bioinformatics , University of British Columbia , Vancouver, Canada
                [4 ]U.S Department of Energy Joint Genome Institute , Walnut Creek, United States
                Max Planck Institute for Developmental Biology , Germany
                Max Planck Institute for Developmental Biology , Germany
                Author notes
                [* ]For correspondence: mbsulli@ 123456gmail.com
                [†]

                Department of Microbiology, The Ohio State University, Columbus, United States.

                [‡]

                Department of Civil, Environmental, and Geodetic Engineering, Columbus, United States.

                Article
                08490
                10.7554/eLife.08490
                4533152
                26200428
                c97d5582-74e4-4f02-b6d2-e0962e6fd338
                © 2015, Roux et al

                This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

                History
                : 02 May 2015
                : 22 July 2015
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100000936, Gordon and Betty Moore Foundation;
                Award ID: 3790
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000038, Natural Sciences and Engineering Research Council of Canada (Conseil de Recherches en Sciences Naturelles et en Génie du Canada);
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000196, Canada Foundation for Innovation (Fondation canadienne pour l'innovation);
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100007631, Canadian Institute for Advanced Research (L'Institut Canadien de Recherches Avancées);
                Award Recipient :
                Funded by: Tula Foundation;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100001246, Ambrose Monell Foundation;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100001372, G. Unger Vetlesen Foundation;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100000015, U.S. Department of Energy (Department of Energy);
                Award ID: Joint Genome Institute (DE-AC02-05CH11231)
                Award Recipient :
                The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
                Categories
                Tools and Resources
                Ecology
                Genomics and Evolutionary Biology
                Custom metadata
                2.3
                From public microbial genomes, VirSorter revealed 12,498 viral genome sequences that expand the map of the global virosphere and whose analyses improve understanding of viral taxonomy, evolution and virus-host interactions.

                Life sciences
                virus,phage,prophage,virus-host adaptation,none
                Life sciences
                virus, phage, prophage, virus-host adaptation, none

                Comments

                Comment on this article