Horizontal Transfer, Not Duplication, Drives the Expansion of Protein Families in Prokaryotes

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Gene duplication followed by neo- or sub-functionalization deeply impacts the evolution of protein families and is regarded as the main source of adaptive functional novelty in eukaryotes. While there is ample evidence of adaptive gene duplication in prokaryotes, it is not clear whether duplication outweighs the contribution of horizontal gene transfer in the expansion of protein families. We analyzed closely related prokaryote strains or species with small genomes ( Helicobacter, Neisseria, Streptococcus, Sulfolobus), average-sized genomes ( Bacillus, Enterobacteriaceae), and large genomes ( Pseudomonas, Bradyrhizobiaceae) to untangle the effects of duplication and horizontal transfer. After removing the effects of transposable elements and phages, we show that the vast majority of expansions of protein families are due to transfer, even among large genomes. Transferred genes—xenologs—persist longer in prokaryotic lineages possibly due to a higher/longer adaptive role. On the other hand, duplicated genes—paralogs—are expressed more, and, when persistent, they evolve slower. This suggests that gene transfer and gene duplication have very different roles in shaping the evolution of biological systems: transfer allows the acquisition of new functions and duplication leads to higher gene dosage. Accordingly, we show that paralogs share most protein–protein interactions and genetic regulators, whereas xenologs share very few of them. Prokaryotes invented most of life's biochemical diversity. Therefore, the study of the evolution of biology systems should explicitly account for the predominant role of horizontal gene transfer in the diversification of protein families.

Author Summary

Prokaryotes can be found in the most diverse and severe ecological niches of the planet. Their rapid adaptation is, in part, the result of the ability to acquire genetic information horizontally. This means that prokaryotes utilize two major paths to expand their repertoire of protein families: they can duplicate a pre-existing gene or acquire it by horizontal transfer. In this study, we track family expansions among closely related strains of prokaryotic species. We find that the majority of gene expansions arrive via transfer not via duplication. Additionally, we find that duplicate genes tend be more transient and evolve slower than transferred ones, highlighting different roles with respect to adaptation and evolution. These results suggest that prevailing theories aimed at understanding the evolution of biological systems grounded on gene duplication might be poorly fit to explain the evolution of prokaryotic systems, which include the vast majority of life's biochemical diversity.

Related collections

Most cited references 56

Record: found
Abstract: found
Article: not found

ISfinder: the reference centre for bacterial insertion sequences

P. Siguier, J. Perochon, L. Lestrade … (2005)

ISfinder () is a dedicated database for bacterial insertion sequences (ISs). It has superseded the Stanford reference center. One of its functions is to assign IS names and to provide a focal point for a coherent nomenclature. It is also the repository for ISs. Each new IS is indexed together with information such as its DNA sequence and open reading frames or potential coding sequences, the sequence of the ends of the element and target sites, its origin and distribution together with a bibliography where available. Another objective is to continuously monitor ISs to provide updated comprehensive groupings or families and to provide some insight into their phylogenies. The site also contains extensive background information on ISs and transposons in general. Online tools are gradually being added. At present an online Blast facility against the entire bank is available. But additional features will include alignment capability, PsiBLAST and HMM profiles. ISfinder also includes a section on bacterial genomes and is involved in annotating the IS content of these genomes. Finally, this database is currently recommended by several microbiology journals for registration of new IS elements before their publication.

0 comments Cited 1134 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

P. Sharp, W Li (1987)

A simple, effective measure of synonymous codon usage bias, the Codon Adaptation Index, is detailed. The index uses a reference set of highly expressed genes from a species to assess the relative merits of each codon, and a score for a gene is calculated from the frequency of use of all codons in that gene. The index assesses the extent to which selection has been effective in moulding the pattern of codon usage. In that respect it is useful for predicting the level of expression of a gene, for assessing the adaptation of viral genes to their hosts, and for making comparisons of codon usage in different organisms. The index may also give an approximate indication of the likely success of heterologous gene expression.

0 comments Cited 733 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data.

O. Gascuel (1997)

We propose an improved version of the neighbor-joining (NJ) algorithm of Saitou and Nei. This new algorithm, BIONJ, follows the same agglomerative scheme as NJ, which consists of iteratively picking a pair of taxa, creating a new mode which represents the cluster of these taxa, and reducing the distance matrix by replacing both taxa by this node. Moreover, BIONJ uses a simple first-order model of the variances and covariances of evolutionary distance estimates. This model is well adapted when these estimates are obtained from aligned sequences. At each step it permits the selection, from the class of admissible reductions, of the reduction which minimizes the variance of the new distance matrix. In this way, we obtain better estimates to choose the pair of taxa to be agglomerated during the next steps. Moreover, in comparison with NJ's estimates, these estimates become better and better as the algorithm proceeds. BIONJ retains the good properties of NJ--especially its low run time. Computer simulations have been performed with 12-taxon model trees to determine BIONJ's efficiency. When the substitution rates are low (maximum pairwise divergence approximately 0.1 substitutions per site) or when they are constant among lineages, BIONJ is only slightly better than NJ. When the substitution rates are higher and vary among lineages,BIONJ clearly has better topological accuracy. In the latter case, for the model trees and the conditions of evolution tested, the topological error reduction is on the average around 20%. With highly-varying-rate trees and with high substitution rates (maximum pairwise divergence approximately 1.0 substitutions per site), the error reduction may even rise above 50%, while the probability of finding the correct tree may be augmented by as much as 15%.

0 comments Cited 608 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

: Role: Editor

Journal

Journal ID (nlm-ta): PLoS Genet

Journal ID (publisher-id): plos

Journal ID (pmc): plosgen

Title: PLoS Genetics

Publisher: Public Library of Science (San Francisco, USA )

ISSN (Print): 1553-7390

ISSN (Electronic): 1553-7404

Publication date Collection: January 2011

Publication date (Print): January 2011

Publication date (Electronic): 27 January 2011

Volume: 7

Issue: 1

Electronic Location Identifier: e1001284

Affiliations

[1 ]Institut Pasteur, Microbial Evolutionary Genomics, Département Génomes et Génétique, Paris, France

[2 ]CNRS, URA2171, Paris, France

[3 ]UPMC Université Pierre et Marie Curie, Atelier de Bioinformatique, Paris, France

Yale University, United States of America

Author notes

* E-mail: treangen@ 123456pasteur.fr

¤: Current address: Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America

Conceived and designed the experiments: TJT EPCR. Performed the experiments: TJT. Analyzed the data: TJT EPCR. Contributed reagents/materials/analysis tools: TJT. Wrote the paper: TJT EPCR.

Article

Publisher ID: 10-PLGE-RA-EV-3419R3

DOI: 10.1371/journal.pgen.1001284

PMC ID: 3029252

PubMed ID: 21298028

SO-VID: de9fcc44-001d-43ad-a63f-a6e77ec649c0

Copyright © Treangen, Rocha. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

History

Date received : 14 June 2010

Date accepted : 20 December 2010

Page count

Pages: 12

Comments

Comment on this article

scite_

Cited by 207

See all cited by

Most referenced authors 1,471

See all reference authors

Horizontal Transfer, Not Duplication, Drives the Expansion of Protein Families in Prokaryotes

Read this article at

Abstract

Author Summary

Related collections

Genome Integrity

Most cited references 56

ISfinder: the reference centre for bacterial insertion sequences

The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data.

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Page count

Categories

Comments

Comment on this article

Similar content 21

Cited by 207

Most referenced authors 1,471