69
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Models of Somatic Hypermutation Targeting and Substitution Based on Synonymous Mutations from High-Throughput Immunoglobulin Sequencing Data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Analyses of somatic hypermutation (SHM) patterns in B cell immunoglobulin (Ig) sequences contribute to our basic understanding of adaptive immunity, and have broad applications not only for understanding the immune response to pathogens, but also to determining the role of SHM in autoimmunity and B cell cancers. Although stochastic, SHM displays intrinsic biases that can confound statistical analysis, especially when combined with the particular codon usage and base composition in Ig sequences. Analysis of B cell clonal expansion, diversification, and selection processes thus critically depends on an accurate background model for SHM micro-sequence targeting (i.e., hot/cold-spots) and nucleotide substitution. Existing models are based on small numbers of sequences/mutations, in part because they depend on data from non-coding regions or non-functional sequences to remove the confounding influences of selection. Here, we combine high-throughput Ig sequencing with new computational analysis methods to produce improved models of SHM targeting and substitution that are based only on synonymous mutations, and are thus independent of selection. The resulting “S5F” models are based on 806,860 Synonymous mutations in 5-mer motifs from 1,145,182 Functional sequences and account for dependencies on the adjacent four nucleotides (two bases upstream and downstream of the mutation). The estimated profiles can explain almost half of the variance in observed mutation patterns, and clearly show that both mutation targeting and substitution are significantly influenced by neighboring bases. While mutability and substitution profiles were highly conserved across individuals, the variability across motifs was found to be much larger than previously estimated. The model and method source code are made available at http://clip.med.yale.edu/SHM

          Related collections

          Most cited references28

          • Record: found
          • Abstract: found
          • Article: not found

          IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains.

          IMGT, the international ImMunoGeneTics database (http://imgt.cines.fr) is a high quality integrated information system specializing in immunoglobulins (IG), T cell receptors (TR) and major histocompatibility complex (MHC) of human and other vertebrates. IMGT provides a common access to expertly annotated data on the genome, proteome, genetics and structure of the IG and TR, based on the IMGT Scientific chart and IMGT-ONTOLOGY. The IMGT unique numbering defined for the IG and TR variable regions and domains of all jawed vertebrates has allowed a redefinition of the limits of the framework (FR-IMGT) and complementarity determining regions (CDR-IMGT), leading, for the first time, to a standardized description of mutations, allelic polymorphisms, 2D representations (Colliers de Perles) and 3D structures, whatever the antigen receptor, the chain type, or the species. The IMGT numbering has been extended to the V-like domain and is, therefore, highly valuable for comparative analysis and evolution studies of proteins belonging to the IG superfamily.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The Interpretation of Interaction in Contingency Tables

            E. SIMPSON (1951)
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Quantifying selection in high-throughput Immunoglobulin sequencing data sets

              High-throughput immunoglobulin sequencing promises new insights into the somatic hypermutation and antigen-driven selection processes that underlie B-cell affinity maturation and adaptive immunity. The ability to estimate positive and negative selection from these sequence data has broad applications not only for understanding the immune response to pathogens, but is also critical to determining the role of somatic hypermutation in autoimmunity and B-cell cancers. Here, we develop a statistical framework for Bayesian estimation of Antigen-driven SELectIoN (BASELINe) based on the analysis of somatic mutation patterns. Our approach represents a fundamental advance over previous methods by shifting the problem from one of simply detecting selection to one of quantifying selection. Along with providing a more intuitive means to assess and visualize selection, our approach allows, for the first time, comparative analysis between groups of sequences derived from different germline V(D)J segments. Application of this approach to next-generation sequencing data demonstrates different selection pressures for memory cells of different isotypes. This framework can easily be adapted to analyze other types of DNA mutation patterns resulting from a mutator that displays hot/cold-spots, substitution preference or other intrinsic biases.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Immunol
                Front Immunol
                Front. Immunol.
                Frontiers in Immunology
                Frontiers Media S.A.
                1664-3224
                15 November 2013
                2013
                : 4
                : 358
                Affiliations
                [1] 1Bioengineering Program, Faculty of Engineering, Bar Ilan University , Ramat Gan, Israel
                [2] 2Department of Pathology, Yale School of Medicine , New Haven, CT, USA
                [3] 3Interdepartmental Program in Computational Biology and Bioinformatics, Yale University , New Haven, CT, USA
                [4] 4Department of Neurology, Yale School of Medicine , New Haven, CT, USA
                [5] 5Department of Science Education, Hofstra North Shore-LIJ School of Medicine , Hempstead, NY, USA
                [6] 6Human and Translational Immunology Program, Yale School of Medicine , New Haven, CT, USA
                [7] 7Department of Immunobiology, Yale School of Medicine , New Haven, CT, USA
                [8] 8Department of Genetics, Harvard Medical School , Boston, MA, USA
                [9] 9AbVitro, Inc. , Boston, MA, USA
                Author notes

                Edited by: Ramit Mehr, Bar-Ilan University, Israel

                Reviewed by: Ronald B. Corley, Boston University School of Medicine, USA; Masaki Hikida, Kyoto University, Japan

                *Correspondence: Steven H. Kleinstein, Department of Pathology, Yale School of Medicine, Suite 505, 300 George Street, New Haven, CT 06511, USA e-mail: steven.kleinstein@ 123456yale.edu

                This article was submitted to B Cell Biology, a section of the journal Frontiers in Immunology.

                Article
                10.3389/fimmu.2013.00358
                3828525
                24298272
                b6a17c84-7afb-40b6-a749-6c707d52e76e
                Copyright © 2013 Yaari, Vander Heiden, Uduman, Gadala-Maria, Gupta, Stern, O’Connor, Hafler, Laserson, Vigneault and Kleinstein.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 01 August 2013
                : 22 October 2013
                Page count
                Figures: 6, Tables: 2, Equations: 5, References: 29, Pages: 11, Words: 7381
                Categories
                Immunology
                Original Research

                Immunology
                substitution,aid,targeting,mutability,immunoglobulin,somatic hypermutation,affinity maturation,b cell

                Comments

                Comment on this article