6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A chromosome-level reference genome of Ensete glaucum gives insight into diversity and chromosomal and repetitive sequence evolution in the Musaceae

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Ensete glaucum (2 n = 2 x = 18) is a giant herbaceous monocotyledonous plant in the small Musaceae family along with banana ( Musa). A high-quality reference genome sequence assembly of E. glaucum is a resource for functional and evolutionary studies of Ensete, Musaceae, and the Zingiberales.

          Findings

          Using Oxford Nanopore Technologies, chromosome conformation capture (Hi-C), Illumina and RNA survey sequence, supported by molecular cytogenetics, we report a high-quality 481.5 Mb genome assembly with 9 pseudo-chromosomes and 36,836 genes. A total of 55% of the genome is composed of repetitive sequences with predominantly LTR-retroelements (37%) and DNA transposons (7%). The single 5S ribosomal DNA locus had an exceptionally long monomer length of 1,056 bp, more than twice that of the monomers at multiple loci in Musa. A tandemly repeated satellite (1.1% of the genome, with no similar sequence in Musa) was present around all centromeres, together with a few copies of a long interspersed nuclear element (LINE) retroelement. The assembly enabled us to characterize in detail the chromosomal rearrangements occurring between E. glaucum and the x = 11 species of Musa. One E. glaucum chromosome has the same gene content as Musa acuminata, while others show multiple, complex, but clearly defined evolutionary rearrangements in the change between x= 9 and 11.

          Conclusions

          The advance towards a Musaceae pangenome including E. glaucum, tolerant of extreme environments, makes a complete set of gene alleles, copy number variation, and a reference for structural variation available for crop breeding and understanding environmental responses. The chromosome-scale genome assembly shows the nature of chromosomal fusion and translocation events during speciation, and features of rapid repetitive DNA change in terms of copy number, sequence, and genomic location, critical to understanding its role in diversity and evolution.

          Related collections

          Most cited references121

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Trimmomatic: a flexible trimmer for Illumina sequence data

          Motivation: Although many next-generation sequencing (NGS) read preprocessing tools already existed, we could not find any tool or combination of tools that met our requirements in terms of flexibility, correct handling of paired-end data and high performance. We have developed Trimmomatic as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data. Results: The value of NGS read preprocessing is demonstrated for both reference-based and reference-free tasks. Trimmomatic is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested. Availability and implementation: Trimmomatic is licensed under GPL V3. It is cross-platform (Java 1.5+ required) and available at http://www.usadellab.org/cms/index.php?page=trimmomatic Contact: usadel@bio1.rwth-aachen.de Supplementary information: Supplementary data are available at Bioinformatics online.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            STAR: ultrafast universal RNA-seq aligner.

            Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. To align our large (>80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of >50 in mapping speed, aligning to the human genome 550 million 2 × 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80-90% success rate, corroborating the high precision of the STAR mapping strategy. STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Fast gapped-read alignment with Bowtie 2.

              As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
                Bookmark

                Author and article information

                Contributors
                Journal
                Gigascience
                Gigascience
                gigascience
                GigaScience
                Oxford University Press
                2047-217X
                30 April 2022
                2022
                30 April 2022
                : 11
                : giac027
                Affiliations
                Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou 510650, China
                Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences , Guangzhou 510650, China
                College of Life Sciences, University of the Chinese Academy of Sciences , Beijing 100049, China
                Bioversity International, Parc Scientifique Agropolis II , 34397 Montpellier Cedex 5, France
                French Institute of Bioinformatics (IFB) - South Green Bioinformatics Platform, Alliance Bioversity and CIAT, CIRAD, INRAE, IRD , F-34398 Montpellier, France
                Department of Genetics and Genome Biology, University of Leicester , Leicester LE1 7RH, UK
                French Institute of Bioinformatics (IFB) - South Green Bioinformatics Platform, Alliance Bioversity and CIAT, CIRAD, INRAE, IRD , F-34398 Montpellier, France
                CIRAD, UMR AGAP Institut , F-34398 Montpellier, France
                UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro , F-34398 Montpellier, France
                Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou 510650, China
                Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences , Guangzhou 510650, China
                College of Life Sciences, University of the Chinese Academy of Sciences , Beijing 100049, China
                Bioversity International, Parc Scientifique Agropolis II , 34397 Montpellier Cedex 5, France
                CIRAD, UMR AGAP Institut , F-34398 Montpellier, France
                UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro , F-34398 Montpellier, France
                Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou 510650, China
                Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences , Guangzhou 510650, China
                Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou 510650, China
                Department of Genetics and Genome Biology, University of Leicester , Leicester LE1 7RH, UK
                Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou 510650, China
                Department of Genetics and Genome Biology, University of Leicester , Leicester LE1 7RH, UK
                Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou 510650, China
                Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences , Guangzhou 510650, China
                Author notes
                Correspondence address. Qing Liu. Key Laboratory of Plant Resources Conservation and Sustainable Utilization / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China. Pat Heslop-Harrison. Department of Genetics and Genome Biology, University of Leicester, Leicester, LE 7RH, UK Qing Liu. Key Laboratory of Plant Resources Conservation and Sustainable Utilization / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China E-mail: liuqing@ 123456scib.ac.cn
                Correspondence address. Pat Heslop-Harrison. Department of Genetics and Genome Biology, University of Leicester, Leicester, LE 7RH, UK. E-mail: phh4@ 123456le.ac.uk
                Author information
                https://orcid.org/0000-0002-5590-2437
                https://orcid.org/0000-0003-0284-1885
                https://orcid.org/0000-0001-9228-1927
                https://orcid.org/0000-0003-1849-1269
                https://orcid.org/0000-0003-0254-1364
                https://orcid.org/0000-0002-8309-3120
                https://orcid.org/0000-0002-5219-8771
                https://orcid.org/0000-0002-5008-9475
                https://orcid.org/0000-0001-8310-5489
                https://orcid.org/0000-0002-3105-2167
                https://orcid.org/0000-0003-2712-6028
                Article
                giac027
                10.1093/gigascience/giac027
                9055855
                35488861
                ec015a22-20cc-45fb-b28c-325057485827
                © The Author(s) 2022. Published by Oxford University Press GigaScience.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 03 November 2021
                : 26 January 2022
                : 22 February 2022
                Page count
                Pages: 21
                Funding
                Funded by: National Natural Science Foundation of China, DOI 10.13039/501100001809;
                Award ID: 32070359
                Funded by: Guangdong Basic and Applied Basic Research Foundation, DOI 10.13039/501100021171;
                Award ID: 2021A1515012410
                Funded by: SCBG;
                Award ID: Y861041001
                Funded by: Chinese Academy of Sciences, DOI 10.13039/501100002367;
                Award ID: KCJH-80107-2020-004-97
                Categories
                Data Note
                AcademicSubjects/SCI00960
                AcademicSubjects/SCI02254

                centromeres,chromosome-scale assembly,ensete glaucum,musaceae evolution,nanopore,pangenome,repetitive dna,retrotransposons,synteny,translocations

                Comments

                Comment on this article