18
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Improving Illumina assemblies with Hi‐C and long reads: An example with the North African dromedary

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Researchers have assembled thousands of eukaryotic genomes using Illumina reads, but traditional mate‐pair libraries cannot span all repetitive elements, resulting in highly fragmented assemblies. However, both chromosome conformation capture techniques, such as Hi‐C and Dovetail Genomics Chicago libraries and long‐read sequencing, such as Pacific Biosciences and Oxford Nanopore, help span and resolve repetitive regions and therefore improve genome assemblies. One important livestock species of arid regions that does not have a high‐quality contiguous reference genome is the dromedary ( Camelus dromedarius). Draft genomes exist but are highly fragmented, and a high‐quality reference genome is needed to understand adaptation to desert environments and artificial selection during domestication. Dromedaries are among the last livestock species to have been domesticated, and together with wild and domestic Bactrian camels, they are the only representatives of the Camelini tribe, which highlights their evolutionary significance. Here we describe our efforts to improve the North African dromedary genome. We used Chicago and Hi‐C sequencing libraries from Dovetail Genomics to resolve the order of previously assembled contigs, producing almost chromosome‐level scaffolds. Remaining gaps were filled with Pacific Biosciences long reads, and then scaffolds were comparatively mapped to chromosomes. Long reads added 99.32 Mbp to the total length of the new assembly. Dovetail Chicago and Hi‐C libraries increased the longest scaffold over 12‐fold, from 9.71 Mbp to 124.99 Mbp and the scaffold N50 over 50‐fold, from 1.48 Mbp to 75.02 Mbp. We demonstrate that Illumina de novo assemblies can be substantially upgraded by combining chromosome conformation capture and long‐read sequencing.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: not found

          Nanopore sensors for nucleic acid analysis.

          Nanopore analysis is an emerging technique that involves using a voltage to drive molecules through a nanoscale pore in a membrane between two electrolytes, and monitoring how the ionic current through the nanopore changes as single molecules pass through it. This approach allows charged polymers (including single-stranded DNA, double-stranded DNA and RNA) to be analysed with subnanometre resolution and without the need for labels or amplification. Recent advances suggest that nanopore-based sensors could be competitive with other third-generation DNA sequencing technologies, and may be able to rapidly and reliably sequence the human genome for under $1,000. In this article we review the use of nanopore technology in DNA sequencing, genetics and medical diagnostics.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A beginner's guide to eukaryotic genome annotation.

            The falling cost of genome sequencing is having a marked impact on the research community with respect to which genomes are sequenced and how and where they are annotated. Genome annotation projects have generally become small-scale affairs that are often carried out by an individual laboratory. Although annotating a eukaryotic genome assembly is now within the reach of non-experts, it remains a challenging task. Here we provide an overview of the genome annotation process and the available tools and describe some best-practice approaches.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Quake: quality-aware detection and correction of sequencing errors

              We introduce Quake, a program to detect and correct errors in DNA sequencing reads. Using a maximum likelihood approach incorporating quality values and nucleotide specific miscall rates, Quake achieves the highest accuracy on realistically simulated reads. We further demonstrate substantial improvements in de novo assembly and SNP detection after using Quake. Quake can be used for any size project, including more than one billion human reads, and is freely available as open source software from http://www.cbcb.umd.edu/software/quake.
                Bookmark

                Author and article information

                Contributors
                jean.elbers@gmail.com
                pamela.burger@vetmeduni.ac.at
                Journal
                Mol Ecol Resour
                Mol Ecol Resour
                10.1111/(ISSN)1755-0998
                MEN
                Molecular Ecology Resources
                John Wiley and Sons Inc. (Hoboken )
                1755-098X
                1755-0998
                17 May 2019
                July 2019
                : 19
                : 4 ( doiID: 10.1111/men.2019.19.issue-4 )
                : 1015-1026
                Affiliations
                [ 1 ] Department of Integrative Biology and Evolution Research Institute of Wildlife Ecology, Vetmeduni Vienna Vienna Austria
                [ 2 ] Intelligent Systems Laboratory University of Bristol Bristol UK
                [ 3 ] Institute of Molecular and Cellular Biology SB RAS and Novosibirsk State University Novosibirsk Russia
                [ 4 ] The Walter Reed Biosystematics Unit, Smithsonian Institution Museum Support Center MRC‐534 Suitland Maryland
                [ 5 ] Department of Animal Genetics, Faculty of Veterinary Medicine, Ceitec VFU, RG Animal Immunogenomics University of Veterinary and Pharmaceutical Sciences Brno Czech Republic
                [ 6 ] Department of Biostatistics University of Oslo Oslo Norway
                [ 7 ] Department of Mathematics and Statistics University of Helsinki Helsinki Finland
                [ 8 ] Bristol Medical School: Translational Health Sciences, Molecular Neuroendocrinology Research Group University of Bristol Bristol UK
                Author notes
                [*] [* ] Correspondence

                Jean Elbers and Pamela Burger, Department of Integrative Biology and Evolution, Research Institute of Wildlife Ecology, Vetmeduni Vienna, Vienna, Austria.

                Emails: jean.elbers@ 123456gmail.com ; pamela.burger@ 123456vetmeduni.ac.at

                Author information
                https://orcid.org/0000-0002-5593-4447
                Article
                MEN13020
                10.1111/1755-0998.13020
                6618069
                30972949
                3335edb9-7c06-467a-b6dc-2aecadddc9f4
                © 2019 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd

                This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

                History
                : 16 August 2018
                : 24 March 2019
                : 25 March 2019
                Page count
                Figures: 5, Tables: 1, Pages: 12, Words: 9462
                Funding
                Funded by: Russian Science Foundation
                Award ID: 16-14-10009
                Funded by: Russian Foundation for Basic Research
                Award ID: 17-00-00146
                Funded by: Leverhulme Trust
                Award ID: RPG-2017-287
                Funded by: Austrian Science Fund
                Award ID: P29623-B25
                Categories
                Resource Article
                RESOURCE ARTICLES
                Permanent Genetic Resources
                Custom metadata
                2.0
                men13020
                July 2019
                Converter:WILEY_ML3GV2_TO_NLMPMC version:5.6.5 mode:remove_FC converted:10.07.2019

                Ecology
                chromosome conformation capture,chromosome mapping,dromedary,genome annotation,genome assembly,scaffolding

                Comments

                Comment on this article