Loading

International Journal of Plant Biology & Research

Characterization of the De novo Assembly Using Oxford Nanopore Sequencing Data

Editorial | Open Access | Volume 6 | Issue 2

  • 1. Department of Computer Science and Engineering, University of Nebraska-Lincoln, USA
+ Show More - Show Less
Corresponding Authors
Kan Liu, Department of Computer Science and Engineering, University of Nebraska-Lincoln, USA
Citation

Liu K (2018) Characterization of the De novo Assembly Using Oxford Nanopore Sequencing Data. Int J Plant Biol Res 6(2): 1087.

EDITORIAL

The MinION of Oxford Nanopore Technology (ONT) is a portable single-molecule sequencer released in 2014. As a portable nanopore-based sequencer [1], with just a USB drive size which can be conveniently used connected to a desktop or laptop using the USB 3.0 or higher interface, MinION contains pores embedded on a membrane that is placed over an electric grid. DNA/RNA molecules sequenced by MinION are basically measured by the ionic flow changes through the pores. As a thirdgeneration commercial sequencing platform product, Nanopore sequencing data can reach up to hundreds of thousands of nucleotides in a single run. Usually an average of 5,000 bp of the product long reads are expected for the DNA sequencing. When the two strands of the target sequence are both successfully basecalled, a consensus will be generated as a more accurate output (the “2D” sequence) compared with forward strand only output (“1D” sequence). Such long read sequencing technology enables great capacity in assembling large and complex genome data compared with using short reads only.

However, such long reads are generally much more errorprone (10~30% error rate) than short-read sequencing technologies such as Illumina, which generally makes it less competent in direct usage such as small structural variations (SNP and InDel) detections and other sequence analysis and applications. Nanopore sequencing shows a pattern of error in base calling being context-specific, for example, small variations of SNP such as TAG->TGG, TAC ->TCG are predominant in ONT. Early reports [2], showed about ~35% error rate in ONT data significantly hindered the wide application of this new technology. Therefore, Nanopore data usually needs to be corrected for the preprocessing. With the development of the base calling software and nanopore chemistry, a significant drop in error rate has been achieved [3]. Both short reads-based and long reads self-based correction strategies can be conducted based on the input data sets. Common error correction tools as nanocorrect [4], nanocorr [5], can be used for the correction of Nanopore data.

The basic data processing pipeline using ONT sequencing data contains: preprocessing, error correction, sequence assembly. For genome assembly, it is reported that using nanopore data only at about 30X genome coverage can be sufficient for assembling some small genomes such as E. coli [4]. Even after error correction, the assembly using Nanopore long reads is still quite challenging now since many existing assemblers were not designed to implement long reads with high error rate. Therefore, some hybrid methods incorporating both long and short reads for genome assembly are developed. The hybrid and non-hybrid de novo assembly strategies are both important depend on the data sets: short read will offer high quality for small region of sequence assembly as well as the scaffolding using paired end information, while Nanopore can recover long repetitive regions which cannot be fully reconstructed using short reads only. The selection between hybrid and non-hybrid assembly methods also depends on other considerations such as the size of the target genome, G+C content bias of the genome composition, the genome complexity (repetitive ratio and multiploidy, etc), also the costs and the bioinformatics analysis to be implemented for the project.

For de novo assembly tools can be used similar to PacBio reads such as PBc R and canu [6]. PBc R is an excellent assembler used in PacBio small and large genome de novo assembly using either hybrid or self-based methods. Canu can also help effectively assemble MinION data into genomes with high sequence accuracy. SPAdes [7] is another assembler used for both simple and multichromosome genomes. Other assemblers using de Bruijn graphbased methods such as Velvet [8] and ABySS [9], as well as using Overlap Layout Consensus (OLC)-based methods such as Celera assembler named CABOG [10]. Also other greedy graph-based package such as SSAKE [11], can also be applied in Nanopore reads de novo assembly. A survey [12] of the benchmarking of those assemblers in their performance using Nanopore long genome reads showed that an ideal strategy of long read assembly should first choose the OLC-based assemblers for a higher initial N50 value & mean and then use the de Bruijn graph-based algorithms for the accuracy improvement. NaS pipeline [13] combines both de Bruijn graphs and OLC approach for the error-free DNA reads assembly.

For plant genome sequence research, breakthroughs using Nanopore technology have proved the competence of long read in genome assembly of generating very large and long contiguous sequences (contigs) of complex genomes. For Oryza coarctata [14], a tetraploid Asian wild rice of approximately 660 Mb in size, a draft genome of multi-chromosome assembled using both hybrid and non-hybrid assembly strategies demonstrated the ability of de novo assembly using Nanopore high-noise single molecule sequencing reads. Arabidopsis thaliana genome is also successfully reconstructed using Nanopore reads in a fast and cost-effective implementation. Other plant pathogen genome assemblies using Nanopore genomic dataset such as Agrobacterium tumefaciens [15], also provided more evidences on long reads being effectively assembled into multi-chromosomal genomes with small number of contigs and high accuracy.

REFERENCES

1. Cherf GM, Lieberman KR, Rashid H, Lam CE, Karplus K, Akeson M. Automated forward and reverse ratcheting of DNA in a nanopore at 5-Å precision. Nat Biotechnol. 2012; 30: 344-348.

2. Laver T, Harrison J, O’Neill PA, Moore K, Farbos A, Paszkiewicz K, et al., Assessing the Performance of the Oxford Nanopore Technologies Minion. Biomol Detect Quantif. 2015; 3: 1-8.

3. Jain M, Hugh E. Olsen, Benedict Paten, Mark Akeson. The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol. 2016; 17: 239.

4. Loman NJ, Quick J, Simpson JT. A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat Methods. 2015; 12: 733-735.

5. Goodwin S, Gurtowski J, Ethe Sayers S, Deshpande P, Schatz MC, McCombie WR. Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome. Genome Res. 2015; 25: 1750-1756.

6. Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017; 27: 722-736.

7. Bankevich Anton, Sergey Nurk, Dmitry Antipov, Alexey A. Gurevich, Mikhail Dvorkin, Alexander S. Kulikov, et al. SPAdes: A New Genome Assembly Algorithm and its Applications to Single-Cell Sequencing. J Comput Biol. 2012; 19: 455-477.

8. Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008; 18: 821-829.

9. Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I. ABySS: a parallel assembler for short read sequence data. Genome Res. 2009; 19: 1117-1123.

10. Myers EW. Toward simplifying and accurately formulating fragment assembly. J Comput Biol. 1995; 2: 275-290.

11. Miller JR, Koren S, Sutton G. Assembly algorithms for next-generation sequencing data. Genomics. 2010; 95: 315-327.

12. Cherukuri Y, Janga SC. Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches. BMC Genomics. 2016; 17: 507.

13. Madoui MA, Engelen S, Cruaud C, Belser C, Bertrand L, Alberti A, et al. Genome assembly using Nanopore-guided long and error-free DNA reads. BMC Genomics. 2015; 16: 327.

14. Mondal TK, Rawal HC, Gaikwad K, Sharma TR, Singh NK. First de novo draft genome sequence of Oryza coarctata, the only halophytic species in the genus Oryza. F1000Res. 2017; 6: 1750.

15. Deschamps S, Mudge J, Cameron C, Ramaraj T, Anand A, Fengler K, et al. Characterization, correction and de novo assembly of an Oxford Nanopore genomic dataset from Agrobacterium tumefaciens. Sci Rep. 2016; 6: 28625.

: Liu K (2018) Characterization of the De novo Assembly Using Oxford Nanopore Sequencing Data. Int J Plant Biol Res 6(2): 1087.

Received : 22 Mar 2018
Accepted : 24 Mar 2018
Published : 27 Mar 2018
Journals
Annals of Otolaryngology and Rhinology
ISSN : 2379-948X
Launched : 2014
JSM Schizophrenia
Launched : 2016
Journal of Nausea
Launched : 2020
JSM Internal Medicine
Launched : 2016
JSM Hepatitis
Launched : 2016
JSM Oro Facial Surgeries
ISSN : 2578-3211
Launched : 2016
Journal of Human Nutrition and Food Science
ISSN : 2333-6706
Launched : 2013
JSM Regenerative Medicine and Bioengineering
ISSN : 2379-0490
Launched : 2013
JSM Spine
ISSN : 2578-3181
Launched : 2016
Archives of Palliative Care
ISSN : 2573-1165
Launched : 2016
JSM Nutritional Disorders
ISSN : 2578-3203
Launched : 2017
Annals of Neurodegenerative Disorders
ISSN : 2476-2032
Launched : 2016
Journal of Fever
ISSN : 2641-7782
Launched : 2017
JSM Bone Marrow Research
ISSN : 2578-3351
Launched : 2016
JSM Mathematics and Statistics
ISSN : 2578-3173
Launched : 2014
Journal of Autoimmunity and Research
ISSN : 2573-1173
Launched : 2014
JSM Arthritis
ISSN : 2475-9155
Launched : 2016
JSM Head and Neck Cancer-Cases and Reviews
ISSN : 2573-1610
Launched : 2016
JSM General Surgery Cases and Images
ISSN : 2573-1564
Launched : 2016
JSM Anatomy and Physiology
ISSN : 2573-1262
Launched : 2016
JSM Dental Surgery
ISSN : 2573-1548
Launched : 2016
Annals of Emergency Surgery
ISSN : 2573-1017
Launched : 2016
Annals of Mens Health and Wellness
ISSN : 2641-7707
Launched : 2017
Journal of Preventive Medicine and Health Care
ISSN : 2576-0084
Launched : 2018
Journal of Chronic Diseases and Management
ISSN : 2573-1300
Launched : 2016
Annals of Vaccines and Immunization
ISSN : 2378-9379
Launched : 2014
JSM Heart Surgery Cases and Images
ISSN : 2578-3157
Launched : 2016
Annals of Reproductive Medicine and Treatment
ISSN : 2573-1092
Launched : 2016
JSM Brain Science
ISSN : 2573-1289
Launched : 2016
JSM Biomarkers
ISSN : 2578-3815
Launched : 2014
JSM Biology
ISSN : 2475-9392
Launched : 2016
Archives of Stem Cell and Research
ISSN : 2578-3580
Launched : 2014
Annals of Clinical and Medical Microbiology
ISSN : 2578-3629
Launched : 2014
JSM Pediatric Surgery
ISSN : 2578-3149
Launched : 2017
Journal of Memory Disorder and Rehabilitation
ISSN : 2578-319X
Launched : 2016
JSM Tropical Medicine and Research
ISSN : 2578-3165
Launched : 2016
JSM Head and Face Medicine
ISSN : 2578-3793
Launched : 2016
JSM Cardiothoracic Surgery
ISSN : 2573-1297
Launched : 2016
JSM Bone and Joint Diseases
ISSN : 2578-3351
Launched : 2017
JSM Bioavailability and Bioequivalence
ISSN : 2641-7812
Launched : 2017
JSM Atherosclerosis
ISSN : 2573-1270
Launched : 2016
Journal of Genitourinary Disorders
ISSN : 2641-7790
Launched : 2017
Journal of Fractures and Sprains
ISSN : 2578-3831
Launched : 2016
Journal of Autism and Epilepsy
ISSN : 2641-7774
Launched : 2016
Annals of Marine Biology and Research
ISSN : 2573-105X
Launched : 2014
JSM Health Education & Primary Health Care
ISSN : 2578-3777
Launched : 2016
JSM Communication Disorders
ISSN : 2578-3807
Launched : 2016
Annals of Musculoskeletal Disorders
ISSN : 2578-3599
Launched : 2016
Annals of Virology and Research
ISSN : 2573-1122
Launched : 2014
JSM Renal Medicine
ISSN : 2573-1637
Launched : 2016
Journal of Muscle Health
ISSN : 2578-3823
Launched : 2016
JSM Genetics and Genomics
ISSN : 2334-1823
Launched : 2013
JSM Anxiety and Depression
ISSN : 2475-9139
Launched : 2016
Clinical Journal of Heart Diseases
ISSN : 2641-7766
Launched : 2016
Annals of Medicinal Chemistry and Research
ISSN : 2378-9336
Launched : 2014
JSM Pain and Management
ISSN : 2578-3378
Launched : 2016
JSM Women's Health
ISSN : 2578-3696
Launched : 2016
Clinical Research in HIV or AIDS
ISSN : 2374-0094
Launched : 2013
Journal of Endocrinology, Diabetes and Obesity
ISSN : 2333-6692
Launched : 2013
Journal of Substance Abuse and Alcoholism
ISSN : 2373-9363
Launched : 2013
JSM Neurosurgery and Spine
ISSN : 2373-9479
Launched : 2013
Journal of Liver and Clinical Research
ISSN : 2379-0830
Launched : 2014
Journal of Drug Design and Research
ISSN : 2379-089X
Launched : 2014
JSM Clinical Oncology and Research
ISSN : 2373-938X
Launched : 2013
JSM Bioinformatics, Genomics and Proteomics
ISSN : 2576-1102
Launched : 2014
JSM Chemistry
ISSN : 2334-1831
Launched : 2013
Journal of Trauma and Care
ISSN : 2573-1246
Launched : 2014
JSM Surgical Oncology and Research
ISSN : 2578-3688
Launched : 2016
Annals of Food Processing and Preservation
ISSN : 2573-1033
Launched : 2016
Journal of Radiology and Radiation Therapy
ISSN : 2333-7095
Launched : 2013
JSM Physical Medicine and Rehabilitation
ISSN : 2578-3572
Launched : 2016
Annals of Clinical Pathology
ISSN : 2373-9282
Launched : 2013
Annals of Cardiovascular Diseases
ISSN : 2641-7731
Launched : 2016
Journal of Behavior
ISSN : 2576-0076
Launched : 2016
Annals of Clinical and Experimental Metabolism
ISSN : 2572-2492
Launched : 2016
Clinical Research in Infectious Diseases
ISSN : 2379-0636
Launched : 2013
JSM Microbiology
ISSN : 2333-6455
Launched : 2013
Journal of Urology and Research
ISSN : 2379-951X
Launched : 2014
Journal of Family Medicine and Community Health
ISSN : 2379-0547
Launched : 2013
Annals of Pregnancy and Care
ISSN : 2578-336X
Launched : 2017
JSM Cell and Developmental Biology
ISSN : 2379-061X
Launched : 2013
Annals of Aquaculture and Research
ISSN : 2379-0881
Launched : 2014
Clinical Research in Pulmonology
ISSN : 2333-6625
Launched : 2013
Journal of Immunology and Clinical Research
ISSN : 2333-6714
Launched : 2013
Annals of Forensic Research and Analysis
ISSN : 2378-9476
Launched : 2014
JSM Biochemistry and Molecular Biology
ISSN : 2333-7109
Launched : 2013
Annals of Breast Cancer Research
ISSN : 2641-7685
Launched : 2016
Annals of Gerontology and Geriatric Research
ISSN : 2378-9409
Launched : 2014
Journal of Sleep Medicine and Disorders
ISSN : 2379-0822
Launched : 2014
JSM Burns and Trauma
ISSN : 2475-9406
Launched : 2016
Chemical Engineering and Process Techniques
ISSN : 2333-6633
Launched : 2013
Annals of Clinical Cytology and Pathology
ISSN : 2475-9430
Launched : 2014
JSM Allergy and Asthma
ISSN : 2573-1254
Launched : 2016
Journal of Neurological Disorders and Stroke
ISSN : 2334-2307
Launched : 2013
Annals of Sports Medicine and Research
ISSN : 2379-0571
Launched : 2014
JSM Sexual Medicine
ISSN : 2578-3718
Launched : 2016
Annals of Vascular Medicine and Research
ISSN : 2378-9344
Launched : 2014
JSM Biotechnology and Biomedical Engineering
ISSN : 2333-7117
Launched : 2013
Journal of Hematology and Transfusion
ISSN : 2333-6684
Launched : 2013
JSM Environmental Science and Ecology
ISSN : 2333-7141
Launched : 2013
Journal of Cardiology and Clinical Research
ISSN : 2333-6676
Launched : 2013
JSM Nanotechnology and Nanomedicine
ISSN : 2334-1815
Launched : 2013
Journal of Ear, Nose and Throat Disorders
ISSN : 2475-9473
Launched : 2016
JSM Ophthalmology
ISSN : 2333-6447
Launched : 2013
Journal of Pharmacology and Clinical Toxicology
ISSN : 2333-7079
Launched : 2013
Annals of Psychiatry and Mental Health
ISSN : 2374-0124
Launched : 2013
Medical Journal of Obstetrics and Gynecology
ISSN : 2333-6439
Launched : 2013
Annals of Pediatrics and Child Health
ISSN : 2373-9312
Launched : 2013
JSM Clinical Pharmaceutics
ISSN : 2379-9498
Launched : 2014
JSM Foot and Ankle
ISSN : 2475-9112
Launched : 2016
JSM Alzheimer's Disease and Related Dementia
ISSN : 2378-9565
Launched : 2014
Journal of Addiction Medicine and Therapy
ISSN : 2333-665X
Launched : 2013
Journal of Veterinary Medicine and Research
ISSN : 2378-931X
Launched : 2013
Annals of Public Health and Research
ISSN : 2378-9328
Launched : 2014
Annals of Orthopedics and Rheumatology
ISSN : 2373-9290
Launched : 2013
Journal of Clinical Nephrology and Research
ISSN : 2379-0652
Launched : 2014
Annals of Community Medicine and Practice
ISSN : 2475-9465
Launched : 2014
Annals of Biometrics and Biostatistics
ISSN : 2374-0116
Launched : 2013
JSM Clinical Case Reports
ISSN : 2373-9819
Launched : 2013
Journal of Cancer Biology and Research
ISSN : 2373-9436
Launched : 2013
Journal of Surgery and Transplantation Science
ISSN : 2379-0911
Launched : 2013
Journal of Dermatology and Clinical Research
ISSN : 2373-9371
Launched : 2013
JSM Gastroenterology and Hepatology
ISSN : 2373-9487
Launched : 2013
Annals of Nursing and Practice
ISSN : 2379-9501
Launched : 2014
JSM Dentistry
ISSN : 2333-7133
Launched : 2013
Author Information X