The combination of long-reads and long-range information to produce genome assemblies is now accepted as a common standard. This strategy not only allow to access the gene catalogue of a given species but also reveals the architecture and organisation of chromosomes, including complex regions like telomeres and centromeres. The Brassica genus is not exempt and many assemblies based on long reads are now available. The reference genome for Brassica napus, Darmor-bzh, which was published in 2014, has been produced using short-reads and its contiguity was extremely low if compared to current assemblies of the Brassica genus.Here, we report the new long-reads assembly of Darmor-bzh genome (Brassica napus) generated by combining long-reads sequencing data, optical and genetic maps. Using the PromethION device and six flowcells, we generated about 16M long-reads representing 93X coverage and more importantly 6X with reads longer than 100Kb. This ultralong-reads dataset allows us to generate one of the most contiguous and complete assembly of a Brassica genome to date (contigs N50 > 10Mb). In addition, we exploited all the advantages of the nanopore technology to detect modified bases and sequence transcriptomic data using direct RNA to annotate the genome and focus on resistance genes.Using these cutting edge technologies, and in particular by relying on all the advantages of the nanopore technology, we provide the most contiguous Brassica napus assembly, a resource that will be valuable for the Brassica community for crop improvement and will facilitate the rapid selection of agronomically important traits.
Less...