Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.03.09.531669v1?rss=1
Authors: Hu, J., Wang, Z., Sun, Z., Hu, B., Ayoola, A. O., Liang, F., Li, J., Sandoval, J. R., Cooper, D. N., Ye, K., Ruan, J., Xiao, C.-L., Wang, D., Wu, D.-D., Wang, S.
Abstract: Long read sequencing data, particularly those derived from the Oxford Nanopore (ONT) sequencing platform, tend to exhibit a high error rate. Here, we present NextDenovo, a highly efficient error correction and assembly tool for noisy long reads, which achieves a high level of accuracy in genome assembly. NextDenovo can rapidly correct reads; these corrected reads contain fewer errors than other comparable tools and are characterized by fewer chimeric alignments. We applied NextDenovo to the assembly of high quality reference genomes of 35 diverse humans from across the world using ONT Nanopore long read sequencing data. Based on these de novo genome assemblies, we were able to identify the landscape of segmental duplications and gene copy number variation in the modern human population. The use of the NextDenovo program should pave the way for population-scale long-read assembly, thereby facilitating the construction of human pan-genomes, using Nanopore long read sequencing data.
Copy rights belong to original authors. Visit the link for more info
Podcast created by Paper Player, LLC