• Home/
  • Chasing perfection: validation and polishing strategies for telomere-to-telomere genome assemblies

Chasing perfection: validation and polishing strategies for telomere-to-telomere genome assemblies

bioRxiv 2021
Mc Cartney A. et al

Ann M. Mc CartneyKishwar ShafinMichael AlongeAndrey V. BzikadzeGiulio FormentiArkarachai FungtammasanKerstin HoweChirag JainSergey KorenGlennis A. LogsdonKaren H. MigaAlla MikheenkoBenedict PatenAlaina ShumateDaniela C. SotoIvan SovićJonathan MD WoodJustin M. ZookAdam M. PhillippyArang Rhie

Advances in long-read sequencing technologies and genome assembly methods have enabled the recent completion of the first Telomere-to-Telomere (T2T) human genome assembly, which resolves complex segmental duplications and large tandem repeats, including centromeric satellite arrays in a complete hydatidiform mole (CHM13). Though derived from highly accurate sequencing, evaluation revealed that the initial T2T draft assembly had evidence of small errors and structural misassemblies. To correct these errors, we designed a novel repeat-aware polishing strategy that made accurate assembly corrections in large repeats without overcorrection, ultimately fixing 51% of the existing errors and improving the assembly QV to 73.9. By comparing our results to standard automated polishing tools, we outline common polishing errors and offer practical suggestions for genome projects with limited resources. We also show how sequencing biases in both PacBio HiFi and Oxford Nanopore Technologies reads cause signature assembly errors that can be corrected with a diverse panel of sequencing technologies

Top

This website stores cookies on your computer. These cookies are used to collect information about how you interact with our website and allow us to remember you. We use this information in order to improve and customize your browsing experience and for analytics and metrics about our visitors both on this website and other media. To find out more about the cookies we use, see our Privacy Policy.