Long-read sequencing and novel long-range assays have revolutionized de novo genome assembly by automating the reconstruction of reference-quality genomes. In particular, Hi-C sequencing is becoming an economical method for generating chromosome-scale scaffolds. Despite its increasing popularity, few open-source tools are available for Hi-C scaffolding. Error rates, particularly for inversions and fusions across chromosomes, remain higher than those of alternative scaffolding technologies. We present a novel open-source Hi-C scaffolder that does not require an a priori estimate of chromosome number and minimizes errors by scaffolding with the assistance of an assembly graph. We demonstrate higher accuracy than state-of-the-art methods across a variety of Hi-C library preparations and input assembly sizes. The Python and C++ code for our method is openly available at https://github.com/machinegun/SALSA.
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719893 | PMC |
http://dx.doi.org/10.1371/journal.pcbi.1007273 | DOI Listing |