SLHSD: hybrid scaffolding method based on short and long reads.

Brief Bioinform

School of Computer and Information Engineering, Henan University, Kaifeng 475001, China.

Published: May 2023

In genome assembly, scaffolding orients and orders contigs to obtain more complete and continuous scaffolds. Current scaffolding methods usually adopt one type of read to construct a scaffold graph and then orient and order the contigs. However, combining the strengths of two or more types of reads appears to be a better solution to some difficult cases. Here, a hybrid scaffolding method (SLHSD) is presented that simultaneously leverages the precision of short reads and the length advantage of long reads. Building an optimal scaffold graph is the foundation for obtaining good scaffolds. SLHSD uses a new algorithm that combines long-read and short-read alignment information to decide whether to add an edge to the scaffold graph and how to calculate its weight. In addition, SLHSD adopts a strategy that ensures high-confidence edges are added to the graph first. A linear programming model is then used to detect and remove the remaining false edges. We compared SLHSD with other scaffolding methods on five datasets, and the experimental results show that SLHSD outperforms them. The open-source code of SLHSD is available at https://github.com/luojunwei/SLHSD.
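The idea of adding high-confidence edges first can be illustrated with a minimal sketch. It is not SLHSD's actual algorithm: the abstract does not give the weighting formula, so the `weight` function, the `long_factor` and `min_weight` parameters, and the `Link` record are all hypothetical. The sketch assumes each candidate link joins two contig ends, carries a short-read and a long-read support count, and that each contig end may be used by at most one edge. The linear programming step that removes false edges is not shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Link:
    """Candidate join between two contig ends (hypothetical record)."""
    a: str               # contig end, e.g. "c1:R" = right end of contig c1
    b: str
    short_support: int   # number of short-read pairs supporting the join
    long_support: int    # number of long reads spanning the join


def weight(link, long_factor=2.0):
    # Assumed scoring: a plain weighted sum of the two evidence types.
    # The published method defines its own edge weight.
    return link.short_support + long_factor * link.long_support


def build_scaffold_graph(links, min_weight=3.0):
    """Greedily add edges in order of decreasing weight."""
    used = set()    # contig ends that already have an edge
    edges = []
    for link in sorted(links, key=weight, reverse=True):
        w = weight(link)
        if w < min_weight:
            break                      # all remaining links are weaker
        if link.a in used or link.b in used:
            continue                   # conflicts with a stronger edge
        used.update((link.a, link.b))
        edges.append((link.a, link.b, w))
    return edges


links = [
    Link("c1:R", "c2:L", short_support=5, long_support=3),
    Link("c1:R", "c3:L", short_support=2, long_support=1),
    Link("c2:R", "c3:L", short_support=0, long_support=4),
    Link("c3:R", "c4:L", short_support=1, long_support=0),
]
edges = build_scaffold_graph(links)
```

In this toy input the weaker `c1:R` to `c3:L` link is skipped because `c1:R` is already taken by a better-supported edge, and the last link falls below the threshold.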

Source
http://dx.doi.org/10.1093/bib/bbad169
