Anchorage Accurately Assembles Anchor-Flanked Synthetic Long Reads.

Lebniz Int Proc Inform

Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA, USA.

Published: August 2024

Modern sequencing technologies allow for the addition of short-sequence tags, known as anchors, to both ends of a captured molecule. Anchors are useful in assembling the full-length sequence of a captured molecule as they can be used to accurately determine the endpoints. One representative of such anchor-enabled technology is LoopSeq Solo, a synthetic long read (SLR) sequencing protocol. LoopSeq Solo also achieves ultra-high sequencing depth and high purity of short reads covering the entire captured molecule. Despite the availability of many assembly methods, constructing full-length sequence from these anchor-enabled, ultra-high coverage sequencing data remains challenging due to the complexity of the underlying assembly graphs and the lack of specific algorithms leveraging anchors. We present Anchorage, a novel assembler that performs anchor-guided assembly for ultra-high-depth sequencing data. Anchorage starts with a kmer-based approach for precise estimation of molecule lengths. It then formulates the assembly problem as finding an optimal path that connects the two nodes determined by anchors in the underlying compact de Bruijn graph. The optimality is defined as maximizing the weight of the smallest node while matching the estimated sequence length. Anchorage uses a modified dynamic programming algorithm to efficiently find the optimal path. Through both simulations and real data, we show that Anchorage outperforms existing assembly methods, particularly in the presence of sequencing artifacts. Anchorage fills the gap in assembling anchor-enabled data. We anticipate its broad use as anchor-enabled sequencing technologies become prevalent. Anchorage is freely available at https://github.com/Shao-Group/anchorage; the scripts and documents that can reproduce all experiments in this manuscript are available at https://github.com/Shao-Group/anchorage-test.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11702288PMC
http://dx.doi.org/10.4230/LIPIcs.WABI.2024.22DOI Listing

Publication Analysis

Top Keywords

captured molecule
12
synthetic long
8
sequencing technologies
8
full-length sequence
8
loopseq solo
8
assembly methods
8
sequencing data
8
data anchorage
8
optimal path
8
anchorage
7

Similar Publications

This study explores the formation of functionalized carbon surfaces through shock compression of graphite in the presence of water, modeled using molecular dynamics and the ReaxFF reactive force field. The shock compression method produces activated carbon with surface functionalities, primarily hydroxyl groups, and varying morphological properties. Two approaches, unidirectional and isotropic compression, yield distinct surface structures: the former preserves a relatively flat surface, while the latter generates corrugated features with valleys and ridges.

View Article and Find Full Text PDF

Oleogels (organogels) are systems resembling a solid substance based on the gelation of organic solvents (oil or non-polar liquid) through components of low molecular weight or oil-soluble polymers. Such compounds are organogelators that produce a thermoreversible three-dimensional gel network that captures liquid organic solvents. Oleogels based on natural oils are attracting more attention due to their numerous advantages, such as their unsaturated fatty acid contents, ease of preparation, and safety of use.

View Article and Find Full Text PDF

A Fluorine-Functionalized Tb(III)-Organic Framework for Ba Detection.

Molecules

December 2024

School of Food and Pharmaceutical Engineering, Zhaoqing University, Zhaoqing 526061, China.

The development of lanthanide-organic frameworks (Ln-MOFs) using for luminescence sensing and selective gas adsorption applications is of great significance from an energy and environmental perspective. This study reports the solvothermal synthesis of a fluorine-functionalized 3D microporous Tb-MOF with a face-centered cubic () topology constructed from hexanuclear clusters (TbO) bridged by fdpdc ligands, formulated as {[Tb(fdpdc)(-OH)(HO)]·4DMF} (), (fdpdc = 3-fluorobiphenyl-4,4'-dicarboxylate). Complex displays a 3D framework with the channel of 7.

View Article and Find Full Text PDF

Harnessing Halogenated Zeolitic Imidazolate Frameworks for Alcohol Vapor Adsorption.

Molecules

December 2024

Institut Européen des Membranes (IEM), CNRS, ENSCM, Univ Montpellier, Place Eugène Bataillon, 34095 Montpellier, France.

This study explores Zeolitic Imidazolate Frameworks (ZIFs) as promising materials for adsorbing alcohol vapors, one of the main contributors to air quality deterioration and adverse health effects. Indeed, this sub-class of Metal-Organic Frameworks (MOFs) offers a promising alternative to conventional adsorbents like zeolites and activated carbons for air purification. Specifically, this investigation focuses on ZIF-8_Br, a brominated version of ZIF-8_CH, to evaluate its ability to capture aliphatic alcohols at lower partial pressures.

View Article and Find Full Text PDF

Thermally activated delayed fluorescence (TADF) materials with high photoluminescence quantum yields and a fast reverse intersystem crossing (RISC) are of the highest interest for organic light-emitting diodes (OLEDs). In the past decade, triaryl boranes with multiple resonance effect (MR) have captured significant attention. The efficiency of MR-TADF emitters strongly depends on small singlet-triplet energy gaps (ΔE), but also on large reverse intersystem crossing (RISC) rate constants (k).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!