A turning point in cancer research is the introduction of massively parallel sequencing technology which greatly reduced the cost and time for genome sequencing. This enhanced the scope for detecting and analyzing the role of structural alterations in cancer. However, certain bias exists in NGS-based approaches, which badly affects the CNV identification process. Moreover, DNA repeats existing in CNV regions need special attention as they will degrade the performance of majority of the existing CNV detection tools, even after applying generalized bias correction method. This motivated this work, where a novel method has been designed to address the issue of DNA repeats and thereby mappability bias existing in regions of CNV. The method consists of three phases, where the first phase computes the alignment information of uniquely mapped DNA reads, considering the base quality and base mismatch parameters at nucleotide level precision. The second and the third phase use a novel approach to allocate the non-uniquely mapped reads to an optimal region of the DNA repeats based on a probabilistic membership model. The proposed method is capable of identifying CNVs present in coding, as well as non-coding region of the DNA, and is also capable of detecting CNVs existing in DNA repeat regions. The methodology achieves a sensitivity greater than [Formula: see text] during the performed simulations, and on real data, the detected variants are validated with the database of genomic variants, where the percentage overlap is also greater than 95%, and has achieved much better breakpoint prediction, as compared with other popular bias correction CNV detection methods.

Download full-text PDF

Source
http://dx.doi.org/10.1142/S0219720024500094DOI Listing

Publication Analysis

Top Keywords

dna repeats
12
novel method
8
mappability bias
8
existing cnv
8
cnv detection
8
bias correction
8
region dna
8
dna
7
bias
5
method addressing
4

Similar Publications

Since microRNAs (miRNAs) serve as markers for early cancer diagnosis, it is crucial to develop a novel biosensor to detect miRNAs quickly, sensitively and selectively. Hence, we developed a fluorescence biosensor based on target miRNA-initiated rolling circle amplification (RCA) to generate RCA products with multiple tandem catalytic hairpin DNA templates that trigger primer exchange reactions (PER) which extend short single-strand DNA (ssDNA) primers into long ssDNA. Subsequently, the long ssDNA activates the -cleavage activity of the clustered regularly interspaced short palindromic repeats (CRISPR)/Cas12a system to cleave a fluorescent reporter chain, enabling ultrasensitive detection of miRNAs through the output fluorescence signal.

View Article and Find Full Text PDF

'Splice-at-will' Cas12a crRNA engineering enabled direct quantification of ultrashort RNAs.

Nucleic Acids Res

January 2025

Key Laboratory of Applied Surface and Colloid Chemistry, Ministry of Education, Key Laboratory of Analytical Chemistry for Life Science of Shaanxi Province, School of Chemistry & Chemical Engineering, Shaanxi Normal University, 620 West Chang'an Avenue, Chang'an District, Xi'an, Shaanxi 710119, P.R. China.

We present a robust 'splice-at-will' CRISPR RNA (crRNA) engineering mechanism that overcomes the limitations of clustered regularly interspaced short palindromic repeats (CRISPR)/Cas system in directly detecting ultrashort RNAs. In this strategy, an intact Cas12a crRNA can be split from almost any site of the spacer region to obtain a truncated crRNA (tcrRNA) that cannot activate Cas12a even after binding an auxiliary DNA activator. While splicing tcrRNAs with a moiety of ultrashort RNA, the formed combination can work together to activate Cas12a efficiently, enabling 'splice-at-will' crRNA engineering.

View Article and Find Full Text PDF

The complete chloroplast genome of Diels (Apioideae), a traditional medicinal herb in China.

Mitochondrial DNA B Resour

January 2025

Enshi Tujia & Miao Autonomous Prefecture Academy of Agricultural Sciences, Enshi, Hubei, China.

Heracleum hemsleyanum Diels is a traditional medicinal herb in China. We reported its first complete chloroplast genome. The chloroplast genome was 146,775 bp in length with 37.

View Article and Find Full Text PDF

Using the Telomere-to-Telomere reference, we assembled the distribution of simple repeat lengths present in the human genome. Analyzing over two hundred mammalian genomes, we found remarkable consistency in the shape of the distribution across evolutionary epochs. All observed genomes harbor an excess of long repeats, which are prone to developing into repeat expansion disorders.

View Article and Find Full Text PDF

The natural compound orotic acid and its anionic form, orotate, play a pivotal role in various biological processes, serving as essential intermediates in pyrimidine de novo synthesis, with demonstrated connections to dietary, supplement, and neurodrug applications. A novel perspective on biomolecular aggregation at the nanoscale, particularly pertinent to neurodegeneration, challenges the established paradigm positing that peptide (amyloid beta) and protein (tau) aggregation mainly govern the molecular events underlying prevalent neuropathologies. Emerging biological evidence indicates a notable role for G-quadruplex (G4) DNA aggregation in neurodegenerative processes affecting neuronal cells, particularly in the presence of extended (GC) repeats in nuclear DNA sequences.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!