Fast sequence analysis based on diamond sampling.

PLoS One

Institute of Machine Learning and Systems Biology, School of Electronics and Information Engineering, Tongji University, Shanghai, China.

Published: December 2018

Both in DNA and protein contexts, an important method for modelling motifs is to utilize position weight matrix (PWM) in biological sequences. With the development of genome sequencing technology, the quantity of the sequence data is increasing explosively, so the faster searching algorithms which have the ability to meet the increasingly need are desired to develop. In this paper, we proposed a method for speeding up the searching process of candidate transcription factor binding sites (TFBS), and the users can be allowed to specify p threshold to get the desired trade-off between speed and sensitivity for a particular sequence analysis. Moreover, the proposed method can also be generalized to large-scale annotation and sequence projects.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6023231PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0198922PLOS

Publication Analysis

Top Keywords

sequence analysis
8
proposed method
8
fast sequence
4
analysis based
4
based diamond
4
diamond sampling
4
sampling dna
4
dna protein
4
protein contexts
4
contexts method
4

Similar Publications

Introduction: The methicillin-resistant Staphylococcus aureus (MRSA) genome varies by geographical location. This study aims to determine the genomic characteristics of MRSA using whole-genome sequencing (WGS) data from medical centers in Mexico and to explore the associations between antimicrobial resistance genes and virulence factors.

Methods: This study included 27 clinical isolates collected from sterile sites at eight centers in Mexico in 2022 and 2023.

View Article and Find Full Text PDF

The Tapetum Determinant 1 (TPD1) family proteins are known to play a crucial role in the regulation of reproduction in plants, including Cenchrus americanus (pearl millet). However, members of TPD1 family proteins have not been fully identified. The current study aims to identify and characterize the TPD1 family proteins in Cenchrus americanus (L.

View Article and Find Full Text PDF

An obligately anaerobic, spore-forming sulphate-reducing bacterium, strain SB140, was isolated from a long-term continuous enrichment culture that was inoculated with peat soil from an acidic fen. Cells were immotile, slightly curved rods that stained Gram-negative. The optimum temperature for growth was 28 °C.

View Article and Find Full Text PDF

sp. nov., isolated from tree bark ( Chev.) and its antioxidant activity.

Int J Syst Evol Microbiol

January 2025

Department of Biochemistry and Microbiology, Faculty of Pharmaceutical Sciences, Chulalongkorn University, Bangkok 10330, Thailand.

A Gram-stain-positive, facultatively anaerobic, rod-shaped strain, designated SPB1-3, was isolated from tree bark. This strain exhibited heterofermentative production of dl-lactic acid from glucose. Optimal growth was observed at 25-40 °C, pH 4.

View Article and Find Full Text PDF

5-Methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC) are crucial epigenetic modifications in eukaryotic genomic DNA that regulate gene expression and are associated with the occurrence of various cancers. Here, we combined bisulfite conversion with 4-acetamido-2,2,6,6-tetramethyl-1-oxopiperridinium tetrafluoroborate (ACTBF, TCI) oxidation to develop a label-free and sequence-independent isothermal amplification (BTIA) assay for a genome-wide 5mC and 5hmC analysis. The BTIA strategy can distinguish 5mC and 5hmC signatures from other bases with high sensitivity and good specificity, avoiding sophisticated chemical modifications and expensive protein labeling.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!