Repetitive sequences are biologically and clinically important because they can influence traits and disease, but repeats are challenging to analyse using short-read sequencing technology. We present a tool for genotyping microsatellite repeats called RepeatSeq, which uses Bayesian model selection guided by an empirically derived error model that incorporates sequence and read properties. Next, we apply RepeatSeq to high-coverage genomes from the 1000 Genomes Project to evaluate performance and accuracy. The software uses common formats, such as VCF, for compatibility with existing genome analysis pipelines. Source code and binaries are available at http://github.com/adaptivegenome/repeatseq.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592458PMC
http://dx.doi.org/10.1093/nar/gks981DOI Listing

Publication Analysis

Top Keywords

accurate human
4
human microsatellite
4
microsatellite genotypes
4
genotypes high-throughput
4
high-throughput resequencing
4
resequencing data
4
data informed
4
informed error
4
error profiles
4
profiles repetitive
4

Similar Publications

Accurate diagnosis of oral lesions, early indicators of oral cancer, is a complex clinical challenge. Recent advances in deep learning have demonstrated potential in supporting clinical decisions. This paper introduces a deep learning model for classifying oral lesions, focusing on accuracy, interpretability, and reducing dataset bias.

View Article and Find Full Text PDF

Nursing activity recognition has immense importance in the development of smart healthcare management and is an extremely challenging area of research in human activity recognition. The main reasons are an extreme class-imbalance problem and intra-class variability depending on both the subject and the recipient. In this paper, we apply a unique two-step feature extraction, coupled with an intermediate feature 'Angle' and a new feature called mean min max sum to render the features robust against intra-class variation.

View Article and Find Full Text PDF

Warfarin is the most widely used oral anticoagulant in clinical practice. The cytochrome P450 2C9 (CYP2C9), vitamin K epoxide reductase complex 1 (VKORC1), and cytochrome P450 4F2 (CYP4F2) genotypes are associated with warfarin dose requirements in China. Accurate genotyping is vital for obtaining reliable genotype-guided warfarin dosing information.

View Article and Find Full Text PDF

Early prediction of patient responses to neoadjuvant chemotherapy (NACT) is essential for the precision treatment of early breast cancer (EBC). Therefore, this study aims to noninvasively and early predict pathological complete response (pCR). We used dynamic ultrasound (US) imaging changes acquired during NACT, along with clinicopathological features, to create a nomogram and construct a machine learning model.

View Article and Find Full Text PDF

Effect of heart rate on B-type natriuretic peptide in sinus rhythm.

Sci Rep

December 2024

Division of Cardiology, Department of Internal Medicine, The Jikei University School of Medicine, 3-25-8 Nishi-shimbashi, Minato-ku, Tokyo, 105-8461, Japan.

B-type natriuretic peptide (BNP) levels accurately reflect the degree of cardiac overload in heart failure. Considering cardiac morphology and intracardiac pressure, including the left ventricular end-systolic volume index (LVESVI) and left ventricular end-diastolic volume index (LVEDVI), is essential for cardiac overload assessment. These indexes influence plasma BNP levels, and high heart rate is likely associated with cardiac morphology.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!