The human genome is diploid, so each heterozygous single nucleotide polymorphism (SNP) must be assigned to one of the two copies of the genome. The resulting haplotypes, lists of SNPs belonging to each copy, are crucial for downstream analyses in population genetics. Currently, statistical approaches, which are oblivious to direct read information, constitute the state of the art. Haplotype assembly, which addresses phasing directly from sequencing reads, suffers from the fact that current-generation sequencing reads are too short to serve the purposes of genome-wide phasing. While future-technology sequencing reads will contain sufficient numbers of SNPs per read for phasing, they are also likely to suffer from higher sequencing error rates. Currently, no haplotype assembly approach exists that takes both increasing read length and sequencing error information into account. Here, we suggest WhatsHap, the first approach that yields provably optimal solutions to the weighted minimum error correction problem in runtime linear in the number of SNPs. WhatsHap is a fixed-parameter tractable (FPT) approach with coverage as the parameter. We demonstrate that WhatsHap can handle datasets of coverage up to 20×, and that 15× is generally enough for reliably phasing long reads, even at significantly elevated sequencing error rates. We also find that the switch and flip error rates of the haplotypes we output compare favorably with those of state-of-the-art statistical phasers.
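To make the weighted minimum error correction objective concrete, the following is a minimal sketch in Python and not the WhatsHap algorithm itself. It enumerates every bipartition of the reads, so its runtime is exponential in the number of reads and it is only usable on toy inputs. The data layout (one dict per read, mapping a SNP index to an `(allele, weight)` pair) and the names `wmec_cost`, `brute_force_wmec`, and `example_reads` are assumptions made for illustration. The weights stand in for the confidence of each allele call, for instance as derived from base qualities.

```python
from itertools import product

def wmec_cost(reads, assignment):
    """Weighted cost of one bipartition of the reads.

    reads: list of dicts mapping SNP index -> (allele in {0, 1}, weight).
    assignment: sequence of 0/1, the haplotype side chosen for each read.
    On each side and at each SNP, the minority allele is treated as
    erroneous, and the total weight of those calls is added to the cost.
    """
    total = 0
    for side in (0, 1):
        weights = {}  # SNP index -> [total weight of allele 0, of allele 1]
        for read, read_side in zip(reads, assignment):
            if read_side != side:
                continue
            for pos, (allele, weight) in read.items():
                weights.setdefault(pos, [0, 0])[allele] += weight
        total += sum(min(w0, w1) for w0, w1 in weights.values())
    return total

def brute_force_wmec(reads):
    """Return (minimum cost, assignment) by trying every bipartition."""
    if not reads:
        return 0, ()
    best = None
    # Fix the first read on side 0: swapping the two sides gives the same cost.
    for rest in product((0, 1), repeat=len(reads) - 1):
        assignment = (0,) + rest
        cost = wmec_cost(reads, assignment)
        if best is None or cost < best[0]:
            best = (cost, assignment)
    return best

# Toy input (made up): five reads over three SNPs. The last read carries a
# low-confidence call at SNP 2 that disagrees with the third read.
example_reads = [
    {0: (0, 10), 1: (0, 10)},
    {0: (1, 10), 1: (1, 10)},
    {1: (0, 10), 2: (1, 10)},
    {1: (1, 10), 2: (0, 10)},
    {0: (0, 10), 1: (0, 10), 2: (0, 2)},
]

if __name__ == "__main__":
    print(brute_force_wmec(example_reads))
```

On this toy input the cheapest bipartition places reads 0, 2, and 4 on one side and reads 1 and 3 on the other, at a cost of 2: only the low-weight call at SNP 2 has to be corrected.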
DOI: http://dx.doi.org/10.1089/cmb.2014.0157