Genetic studies of Plasmodium parasites increasingly feature relatedness estimates. However, various aspects of malaria parasite relatedness estimation are not fully understood. For example, relatedness estimates based on whole-genome-sequence (WGS) data often exceed those based on sparser data types. Systematic bias in relatedness estimation is well documented in the literature geared towards diploid organisms, but largely unknown within the malaria community. We characterise systematic bias in malaria parasite relatedness estimation using three complementary approaches: theoretically, under a non-ancestral statistical model of pairwise relatedness; numerically, under a simulation model of ancestry; and empirically, using data on parasites sampled from Guyana and Colombia. We show that allele frequency estimates encode, locus-by-locus, relatedness averaged over the set of sampled parasites used to compute them. Plugging sample allele frequencies into models of pairwise relatedness can lead to systematic underestimation. However, systematic underestimation can be viewed as population-relatedness calibration, i.e., a way of generating measures of relative relatedness. Systematic underestimation is unavoidable when relatedness is estimated assuming independence between genetic markers. It is mitigated when relatedness is estimated using WGS data under a hidden Markov model (HMM) that exploits linkage between proximal markers. The extent of mitigation is unknowable when a HMM is fit to sparser data, but downstream analyses that use high relatedness thresholds are relatively robust regardless. In summary, practitioners can either resolve to use relative relatedness estimated under independence, or try to estimate absolute relatedness under a HMM. We propose various tools to help practitioners evaluate their situation on a case-by-case basis.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1093/g3journal/jkaf018 | DOI Listing |
G3 (Bethesda)
January 2025
Infectious Disease Epidemiology and Analytics G5 Unit, Institut Pasteur, Université Paris Cité, Paris 75015, France.
Genetic studies of Plasmodium parasites increasingly feature relatedness estimates. However, various aspects of malaria parasite relatedness estimation are not fully understood. For example, relatedness estimates based on whole-genome-sequence (WGS) data often exceed those based on sparser data types.
View Article and Find Full Text PDFmSystems
January 2025
Malopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland.
Average nucleotide identity (ANI) is a widely used metric to estimate genetic relatedness, especially in microbial species delineation. While ANI calculation has been well optimized for bacteria and closely related viral genomes, accurate estimation of ANI below 80%, particularly in large reference data sets, has been challenging due to a lack of accurate and scalable methods. To bridge this gap, we introduce MANIAC, an efficient computational pipeline optimized for estimating ANI and alignment fraction (AF) in viral genomes with divergence around ANI of 70%.
View Article and Find Full Text PDFbioRxiv
January 2025
Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA.
Elucidating ancestry-specific structures in admixed populations is crucial for comprehending population history and mitigating confounding effects in genome-wide association studies. Existing methods for elucidating the ancestry-specific structures generally rely on frequency-based estimates of genetic relationship matrix (GRM) among admixed individuals after masking segments from ancestry components not being targeted for investigation. However, these approaches disregard linkage information between markers, potentially limiting their resolution in revealing structure within an ancestry component.
View Article and Find Full Text PDFParasitol Int
January 2025
Parasitology Department, Faculty of Veterinary Medicine, Mansoura University, Mansoura 35516, Egypt. Electronic address:
Background/objective: Theileria orientalis is a non-transforming Theileria species infecting cattle and water buffaloes. Several outbreaks of oriental theileriosis accompanied by considerable economic loss were documented in Asia, Australia, and New Zealand. The major piroplasm surface protein (MPSP) gene has frequently been used to molecularly characterize T.
View Article and Find Full Text PDFAssessments of genetic diversity, structure, history, and effective population size ( ) are critical for the conservation of imperiled populations. The lesser prairie-chicken () has experienced declines due to habitat loss, degradation, and fragmentation in addition to substantial population fluctuations with unknown effects on genetic diversity. Our objectives were to: (i) compare genetic diversity across three temporally discrete sampling periods (2002, 2007-2010, and 2013-2014) that are characterized by low or high population abundance; (ii) examine genetic diversity at lek and lek cluster spatial scales; (ii) identify potential bottlenecks and characterize genetic structure and relatedness; and (iii) estimate the regional .
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!