rBahadur: efficient simulation of structured high-dimensional genotype data with applications to assortative mating.

BMC Bioinformatics

Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Published: August 2023

Existing methods for generating synthetic genotype data are ill-suited for replicating the effects of assortative mating (AM). We propose rb_dplr, a novel and computationally efficient algorithm for generating high-dimensional binary random variates that effectively recapitulates AM-induced genetic architectures using the Bahadur order-2 approximation of the multivariate Bernoulli distribution. The rBahadur R library is available through the Comprehensive R Archive Network at https://CRAN.R-project.org/package=rBahadur .
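As a rough illustration of the Bahadur order-2 approximation named in the abstract, the sketch below enumerates the probability mass function of a small multivariate Bernoulli vector. Each outcome gets the independent-Bernoulli probability multiplied by a correction term built from the pairwise correlations. It is not the rb_dplr algorithm and does not use the rBahadur API. The function names `bahadur2_pmf` and `pairwise_correlation` and the example values are hypothetical, and brute-force enumeration is practical only for a handful of variables.

```python
from itertools import combinations, product
from math import sqrt


def bahadur2_pmf(p, r):
    """Order-2 Bahadur pmf over {0,1}^m by brute-force enumeration.

    p: marginal success probabilities, each strictly between 0 and 1.
    r: symmetric m x m matrix of target pairwise correlations
       (only entries with i < j are read).
    """
    m = len(p)
    pmf = {}
    for x in product((0, 1), repeat=m):
        # Probability of x under independent Bernoulli marginals.
        base = 1.0
        for xi, pi in zip(x, p):
            base *= pi if xi == 1 else 1.0 - pi
        # Standardized coordinates z_i = (x_i - p_i) / sqrt(p_i (1 - p_i)).
        z = [(xi - pi) / sqrt(pi * (1.0 - pi)) for xi, pi in zip(x, p)]
        # Second-order correction: 1 + sum_{i<j} r_ij z_i z_j.
        correction = 1.0 + sum(
            r[i][j] * z[i] * z[j] for i, j in combinations(range(m), 2)
        )
        prob = base * correction
        if prob < -1e-12:
            # The order-2 form is a valid distribution only for some (p, r).
            raise ValueError("negative probability: (p, r) is not admissible")
        pmf[x] = prob
    return pmf


def pairwise_correlation(pmf, i, j):
    """Correlation between coordinates i and j implied by the pmf."""
    mean_i = sum(prob for x, prob in pmf.items() if x[i] == 1)
    mean_j = sum(prob for x, prob in pmf.items() if x[j] == 1)
    mean_ij = sum(prob for x, prob in pmf.items() if x[i] == 1 and x[j] == 1)
    cov = mean_ij - mean_i * mean_j
    return cov / sqrt(mean_i * (1.0 - mean_i) * mean_j * (1.0 - mean_j))


# Hypothetical example: three binary variables with weak positive correlation.
example_p = [0.3, 0.5, 0.2]
example_r = [[0.0, 0.1, 0.1], [0.1, 0.0, 0.1], [0.1, 0.1, 0.0]]
example_pmf = bahadur2_pmf(example_p, example_r)
print(sum(example_pmf.values()))
print(pairwise_correlation(example_pmf, 0, 1))
```

The correction term leaves the marginal means unchanged and sets each pairwise correlation to the requested value, provided every resulting probability is non-negative.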

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439545
DOI: http://dx.doi.org/10.1186/s12859-023-05442-6

Publication Analysis

Top Keywords

genotype data: 8
assortative mating: 8
rbahadur efficient: 4
efficient simulation: 4
simulation structured: 4
structured high-dimensional: 4
high-dimensional genotype: 4
data applications: 4
applications assortative: 4
mating existing: 4

Similar Publications

Trophoblast glycoprotein (TPBG) plays a significant part in the growth of specific cancers, yet its connection to gastric cancer (GC) remains uncertain. This research seeks to analyse the variation in TPBG levels in GC and to evaluate how TPBG expression relates to the prognosis of GC patients. TPBG expression in GC and normal gastric tissues was investigated using The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx) databases, then examined in immunohistochemistry images from the HPA database and validated by Western blot.

Objective: We sought to evaluate the immediate risk of high-grade squamous intraepithelial lesion-cervical intraepithelial neoplasia grade 2/3 or worse (HSIL-CIN2+/3+, hereafter referred to as CIN2+/3+) associated with specific human papillomavirus (HPV) genotypes and to develop a precise risk-based triage strategy for women with atypical squamous cells of undetermined significance (ASC-US).

Methods: The clinical data of women with ASC-US who underwent HPV genotyping and colposcopy were retrospectively reviewed. The distribution and CIN2+/3+ risks of specific HPV genotypes were assessed by three approaches.

Background: Convolutional neural networks have excellent modeling abilities for complex large-scale datasets and have been applied to genomics. Employing convolutional neural networks in genome-wide association studies requires converting genotype data to an image format. Existing studies that convert the data into grayscale images have shown promise.

Background: Although ABO and RhD are the clinically significant blood group antigens that are routinely tested for, other blood group antigens may become important in multiply transfused patients due to the risk of alloimmunization. Knowledge of antigen prevalence in a population is important in the context of alloimmunization and antigen matching. This study aims to determine antigen prevalence in a population of voluntary blood donors at a center in South India.

Introduction: Data on blood group phenotypes in Indian blood donors obtained with molecular diagnostic modalities are scarce. Hence, we planned to estimate the frequencies of blood group alleles/phenotypes using DNA microarray analysis in the north Indian RhD-negative blood donor population. Following this initial pilot study, we plan to extend the analysis to our entire donor population.

