Publications by David Siegmund | LitMetric

Publications by authors named "David Siegmund"

Page 1 of 1

Ghost QTL and hotspots in experimental crosses: novel approach for modeling polygenic effects.

Jonas Wallin Małgorzata Bogdan Piotr A Szulc R W Doerge David O Siegmund

Genetics

March 2021

Ghost quantitative trait loci (QTL) are the false discoveries in QTL mapping, that arise due to the "accumulation" of the polygenic effects, uniformly distributed over the genome. The locations on the chromosome that are strongly correlated with the total of the polygenic effects depend on a specific sample correlation structure determined by the genotypes at all loci. The problem is particularly severe when the same genotypes are used to study multiple QTL, e.

View Article and Find Full Text PDF

A genome-wide approach for detecting novel insertion-deletion variants of mid-range size.

Li C Xia Sukolsak Sakshuwong Erik S Hopmans John M Bell Susan M Grimes David O Siegmund

Nucleic Acids Res

September 2016

We present SWAN, a statistical framework for robust detection of genomic structural variants in next-generation sequencing data and an analysis of mid-range size insertion and deletions (<10 Kb) for whole genome analysis and DNA mixtures. To identify these mid-range size events, SWAN collectively uses information from read-pair, read-depth and one end mapped reads through statistical likelihoods based on Poisson field models. SWAN also uses soft-clip/split read remapping to supplement the likelihood analysis and determine variant boundaries.

View Article and Find Full Text PDF

Joint testing of genotype and ancestry association in admixed families.

Hua Tang David O Siegmund Nicholas A Johnson Isabelle Romieu Stephanie J London

Genet Epidemiol

December 2010

Current genome-wide association studies (GWAS) often involve populations that have experienced recent genetic admixture. Genotype data generated from these studies can be used to test for association directly, as in a non-admixed population. As an alternative, these data can be used to infer chromosomal ancestry, and thus allow for admixture mapping.

View Article and Find Full Text PDF

Detecting simultaneous changepoints in multiple sequences.

Nancy R Zhang David O Siegmund Hanlee Ji Jun Z Li

Biometrika

September 2010

We discuss the detection of local signals that occur at the same location in multiple one-dimensional noisy sequences, with particular attention to relatively weak signals that may occur in only a fraction of the sequences. We propose simple scan and segmentation algorithms based on the sum of the chi-squared statistics for each individual sample, which is equivalent to the generalized likelihood ratio for a model where the errors in each sample are independent. The simple geometry of the statistic allows us to derive accurate analytic approximations to the significance level of such scans.

View Article and Find Full Text PDF

Mapping quantitative traits in unselected families: algorithms and examples.

Josée Dupuis Jianxin Shi Alisa K Manning Emelia J Benjamin James B Meigs David Siegmund

Genet Epidemiol

November 2009

Linkage analysis has been widely used to identify from family data genetic variants influencing quantitative traits. Common approaches have both strengths and limitations. Likelihood ratio tests typically computed in variance component analysis can accommodate large families but are highly sensitive to departure from normality assumptions.

View Article and Find Full Text PDF

Statistical corrections of linkage data suggest predominantly cis regulations of gene expression.

Jianxin Shi David O Siegmund Douglas F Levinson

BMC Proc

December 2009

Morley et al. (Nature 2004, 430:743-747) detected significant linkages to the expression levels of 142 genes (of 3554) at a reported threshold of genome-wide p = 0.001 (LOD asymptotically equal to 5.

View Article and Find Full Text PDF

A unified framework for linkage and association analysis of quantitative traits.

Josée Dupuis David O Siegmund Benjamin Yakir

Proc Natl Acad Sci U S A

December 2007

We give a unified treatment of the statistical foundations of population based association mapping and of family based linkage mapping of quantitative traits in humans. A central ingredient in the unification involves the efficient score statistic. The discussion focuses on generalized linear models with an additional illustration of the Cox (proportional hazards) model for age of onset data.

View Article and Find Full Text PDF

Approximating the variance of the conditional probability of the state of a hidden Markov model.

David O Siegmund Benjamin Yakir

Stat Appl Genet Mol Biol

July 2009

In a hidden Markov model, one "estimates" the state of the hidden Markov chain at t by computing via the forwards-backwards algorithm the conditional distribution of the state vector given the observed data. The covariance matrix of this conditional distribution measures the information lost by failure to observe directly the state of the hidden process. In the case where changes of state occur slowly relative to the speed at which information about the underlying state accumulates in the observed data, we compute approximately these covariances in terms of functionals of Brownian motion that arise in change-point analysis.

View Article and Find Full Text PDF

A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data.

Nancy R Zhang David O Siegmund

Biometrics

March 2007

In the analysis of data generated by change-point processes, one critical challenge is to determine the number of change-points. The classic Bayes information criterion (BIC) statistic does not work well here because of irregularities in the likelihood function. By asymptotic approximation of the Bayes factor, we derive a modified BIC for the model of Brownian motion with changing drift.

View Article and Find Full Text PDF

Spatial regulation and the rate of signal transduction activation.

Nizar N Batada Larry A Shepp David O Siegmund Michael Levitt

PLoS Comput Biol

May 2006

Of the many important signaling events that take place on the surface of a mammalian cell, activation of signal transduction pathways via interactions of cell surface receptors is one of the most important. Evidence suggests that cell surface proteins are not as freely diffusible as implied by the classic fluid mosaic model and that their confinement to membrane domains is regulated. It is unknown whether these dynamic localization mechanisms function to enhance signal transduction activation rate or to minimize cross talk among pathways that share common intermediates.

View Article and Find Full Text PDF

Genome scans with gene-covariate interaction.

Jie Peng Hsiu-Khuern Tang David Siegmund

Genet Epidemiol

November 2005

Genetic models for gene-covariate interactions are described. Methods of linkage analysis that utilize special features of these models and the corresponding score statistics are derived. Their power is compared with that of simple genome scans that ignore these special features, and substantial gains in power are observed when the gene-covariate interaction is strong.

View Article and Find Full Text PDF

On the power for linkage detection using a test based on scan statistics.

Sonia Hernández David O Siegmund Mathisca de Gunst

Biostatistics

April 2005

We analyze some aspects of scan statistics, which have been proposed to help for the detection of weak signals in genetic linkage analysis. We derive approximate expressions for the power of a test based on moving averages of the identity by descent allele sharing proportions for pairs of relatives at several contiguous markers. We confirm these approximate formulae by simulation.

View Article and Find Full Text PDF

Stochastic model of protein-protein interaction: why signaling proteins need to be colocalized.

Nizar N Batada Larry A Shepp David O Siegmund

Proc Natl Acad Sci U S A

April 2004

Colocalization of proteins that are part of the same signal transduction pathway via compartmentalization, scaffold, or anchor proteins is an essential aspect of the signal transduction system in eukaryotic cells. If interaction must occur via free diffusion, then the spatial separation between the sources of the two interacting proteins and their degradation rates become primary determinants of the time required for interaction. To understand the role of such colocalization, we create a mathematical model of the diffusion based protein-protein interaction process.

View Article and Find Full Text PDF

Gene expression patterns and gene copy number changes in dermatofibrosarcoma protuberans.

Sabine C Linn Rob B West Jonathan R Pollack Shirley Zhu Tina Hernandez-Boussard David Siegmund

Am J Pathol

December 2003

Dermatofibrosarcoma protuberans (DFSP) is an aggressive spindle cell neoplasm. It is associated with the chromosomal translocation, t(17:22), which fuses the COL1A1 and PDGFbeta genes. We determined the characteristic gene expression profile of DFSP and characterized DNA copy number changes in DFSP by array-based comparative genomic hybridization (array CGH).

View Article and Find Full Text PDF

Frequentist estimation of coalescence times from nucleotide sequence data using a tree-based partition.

Hua Tang David O Siegmund Peidong Shen Peter J Oefner Marcus W Feldman

Genetics

May 2002

This article proposes a method of estimating the time to the most recent common ancestor (TMRCA) of a sample of DNA sequences. The method is based on the molecular clock hypothesis, but avoids assumptions about population structure. Simulations show that in a wide range of situations, the point estimate has small bias and the confidence interval has at least the nominal coverage probability.

View Article and Find Full Text PDF

Mapping multiple genes for quantitative or complex traits.

Hsiu-Khuern Tang David Siegmund

Genet Epidemiol

April 2002

Models for complex and quantitative traits that involve multiple, possibly interacting, genes are described. Methods of linkage analysis are developed that utilize special features of these models, and their power is compared with that of simple genome scans that ignore these special features. Our calculations show that for family-based nonparametric linkage analysis in human genetics, in contrast to experimental genetics, there are limits to the increase in power that can be achieved by correctly modeling gene-gene interactions.

View Article and Find Full Text PDF