Ascertainment bias in the genomic test of positive selection on regulatory sequences.

bioRxiv

Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.

Published: August 2023

Evolution of gene expression mediated by cis-regulatory changes is thought to be an important contributor to organismal adaptation, but identifying adaptive cis-regulatory changes is challenging because the expectation under no positive selection is difficult to establish. A new approach for detecting positive selection on transcription factor binding sites (TFBSs) was recently developed, thanks to the application of machine learning in predicting transcription factor (TF) binding affinities of DNA sequences. Given a TFBS sequence from a focal species and the corresponding inferred ancestral sequence that differs from it at n sites, one can predict the TF binding affinities of many n-step mutational neighbors of the ancestral sequence and obtain a null distribution of the derived binding affinity. This allows testing whether the binding affinity of the real derived sequence deviates significantly from the null distribution. Applying this test genomically to all experimentally identified binding sites of three TFs in humans, a recent study reported positive selection for elevated binding affinities of TFBSs. Here we show that this genomic test suffers from an ascertainment bias because, even in the absence of positive selection for strengthened binding, the binding affinities of known human TFBSs are more likely to have increased than decreased during evolution. We demonstrate by computer simulation that this bias inflates the false positive rate of the selection test. We propose several methods to mitigate the ascertainment bias and show that almost all previously reported positive selection signals disappear when these methods are applied.
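The logic of the test described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: `toy_affinity` is a hypothetical stand-in for a trained machine-learning predictor, the motif `TGACTCA` is an arbitrary example, and the null neighbors are drawn by mutating n uniformly chosen sites to uniformly chosen alternative bases, which is an assumption rather than the substitution model used in the study. All function names are illustrative.

```python
import random

BASES = "ACGT"

def toy_affinity(seq, motif="TGACTCA"):
    """Hypothetical affinity score: best ungapped match count against a motif.
    A real analysis would call a trained binding-affinity predictor here."""
    k = len(motif)
    return max(
        sum(a == b for a, b in zip(seq[i:i + k], motif))
        for i in range(len(seq) - k + 1)
    )

def n_step_neighbor(ancestral, n, rng):
    """Return a sequence differing from `ancestral` at exactly n sites
    (uniform choice of sites and of replacement bases; an assumption)."""
    seq = list(ancestral)
    for site in rng.sample(range(len(seq)), n):
        seq[site] = rng.choice([b for b in BASES if b != seq[site]])
    return "".join(seq)

def selection_test(ancestral, derived, affinity=toy_affinity, n_null=1000, seed=0):
    """One-sided empirical test for an increase in predicted binding affinity."""
    if len(ancestral) != len(derived):
        raise ValueError("sequences must be aligned and of equal length")
    # n = number of sites at which the derived sequence differs from the ancestor
    n = sum(a != b for a, b in zip(ancestral, derived))
    rng = random.Random(seed)
    base = affinity(ancestral)
    observed = affinity(derived) - base
    # Null distribution: affinity change of random n-step neighbors
    null = [affinity(n_step_neighbor(ancestral, n, rng)) - base for _ in range(n_null)]
    # Empirical p-value with the usual +1 correction
    p = (1 + sum(d >= observed for d in null)) / (n_null + 1)
    return n, observed, p

if __name__ == "__main__":
    print(selection_test("CCTGAGTCTCC", "CCTGACTCACC"))
```

Note that this sketch only reproduces the per-site test. It does nothing to correct the ascertainment bias discussed in the abstract: if the input sequences are chosen because they are bound in the focal species, derived affinities will tend to exceed ancestral ones even without positive selection.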

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10473660
DOI: http://dx.doi.org/10.1101/2023.08.20.554030

