In animals, transcription factor binding sites are hard to recognize because of their extensive variation. We therefore characterized the general relationship between a specific protein-binding site and its DNA sequence and used this relationship to generate a predictive algorithm for searching other DNA sequences. The experimental process was defined by studying hepatocyte nuclear factor 1 (HNF1), which binds DNA as a dimer on two inverted-repeat 7-bp half sites separated by one base. The binding model was based on the equivalence of the two half sites, which was confirmed in examples where specific modified sites were compared. Binding competition analysis was used to determine the effects of substitution of all four bases at each position in the half site. From these data, a weighted half-site matrix was generated and the full site was evaluated as the sum of two half-site scores. This process accurately predicted even weak binding sites that were significantly different from the consensus sequence. The predictions also showed a direct correlation with measured protein binding.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC137408PMC
http://dx.doi.org/10.1093/nar/gkf484DOI Listing

Publication Analysis

Top Keywords

transcription factor
8
factor binding
8
hepatocyte nuclear
8
nuclear factor
8
binding sites
8
half sites
8
binding
6
sites
5
definition prediction
4
prediction full
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!