There are two regulatory single nucleotide polymorphisms (rSNPs) at the beginning of the second intron of the mouse K-ras gene that are strongly associated with lung cancer susceptibility. We performed functional analysis of three SNPs (rs12228277: T greater than A, rs12226937: G greater than A, and rs61761074: T greater than G) located in the same region of human KRAS. We found that rs12228277 and rs61761074 result in differential binding patterns of lung nuclear proteins to oligonucleotide probes corresponding two alternative alleles; in both cases, the transcription factor NF-Y is involved.
View Article and Find Full Text PDFThis paper presents implementation of Data Mining and Knowledge Discovery techniques for searching for regularities in tables of context features of DNA sequences involved in regulation of transcription. The goal is to discover regularities that relate nucleotide sequences to the functional classes of these sequences. The search patterns for regularities have been constructed in the first-order logic augmented by probabilistic estimates.
View Article and Find Full Text PDFA method has been developed for constructing a tree source model for genetic text generation. Model visualisation in the form of suffix (context) trees provides a new way of context analysis of symbol sequences. Estimation of the stochastic complexity of the data in the frame of the model serves as a criterion for the model's ascertainment.
View Article and Find Full Text PDF