Background: A major goal of computational studies of gene regulation is to accurately predict the expression of genes based on the cis-regulatory content of their promoters. The development of computational methods to decode the interactions among cis-regulatory elements has been slow, in part, because it is difficult to know, without extensive experimental validation, whether a particular method identifies the correct cis-regulatory interactions that underlie a given set of expression data. There is an urgent need for test expression data in which the interactions among cis-regulatory sites that produce the data are known. The ability to rapidly generate such data sets would facilitate the development and comparison of computational methods that predict gene expression patterns from promoter sequence.
Results: We developed a gene expression simulator which generates expression data using user-defined interactions between cis-regulatory sites. The simulator can incorporate additive, cooperative, competitive, and synergistic interactions between regulatory elements. Constraints on the spacing, distance, and orientation of regulatory elements and their interactions may also be defined and Gaussian noise can be added to the expression values. The simulator allows for a data transformation that simulates the sigmoid shape of expression levels from real promoters. We found good agreement between sets of simulated promoters and predicted regulatory modules from real expression data. We present several data sets that may be useful for testing new methodologies for predicting gene expression from promoter sequence.
Conclusion: We developed a flexible gene expression simulator that rapidly generates large numbers of simulated promoters and their corresponding transcriptional output based on specified interactions between cis-regulatory sites. When appropriate rule sets are used, the data generated by our simulator faithfully reproduces experimentally derived data sets. We anticipate that using simulated gene expression data sets will facilitate the direct comparison of computational strategies to predict gene expression from promoter sequence. The source code is available online and as additional material. The test sets are available as additional material.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375358 | PMC |
http://dx.doi.org/10.1186/1471-2105-8-272 | DOI Listing |
Theor Appl Genet
January 2025
Research Center for Life Sciences Computing, Zhejiang Lab, Hangzhou, 310012, China.
In the present study, we identified 22 significant SNPs, eight stable QTLs and 17 potential candidate genes associated with 100-seed weight in soybean. Soybean is an economically important crop that is rich in seed oil and protein. The 100-seed weight (HSW) is a crucial yield contributing trait.
View Article and Find Full Text PDFMol Biol Rep
January 2025
Mahatma Gandhi Medical Advanced Research Institute (MGMARI), Sri Balaji Vidyapeeth (Deemed to be University), Puducherry, 607402, India.
MicroRNAs (miRNAs) and transfer RNA-derived stress-induced RNAs (tiRNAs) have emerged as crucial players in the post-transcriptional regulation of gene expression in various cellular processes, including immunity and host defense against infections. In recent years, increasing evidence has highlighted their complex role in influencing the host response during viral and bacterial infections. miRNAs have been shown to play multiple roles in host-pathogen interaction like TLR activation and altered disease virulence during bacterial infections.
View Article and Find Full Text PDFPlanta
January 2025
Institute of Plant Genetics and Biotechnology, Plant Science and Biodiversity Center, Slovak Academy of Sciences, Akademicka 2, P. O. Box 39A, 950 07, Nitra, Slovak Republic.
DbChitI-3, Drosera binata's acidic chitinase, peaks at pH 2.5 from 15 °C to 30 °C. Gene expression is stimulated by polysaccharides and suppressed by monosaccharide digestion, implying a feedback loop in its transcriptional regulation.
View Article and Find Full Text PDFCell Mol Biol (Noisy-le-grand)
January 2025
College of Life Sciences, Liaoning Normal University, Dalian 116000, Liaoning Province, China.
Liaoning cashmere goat is an outstanding breed in China primarily for cashmere production, with strict controls against genetic outflow. Melatonin(MT) is a key factor affecting cashmere growth, and preliminary transcriptome sequencing indicated that melatonin upregulates the expression of the PIP5K1A gene in skin fibroblasts. To predict the physicochemical properties of PIP5K1A in Liaoning cashmere goats, ascertain the tissue localization of PIP5K1A in their skin, and explore the role and mechanism of PIP5K1A in the proliferation of skin fibroblasts.
View Article and Find Full Text PDFCell Mol Biol (Noisy-le-grand)
January 2025
Department of Integrative Medicine, Huashan Hospital, Fudan University, Shanghai, China.
Mitochondrial ribosomal protein S23 (MRPS23), encoded by a nuclear gene, is a well-known driver of proliferation in cancer. It participates in mitochondrial protein translation, and its expression association has been explored in many types of cancer. However, MRPS23 expression associations are rarely reported in breast cancer (BC).
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!