AI Article Synopsis

  • IRF8 and NFATc1 are transcription factors crucial for osteoclast differentiation, and researchers face challenges in identifying target genes that are consistently regulated across various studies due to the complexity of high-throughput data.* -
  • To tackle this issue, a new machine learning method called Cohort-based TF target prediction system (cTAP) was developed, which uses unique concepts of Gene-Present Sufficiently and Gene-Absent Insufficiently alongside log2 fold changes to predict gene targets more effectively.* -
  • The application of cTAP on multiple data sets revealed a limited number of target genes for IRF8 and NFATc1 that are involved in osteoclast differentiation, and these predictions were more

Article Abstract

Background: Interferon regulatory factor-8 (IRF8) and nuclear factor-activated T cells c1 (NFATc1) are two transcription factors that have an important role in osteoclast differentiation. Thanks to ChIP-seq technology, scientists can now estimate potential genome-wide target genes of IRF8 and NFATc1. However, finding target genes that are consistently up-regulated or down-regulated across different studies is hard because it requires analysis of a large number of high-throughput expression studies from a comparable context.

Method: We have developed a machine learning based method, called, Cohort-based TF target prediction system (cTAP) to overcome this problem. This method assumes that the pathway involving the transcription factors of interest is featured with multiple "functional groups" of marker genes pertaining to the concerned biological process. It uses two notions, Gene-Present Sufficiently (GP) and Gene-Absent Insufficiently (GA), in addition to log2 fold changes of differentially expressed genes for the prediction. Target prediction is made by applying multiple machine-learning models, which learn the patterns of GP and GA from log2 fold changes and four types of Z scores from the normalized cohort's gene expression data. The learned patterns are then associated with the putative transcription factor targets to identify genes that consistently exhibit Up/Down gene regulation patterns within the cohort. We applied this method to 11 publicly available GEO data sets related to osteoclastgenesis.

Result: Our experiment identified a small number of Up/Down IRF8 and NFATc1 target genes as relevant to osteoclast differentiation. The machine learning models using GP and GA produced NFATc1 and IRF8 target genes different than simply using a log2 fold change alone. Our literature survey revealed that all predicted target genes have known roles in bone remodeling, specifically related to the immune system and osteoclast formation and functions, suggesting confidence and validity in our method.

Conclusion: cTAP was motivated by recognizing that biologists tend to use Z score values present in data sets for the analysis. However, using cTAP effectively presupposes assembling a sizable cohort of gene expression data sets within a comparable context. As public gene expression data repositories grow, the need to use cohort-based analysis method like cTAP will become increasingly important.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8740472PMC
http://dx.doi.org/10.1186/s12864-021-08159-zDOI Listing

Publication Analysis

Top Keywords

target genes
20
irf8 nfatc1
12
osteoclast differentiation
12
machine learning
12
log2 fold
12
gene expression
12
expression data
12
data sets
12
differentiation machine
8
transcription factors
8

Similar Publications

Background: CRISPR is widely used to silence genes by inducing mutations expected to nullify their expression. While numerous computational tools have been developed to design single-guide RNAs (sgRNAs) with high cutting efficiency and minimal off-target effects, only a few tools focus specifically on predicting gene knockouts following CRISPR. These tools consider factors like conservation, amino acid composition, and frameshift likelihood.

View Article and Find Full Text PDF

Attenuated sex-related DNA methylation differences in cancer highlight the magnitude bias mediating existing disparities.

Biol Sex Differ

December 2024

State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China.

Background: DNA methylation (DNAm) influences both sex differences and cancer development, yet the mechanisms connecting these factors remain unclear.

Methods: Utilizing data from The Cancer Genome Atlas, we conducted a comprehensive analysis of sex-related DNAm effects in nine non-reproductive cancers, compared to paired normal adjacent tissues (NATs), and validated the results using independent datasets. First, we assessed the extent of sex differential DNAm between cancers and NATs to explore how sex-related DNAm differences change in cancerous tissues.

View Article and Find Full Text PDF

Systematic druggable genome-wide Mendelian randomization identifies therapeutic targets for hyperemesis gravidarum.

BMC Pregnancy Childbirth

December 2024

Cancer Research Institute, Yunnan Cancer Hospital, The Third Affiliated Hospital of Kunming Medical University, Peking University Cancer Hospital Yunnan, Kunming, 650106, China.

Background: Hyperemesis gravidarum (HG), excessive vomiting in pregnancy, occurs in 0.3-10.8% of pregnancies and is associated with maternal and fetal morbidity.

View Article and Find Full Text PDF

Background: Embryogenic callus (EC) has strong regenerative potential, useful for propagation and genetic transformation. miRNAs have been confirmed to play key regulatory roles in EC regeneration across various plants. However, challenges in EC induction have hindered the breeding of drumstick (Moringa oleifera Lam.

View Article and Find Full Text PDF

Background: As an important prokaryotic model organism, Bacillus subtilis has been widely used in the industrial production of a variety of target products. The efficient secretion of target products has always been the main purpose of industrial microbial technology. The modification of gene regulatory networks is an important technical means to construct a factory of microbial cells that efficiently secretes target products.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!

A PHP Error was encountered

Severity: Notice

Message: fwrite(): Write of 34 bytes failed with errno=28 No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 272

Backtrace:

A PHP Error was encountered

Severity: Warning

Message: session_write_close(): Failed to write session data using user defined save handler. (session.save_path: /var/lib/php/sessions)

Filename: Unknown

Line Number: 0

Backtrace: