Type 1 diabetes (T1D) outcome prediction plays a vital role in identifying novel risk factors, ensuring early patient care and designing cohort studies. TEDDY is a longitudinal cohort study that collects a vast amount of multi-omics and clinical data from its participants to explore the progression and markers of T1D. However, missing data in the omics profiles make outcome prediction difficult. TEDDY collected time series gene expression for less than 6% of enrolled participants. Additionally, for the participants whose gene expression was collected, 79% of time steps are missing. This study introduces an advanced bioinformatics framework for gene expression imputation and islet autoimmunity (IA) prediction. The imputation model generates synthetic data for participants with partially or entirely missing gene expression. The prediction model integrates the synthetic gene expression with other risk factors to achieve better predictive performance. Comprehensive experiments on TEDDY datasets show that: (1) Our pipeline can effectively integrate synthetic gene expression with family history, HLA genotype and SNPs to better predict IA status at 2 years (sensitivity 0.622, AUC 0.715) compared with the individual datasets and state-of-the-art results in the literature (AUC 0.682). (2) The synthetic gene expression contains predictive signals as strong as the true gene expression, reducing reliance on expensive and long-term longitudinal data collection. (3) Time series gene expression is crucial to the reported improvement and shows significantly better predictive ability than cross-sectional gene expression. (4) Our pipeline is robust to limited data availability. Availability: Code is available at https://github.com/compbiolabucf/TEDDY.
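
The abstract reports results as sensitivity and AUC. As a reading aid, the following minimal sketch shows how these two metrics are conventionally computed from binary labels and predicted scores. It is not the authors' code: the function names, the 0.5 decision threshold and the toy labels and scores are all assumptions made for illustration and have no connection to the TEDDY data.

```python
# Illustrative only: toy labels and scores, not TEDDY data or the authors' code.

def sensitivity(y_true, y_pred):
    """Fraction of true positives among all actual positives (recall)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn)


def auc(y_true, scores):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical example: 1 = outcome present, 0 = outcome absent.
y_true = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]

# Assumed decision threshold of 0.5 for turning scores into class labels.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

print(f"sensitivity = {sensitivity(y_true, y_pred):.3f}")
print(f"AUC         = {auc(y_true, scores):.3f}")
```

Sensitivity depends on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is why the two numbers are reported together.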


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851333
DOI: http://dx.doi.org/10.1093/bib/bbac537


