Cross-validation (CV) is a technique to assess the generalizability of a model to unseen data. This technique relies on assumptions that may not be satisfied when studying genomics datasets. For example, random CV (RCV) assumes that a randomly selected set of samples, the test set, well represents unseen data. This assumption doesn't hold true where samples are obtained from different experimental conditions, and the goal is to learn regulatory relationships among the genes that generalize beyond the observed conditions. In this study, we investigated how the CV procedure affects the assessment of supervised learning methods used to learn gene regulatory networks (or in other applications). We compared the performance of a regression-based method for gene expression prediction estimated using RCV with that estimated using a clustering-based CV (CCV) procedure. Our analysis illustrates that RCV can produce over-optimistic estimates of the model's generalizability compared to CCV. Next, we defined the 'distinctness' of test set from training set and showed that this measure is predictive of performance of the regression method. Finally, we introduced a simulated annealing method to construct partitions with gradually increasing distinctness and showed that performance of different gene expression prediction methods can be better evaluated using this method.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5920056 | PMC |
http://dx.doi.org/10.1038/s41598-018-24937-4 | DOI Listing |
Sci Rep
January 2025
School of Sports and Health, Nanjing Sport Institute, Nanjing, China.
A high-calorie diet and lack of exercise are the most important risk factors contributing to metabolic dysfunction-associated steatotic liver disease (MASLD) initiation and progression. The precise molecular mechanisms of mitochondrial function alteration during MASLD development remain to be fully elucidated. In this study, a total of 60 male C57BL/6J mice were maintained on a normal or amylin liver NASH (AMLN) diet for 6 or 10 weeks.
View Article and Find Full Text PDFSci Rep
January 2025
MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, OX3 9DS, UK.
Bulk ATAC-seq assays have been used to map and profile the chromatin accessibility of regulatory elements such as enhancers, promoters, and insulators. This has provided great insight into the regulation of gene expression in many cell types in a variety of organisms. To date, ATAC-seq has most often been used to provide an average evaluation of chromatin accessibility in populations of cells.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Pharmaceutics, College of Pharmacy, King Saud University, PO Box 2457, Riyadh, 11451, Saudi Arabia.
Prostate cancer presents a major health issue, with its progression influenced by intricate molecular factors. Notably, the interplay between miRNAs and changes in transcriptomic patterns is not fully understood. Our study seeks to bridge this knowledge gap, employing computational techniques to explore how miRNAs and transcriptomic alterations jointly regulate the development of prostate cancer.
View Article and Find Full Text PDFBiochem Genet
January 2025
Bashkir State Medical University, Lenina Str. 3, Ufa, 450008, Russian Federation.
Idiopathic pulmonary fibrosis (IPF) is a rapidly progressive interstitial lung disease of unknown pathogenesis with no effective treatment currently available. Given the regulatory roles of lncRNAs (TP53TG1, LINC00342, H19, MALAT1, DNM3OS, MEG3), miRNAs (miR-218-5p, miR-126-3p, miR-200a-3p, miR-18a-5p, miR-29a-3p), and their target protein-coding genes (PTEN, TGFB2, FOXO3, KEAP1) in the TGF-β/SMAD3, Wnt/β-catenin, focal adhesion, and PI3K/AKT signaling pathways, we investigated the expression levels of selected genes in peripheral blood mononuclear cells (PBMCs) and lung tissue from patients with IPF. Lung tissue and blood samples were collected from 33 newly diagnosed, treatment-naive patients and 70 healthy controls.
View Article and Find Full Text PDFAdv Exp Med Biol
January 2025
Department of Stem Cells & Regenerative Medicine, Centre for Interdisciplinary Research, D Y Patil Education Society (Deemed to be University), Kolhapur, India.
Bone tissue engineering is a promising field that aims to rebuild the bone tissue using biomaterials, cells, and signaling molecules. Materials like natural and synthetic polymers, inorganic materials, and composite materials are used to create scaffolds that mimic the hierarchical microstructure of bone. Stem cells, particularly mesenchymal stem cells (MSCs), play a crucial role in bone tissue engineering by promoting tissue regeneration and modulating the immune response.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!