Background: The effectiveness of genomic prediction (GP) significantly influences breeding progress, and employing SNP markers to predict phenotypic values is a pivotal aspect of pig breeding. Machine learning (ML) methods are usually used to predict phenotypic values since their advantages in processing high dimensional data. While, the existing researches have not indicated which ML methods are suitable for most pig genomic prediction. Therefore, it is necessary to select appropriate methods from a large number of ML methods as long as genomic prediction is performed. This paper compared the performance of popular ML methods in predicting pig phenotypes and then found out suitable methods for most traits.
Results: In this paper, five commonly used datasets from other literatures were utilized to compare the performance of different ML methods. The experimental results demonstrate that Stacking performs best on the PIC dataset where the trait information is hidden, and the performs of kernel ridge regression with rbf kernel (KRR-rbf) closely follows. Support vector regression (SVR) performs best in predicting reproductive traits, followed by genomic best linear unbiased prediction (GBLUP). GBLUP achieves the best performance on growth traits, with SVR as the second best.
Conclusions: GBLUP achieves good performance for GP problems. Similarly, the Stacking, SVR, and KRR-RBF methods also achieve high prediction accuracy. Moreover, LR statistical analysis shows that Stacking, SVR and KRR are stable. When applying ML methods for phenotypic values prediction in pigs, we recommend these three approaches.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1186/s12711-025-00957-3 | DOI Listing |
Physiol Mol Biol Plants
February 2025
Traditional Chinese Medicine Institute of Anhui Dabie Mountain, College of Biological and Pharmaceutical Engineering, West Anhui University, Lu'an, 237012 China.
Unlabelled: The cytochrome P450 monooxygenases (CYP450) are the largest enzyme family in plant metabolism, playing a key role in the biosynthesis of both primary and secondary metabolites. However, the CYP450 has not yet been systematically characterized in Dendrobium species. In this study, 193 genes were identified in the genome of through bioinformatics, and divided into 2 groups and 10 clans.
View Article and Find Full Text PDFFront Immunol
March 2025
Division of Haematology, Respiratory Medicine and Oncology, Department of Internal Medicine, Faculty of Medicine, Saga University, Saga, Japan.
High tumour mutational burden (TMB-high), identified through comprehensive genomic profiling (CGP), is a biomarker that predicts the efficacy of immune checkpoint inhibitors. CGP testing is recommended for rare cancers with limited effective treatment options. Here, we provide the first report of a malignant phyllodes tumour of the breast demonstrating TMB-high status and effective treatment with pembrolizumab.
View Article and Find Full Text PDFHealth Sci Rep
March 2025
Clinical Epidemiology and EBM Unit, Beijing Friendship Hospital Capital Medical University Beijing China.
Background And Aims: Nonalcoholic fatty liver disease (NAFLD) is an escalating global health concern with significant implications for cancers. A better understanding of the causal relationship between NAFLD and extrahepatic cancers might help in clinical management of NAFLD and prevent its adverse outcomes.
Methods: This study encompassed two complementary approaches.
ISME Commun
January 2025
Institute of Environment and Ecology, Tsinghua Shenzhen International Graduate School, Tsinghua University, 518071, Shenzhen, China.
Cellulose is the most abundant component of plant litter, which is critical for terrestrial carbon cycling. Nonetheless, it remains unknown how global warming affects cellulose-decomposing microorganisms. Here, we carried out a 3-year litterbag experiment to examine cellulose decomposition undergoing +3°C warming in a tallgrass prairie.
View Article and Find Full Text PDFFront Bioinform
February 2025
School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece.
Machine learning and genomic medicine are the mainstays of research in delivering personalized healthcare services for disease diagnosis, risk stratification, tailored treatment, and prediction of adverse effects. However, potential prediction errors in healthcare services can have life-threatening impact, raising reasonable skepticism about whether these applications have practical benefit in clinical settings. Conformal prediction offers a versatile framework for addressing these concerns by quantifying the uncertainty of predictive models.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!