The Translational Machine: A novel machine-learning approach to illuminate complex genetic architectures.

Genet Epidemiol

Department of Biostatistics, Epidemiology, & Informatics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Published: July 2021

The Translational Machine (TM) is a machine learning (ML)-based analytic pipeline that translates genotypic/variant call data into biologically contextualized features that richly characterize complex variant architectures and permit greater interpretability and biological replication. It also reduces potentially confounding effects of population substructure on outcome prediction. The TM consists of three main components. First, replicable but flexible feature engineering procedures translate genome-scale data into biologically informative features that appropriately contextualize simple variant calls/genotypes within biological and functional contexts. Second, model-free, nonparametric ML-based feature filtering procedures empirically reduce dimensionality and noise of both original genotype calls and engineered features. Third, a powerful ML algorithm for feature selection is used to differentiate risk variant contributions across variant frequency and functional prediction spectra. The TM simultaneously evaluates potential contributions of variants operative under polygenic and heterogeneous models of genetic architecture. Our TM enables integration of biological information (e.g., genomic annotations) within conceptual frameworks akin to geneset-/pathways-based and collapsing methods, but overcomes some of these methods' limitations. The full TM pipeline is executed in R. Our approach and initial findings from its application to a whole-exome schizophrenia case-control data set are presented. These TM procedures extend the findings of the primary investigation and yield novel results.
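To make the first two components concrete, here is a minimal Python sketch; it is not the authors' R implementation. It assumes feature engineering can be illustrated as collapsing variant calls into per-gene counts of rare, predicted-damaging alleles, and it uses a crude case/control mean-difference rule as a stand-in for the nonparametric ML filter. The third component (ML-based feature selection) is omitted. All data, names (`engineer_features`, `filter_features`), and thresholds (`maf_cutoff`, `min_gap`) are hypothetical.

```python
from collections import defaultdict

# Hypothetical toy data: minor-allele counts (0, 1, 2) per sample and variant.
genotypes = {
    "s1": {"v1": 1, "v2": 0, "v3": 0, "v4": 2},
    "s2": {"v1": 0, "v2": 1, "v3": 0, "v4": 1},
    "s3": {"v1": 0, "v2": 0, "v3": 0, "v4": 2},
    "s4": {"v1": 0, "v2": 0, "v3": 1, "v4": 1},
}
# Hypothetical annotations: gene, minor-allele frequency, predicted-damaging flag.
annotations = {
    "v1": {"gene": "GENE_A", "maf": 0.001, "damaging": True},
    "v2": {"gene": "GENE_A", "maf": 0.004, "damaging": True},
    "v3": {"gene": "GENE_B", "maf": 0.002, "damaging": True},
    "v4": {"gene": "GENE_B", "maf": 0.300, "damaging": True},
}
is_case = {"s1": True, "s2": True, "s3": False, "s4": False}


def engineer_features(genotypes, annotations, maf_cutoff=0.01):
    """Stage 1 (illustrative): per-gene count of rare, damaging alleles."""
    features = {}
    for sample, calls in genotypes.items():
        burden = defaultdict(int)
        for variant, count in calls.items():
            ann = annotations[variant]
            if ann["maf"] < maf_cutoff and ann["damaging"]:
                burden[f'{ann["gene"]}_rare_damaging'] += count
        features[sample] = dict(burden)
    return features


def filter_features(features, is_case, min_gap=0.75):
    """Stage 2 (stand-in): keep features whose case and control means differ.

    A real pipeline would use a model-free ML filter here; this simple
    threshold only illustrates where the filtering step sits.
    """
    names = sorted({name for row in features.values() for name in row})
    kept = []
    for name in names:
        case_vals = [row.get(name, 0) for s, row in features.items() if is_case[s]]
        ctrl_vals = [row.get(name, 0) for s, row in features.items() if not is_case[s]]
        gap = abs(sum(case_vals) / len(case_vals) - sum(ctrl_vals) / len(ctrl_vals))
        if gap >= min_gap:
            kept.append(name)
    return kept


features = engineer_features(genotypes, annotations)
selected = filter_features(features, is_case)
print(selected)
```

In this toy data the common variant `v4` never enters a feature because it fails the frequency cutoff, which illustrates how annotation-based engineering separates variants across the frequency spectrum before any filtering takes place.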

Source
http://dx.doi.org/10.1002/gepi.22383

Similar Publications

Machine Learning-enhanced X-ray-based Radiomics in the Identification of Post-COVID Patients.

Arch Bronconeumol

December 2024

National Koranyi Institute of Pulmonology, Budapest, Hungary; Department of Thoracic Surgery, Semmelweis University and National Institute of Oncology, Budapest, Hungary; Department of Thoracic Surgery, Comprehensive Cancer Center Vienna, Medical University of Vienna, Vienna, Austria.

Developing and experimental validating a T cell senescence-related gene signature to predict prognosis and immunotherapeutic sensitivity in non-small cell lung cancer.

Gene

January 2025

Department of Thoracic Oncology Surgery, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou 350011 Fujian Province, PR China.

Background: T cell senescence affects non-small cell lung cancer (NSCLC) by compromising the anti-tumor immune response. However, the prognostic significance of T cell senescence-related genes in NSCLC remains unclear.

Methods: The scRNA-seq data from normal lung and NSCLC tissues, along with co-incubation experiments involving NSCLC cells and T cells, were utilized to identify T cell senescence characteristics.

Article Synopsis
  • Left ventricular hypertrophy (LVH) is linked to serious cardiovascular issues, and identifying its cause is important for treatment; this systematic review explores how AI can help in diagnosing LVH and its causes from imaging data.
  • A thorough search was conducted utilizing multiple databases, leading to the inclusion of 30 studies which mainly focused on echocardiography and cardiac magnetic resonance imaging (CMR), with a smaller number on cardiac computed tomography (CT).
  • The review found that AI methods, especially deep learning and convolutional neural networks, showed good diagnostic performance, with the highest accuracy in identifying the causes of LVH rather than just detecting it; more real-life validation studies and cost-effectiveness assessments are recommended.

A generalizable methodology for predicting retention time of small molecule pharmaceutical compounds across reversed-phase HPLC columns.

J Chromatogr A

December 2024

Synthetic Molecule Pharmaceutical Science, gRED, Genentech, Inc., 1 DNA Way, South San Francisco, CA, 94080, United States.

Quantitative structure retention relation (QSRR) is an active field of research, primarily focused on predicting chromatography retention time (Rt) based on molecular structures of an input analyte on a single or limited number of reversed-phase HPLC (RP-HPLC) columns. However, in the pharmaceutical chemistry manufacturing and controls (CMC) settings, single-column QSRR models are often insufficient. It is important to translate retention time across different HPLC methods, specifically different stationary phases (SP) and mobile phases (MP), to guide the HPLC method development, and to bridge organic impurity profiles across different development phases and laboratories.

Integrating Model-Informed Drug Development With AI: A Synergistic Approach to Accelerating Pharmaceutical Innovation.

Clin Transl Sci

January 2025

Global Biometrics and Data Management, Pfizer Research and Development, New York, New York, USA.

The pharmaceutical industry constantly strives to improve drug development processes to reduce costs, increase efficiencies, and enhance therapeutic outcomes for patients. Model-Informed Drug Development (MIDD) uses mathematical models to simulate intricate processes involved in drug absorption, distribution, metabolism, and excretion, as well as pharmacokinetics and pharmacodynamics. Artificial intelligence (AI), encompassing techniques such as machine learning, deep learning, and Generative AI, offers powerful tools and algorithms to efficiently identify meaningful patterns, correlations, and drug-target interactions from big data, enabling more accurate predictions and novel hypothesis generation.
