Data-Driven Imputation of Miscibility of Aqueous Solutions via Graph-Regularized Logistic Matrix Factorization.

J Phys Chem B

Department of Chemistry, Fordham University, The Bronx, New York 10458, United States.

Published: September 2023

Aqueous, two-phase systems (ATPSs) may form upon mixing two solutions of independently water-soluble compounds. Many separation, purification, and extraction processes rely on ATPSs. Predicting the miscibility of solutions can accelerate and reduce the cost of the discovery of new ATPSs for these applications. Whereas previous machine learning approaches to ATPS prediction used physicochemical properties of each solute as a descriptor, in this work, we show how to impute missing miscibility outcomes directly from an incomplete collection of pairwise miscibility experiments. We use graph-regularized logistic matrix factorization (GR-LMF) to learn a latent vector of each solution from (i) the observed entries in the pairwise miscibility matrix and (ii) a graph where each node is a solution and edges are relationships indicating the general category of the solute (i.e., polymer, surfactant, salt, protein). For an experimental data set of the pairwise miscibility of 68 solutions from Peacock et al. [ , , 11449-11460], we find that GR-LMF more accurately predicts missing (im)miscibility outcomes of pairs of solutions than ordinary logistic matrix factorization and random forest classifiers that use physicochemical features of the solutes. GR-LMF obviates the need for features of the solutions and solutions to impute missing miscibility outcomes, but it cannot predict the miscibility of a new solution without some observations of its miscibility with other solutions in the training data set.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jpcb.3c03789DOI Listing

Publication Analysis

Top Keywords

logistic matrix
12
matrix factorization
12
miscibility solutions
12
pairwise miscibility
12
miscibility
9
solutions
8
graph-regularized logistic
8
impute missing
8
missing miscibility
8
miscibility outcomes
8

Similar Publications

Background: With a rapidly aging population, South Korea anticipates a surge in Alzheimer disease (AD). However, the genetic basis of AD in Koreans is not well understood.

Method: We sequenced the genomes of 3,540 Koreans (1,583 AD cases and 1,957 controls) older than age 60 and performed a genome-wide association study (GWAS) of AD using logistic regression models that included covariates for age, sex, five ancestry principal components, and an empirical genetic relationship matrix.

View Article and Find Full Text PDF

Objectives: Develop risk-adapted conditional biopsy pathways utilizing MRI in combination with prostate-specific antigen (PSA) density (PSAD) and the ratio of free to total PSA (f/tPSA), respectively, to enhance the detection of clinically significant prostate cancer (csPCa) while minimizing 'negative' biopsies in low-risk patients.

Methods: The Prostate Imaging Reporting and Data System (PI-RADS) category, PSAD, f/tPSA and biopsy-pathology of 1018 patients were collected retrospectively. Subsequently, PSAD and f/tPSA were divided into four intervals, which were then combined with the MRI findings to construct two risk stratification matrix tables.

View Article and Find Full Text PDF

Purpose: Based on the well-known risks associated with deviating from established routines in primary healthcare and the positive consequences of upholding them, the purpose of this study is to increase the understanding of the role of meaningfulness in the enactment of organizational routines.

Design/methodology/approach: The study is based on 24 semi-structured interviews with three different professional categories in primary healthcare in Sweden. The data were analyzed using thematic analysis on a latent level, combined with a two-factor model as sensitizing concepts.

View Article and Find Full Text PDF

Background: Hyperostosis is a common radiographic feature of inverted papilloma (IP) tumor origin on computed tomography (CT). Herein, we developed a machine learning (ML) model capable of analyzing CT images and identifying IP attachment sites.

Methods: A retrospective review of patients treated for IP at our institution was performed.

View Article and Find Full Text PDF

Background: An important indicator of mothers' satisfaction with their care is birth satisfaction. Maternal health care can only be deemed to be of good quality if mothers are satisfied with the care they received. This increases maternal joy.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!