Publications by authors named "Sakib Abrar"

For 20 years, the Comparative Toxicogenomics Database (CTD; https://ctdbase.org) has provided high-quality, literature-based curated content describing how environmental chemicals affect human health. Today, CTD includes over 94 million toxicogenomic connections relating chemicals, genes/proteins, phenotypes, anatomical terms, diseases, comparative species, pathways and exposures.

View Article and Find Full Text PDF

In environmental health, the specific molecular mechanisms connecting a chemical exposure to an adverse endpoint are often unknown, reflecting knowledge gaps. At the public Comparative Toxicogenomics Database (CTD; https://ctdbase.org/), we integrate manually curated, literature-based interactions from CTD to compute four-unit blocks of information organized as a potential step-wise molecular mechanism, known as "CGPD-tetramers," wherein a chemical interacts with a gene product to trigger a phenotype which can be linked to a disease.

View Article and Find Full Text PDF

Introduction: Severity of penile curvature (PC) is commonly used to select the optimal surgical intervention for hypospadias, either alone or in conjunction with other phenotypic characteristics. Despite this, current literature on the accuracy and precision of different PC measurement techniques in hypospadias patients remains limited.

Purpose: Assess the feasibility and validity of an artificial intelligence (AI)-based model for automatic measurement of PC.

View Article and Find Full Text PDF

Fully connected deep neural networks (DNN) often include redundant weights leading to overfitting and high memory requirements. Additionally, in tabular data classification, DNNs are challenged by the often superior performance of traditional machine learning models. This paper proposes periodic perturbations (prune and regrow) of DNN weights, especially at the self-supervised pre-training stage of deep autoencoders.

View Article and Find Full Text PDF

Missing values in tabular data restrict the use and performance of machine learning, requiring the imputation of missing values. The most popular imputation algorithm is arguably multiple imputations using chains of equations (MICE), which estimates missing values from linear conditioning on observed values. This paper proposes methods to improve both the imputation accuracy of MICE and the classification accuracy of imputed data by replacing MICE's linear regressors with ensemble learning and deep neural networks (DNN).

View Article and Find Full Text PDF