Publications by authors named "Hao H Zhang"

Article Synopsis
  • This study explores the effectiveness of machine learning algorithms in predicting viral suppression among people living with HIV in South Carolina.
  • The data was collected from adult patients diagnosed between 2005 and 2021, focusing on various predictors like demographics, healthcare use, and previous viral load information.
  • Results showed that the Long Short-Term Memory (LSTM) model significantly outperformed traditional statistical methods, indicating its potential for better risk prediction in managing HIV.
View Article and Find Full Text PDF

Primary Central Nervous System tumors in the brain are among the most aggressive diseases affecting humans. Early detection and classification of brain tumor types, whether benign or malignant, glial or non-glial, is critical for cancer prevention and treatment, ultimately improving human life expectancy. Magnetic Resonance Imaging (MRI) is the most effective technique for brain tumor detection, generating comprehensive brain scans.

View Article and Find Full Text PDF

Nucleotide modifications deviate nanopore sequencing readouts, therefore generating artifacts during the basecalling of sequence backbones. Here, we present an iterative approach to polish modification-disturbed basecalling results. We show such an approach is able to promote the basecalling accuracy of both artificially-synthesized and real-world molecules.

View Article and Find Full Text PDF

Conflicting clinical trial results on omega-3 highly unsaturated fatty acids (n-3 HUFA) have prompted uncertainty about their cardioprotective effects. While the VITAL trial found no overall cardiovascular benefit from n-3 HUFA supplementation, its substantial African American (AfAm) enrollment provided a unique opportunity to explore racial differences in response to n-3 HUFA supplementation. The current observational study aimed to simulate randomized clinical trial (RCT) conditions by matching 3766 AfAm and 15,553 non-Hispanic White (NHW) individuals from the VITAL trial utilizing propensity score matching to address the limitations related to differences in confounding variables between the two groups.

View Article and Find Full Text PDF

Accurately basecalling sequence backbones in the presence of nucleotide modifications remains a substantial challenge in nanopore sequencing bioinformatics. It has been extensively demonstrated that state-of-the-art basecallers are less compatible with modification-induced sequencing signals. A precise basecalling, on the other hand, serves as the prerequisite for virtually all the downstream analyses.

View Article and Find Full Text PDF
Article Synopsis
  • The study focuses on enhancing nanopore sequencing basecallers using machine learning to detect nucleotide modifications, which are significant for biological research.
  • The researchers use incremental learning to better interpret sequences rich in modifications and apply anomaly detection on individual nucleotides to identify their modified status.
  • They tested their method on various biological samples, including yeast tRNAs and mammalian mRNAs, effectively demonstrating the pipeline's ability to detect multiple modifications simultaneously; the workflow is publicly available on GitHub.
View Article and Find Full Text PDF

The secreted phospholipase A (sPLA) isoform, sPLA-IIA, has been implicated in a variety of diseases and conditions, including bacteremia, cardiovascular disease, COVID-19, sepsis, adult respiratory distress syndrome, and certain cancers. Given its significant role in these conditions, understanding the regulatory mechanisms impacting its levels is crucial. Genome-wide association studies (GWAS) have identified several single nucleotide polymorphisms (SNPs), including rs11573156, that are associated with circulating levels of sPLA-IIA.

View Article and Find Full Text PDF

Maintaining retention in care (RIC) for people living with HIV (PLWH) helps achieve viral suppression and reduce onward transmission. This study aims to identify the best machine learning model that predicts the RIC transition over time. Extracting from the enhanced HIV/AIDS reporting system, this study included 9765 PLWH from 2005 to 2020 in South Carolina.

View Article and Find Full Text PDF

Previous research suggests that group IIA-secreted phospholipase A (sPLA-IIA) plays a role in and predicts lethal COVID-19 disease. The current study reanalyzed a longitudinal proteomic data set to determine the temporal relationship between levels of several members of a family of sPLA isoforms and the severity of COVID-19 in 214 ICU patients. The levels of six secreted PLA isoforms, sPLA-IIA, sPLA-V, sPLA-X, sPLA-IB, sPLA-IIC, and sPLA-XVI, increased over the first 7 ICU days in those who succumbed to the disease but attenuated over the same time period in survivors.

View Article and Find Full Text PDF

Objective: Recent evidence suggests that the fimbriated end of the fallopian tube harbors the precursor cells for many high-grade ovarian cancers, opening the door for development of better screening methods that directly assess the fallopian tube in women at risk for malignancy. Previously we have shown that the karyometric signature is abnormal in the fallopian tube epithelium in women at hereditary risk of ovarian cancer. In this study, we sought to determine whether the karyometric signature in serous tubal intraepithelial carcinoma (STIC) is significantly different from normal, and whether an abnormal karyometric signature can be detected in histologically normal tubal epithelial cells adjacent to STIC lesions.

View Article and Find Full Text PDF

We leverage machine learning approaches to adapt nanopore sequencing basecallers for nucleotide modification detection. We first apply the incremental learning technique to improve the basecalling of modification-rich sequences, which are usually of high biological interests. With sequence backbones resolved, we further run anomaly detection on individual nucleotides to determine their modification status.

View Article and Find Full Text PDF

The secreted phospholipase A (sPLA ) isoform, sPLA -IIA, has been implicated in a variety of diseases and conditions, including bacteremia, cardiovascular disease, COVID-19, sepsis, adult respiratory distress syndrome, and certain cancers. Given its significant role in these conditions, understanding the regulatory mechanisms impacting its levels is crucial. Genome-wide association studies (GWAS) have identified several single nucleotide polymorphisms (SNPs), including rs11573156, that are associated with circulating levels of sPLA -IIA.

View Article and Find Full Text PDF

Previous research suggests that group IIA secreted phospholipase A (sPLA -IIA) plays a role in and predicts severe COVID-19 disease. The current study reanalyzed a longitudinal proteomic data set to determine the temporal (days 0, 3 and 7) relationship between the levels of several members of a family of sPLA isoforms and the severity of COVID-19 in 214 ICU patients. The levels of six secreted PLA isoforms, sPLA -IIA, sPLA -V, sPLA -X, sPLA -IB, sPLA -IIC, and sPLA -XVI, increased over the first 7 ICU days in those who succumbed to the disease.

View Article and Find Full Text PDF

Background: Little is known about whether machine-learning algorithms developed to predict opioid overdose using earlier years and from a single state will perform as well when applied to other populations. We aimed to develop a machine-learning algorithm to predict 3-month risk of opioid overdose using Pennsylvania Medicaid data and externally validated it in two data sources (ie, later years of Pennsylvania Medicaid data and data from a different state).

Methods: This prognostic modelling study developed and validated a machine-learning algorithm to predict overdose in Medicaid beneficiaries with one or more opioid prescription in Pennsylvania and Arizona, USA.

View Article and Find Full Text PDF

Learning predictive models in new domains with scarce training data is a growing challenge in modern supervised learning scenarios. This incentivizes developing domain adaptation methods that leverage the knowledge in known domains (source) and adapt to new domains (target) with a different probability distribution. This becomes more challenging when the source and target domains are in heterogeneous feature spaces, known as heterogeneous domain adaptation (HDA).

View Article and Find Full Text PDF

There is an urgent need to identify the cellular and molecular mechanisms responsible for severe COVID-19 that results in death. We initially performed both untargeted and targeted lipidomics as well as focused biochemical analyses of 127 plasma samples and found elevated metabolites associated with secreted phospholipase A2 (sPLA2) activity and mitochondrial dysfunction in patients with severe COVID-19. Deceased COVID-19 patients had higher levels of circulating, catalytically active sPLA2 group IIA (sPLA2-IIA), with a median value that was 9.

View Article and Find Full Text PDF

There is an urgent need to identify cellular and molecular mechanisms responsible for severe COVID-19 disease accompanied by multiple organ failure and high mortality rates. Here, we performed untargeted/targeted lipidomics and focused biochemistry on 127 patient plasma samples, and showed high levels of circulating, enzymatically active secreted phospholipase A Group IIA (sPLA -IIA) in severe and fatal COVID-19 disease compared with uninfected patients or mild illness. Machine learning demonstrated that sPLA -IIA effectively stratifies severe from fatal COVID-19 disease.

View Article and Find Full Text PDF

Developing patient-centric baseline standards that enable the detection of clinically significant outlier gene products on a genome-scale remains an unaddressed challenge required for advancing personalized medicine beyond the small pools of subjects implied by "precision medicine". This manuscript proposes a novel approach for reference standard development to evaluate the accuracy of single-subject analyses of transcriptomes and offers extensions into proteomes and metabolomes. In evaluation frameworks for which the distributional assumptions of statistical testing imperfectly model genome dynamics of gene products, artefacts and biases are confounded with authentic signals.

View Article and Find Full Text PDF

Background: In this era of data science-driven bioinformatics, machine learning research has focused on feature selection as users want more interpretation and post-hoc analyses for biomarker detection. However, when there are more features (i.e.

View Article and Find Full Text PDF

Objective: To develop and validate a machine-learning algorithm to improve prediction of incident OUD diagnosis among Medicare beneficiaries with ≥1 opioid prescriptions.

Methods: This prognostic study included 361,527 fee-for-service Medicare beneficiaries, without cancer, filling ≥1 opioid prescriptions from 2011-2016. We randomly divided beneficiaries into training, testing, and validation samples.

View Article and Find Full Text PDF

Recent years have witnessed the tendency of measuring a biological sample on multiple omics scales for a comprehensive understanding of how biological activities on varying levels are perturbed by genetic variants, environments, and their interactions. This new trend raises substantial challenges to data integration and fusion, of which the latter is a specific type of integration that applies a uniform method in a scalable manner, to solve biological problems which the multi-omics measurements target. Fusion-based analysis has advanced rapidly in the past decade, thanks to application drivers and theoretical breakthroughs in mathematics, statistics, and computer science.

View Article and Find Full Text PDF

The reactivation of quiescent cells to proliferate is fundamental to tissue repair and homeostasis in the body. Often referred to as the G0 state, quiescence is, however, not a uniform state but with graded depth. Shallow quiescent cells exhibit a higher tendency to revert to proliferation than deep quiescent cells, while deep quiescent cells are still fully reversible under physiological conditions, distinct from senescent cells.

View Article and Find Full Text PDF

A chemopreventive effect of aspirin (ASA) on lung cancer risk is supported by epidemiologic and preclinical studies. We conducted a randomized, double-blinded study in current heavy smokers to compare modulating effects of intermittent versus continuous low-dose ASA on nasal epithelium gene expression and arachidonic acid (ARA) metabolism. Fifty-four participants were randomized to intermittent (ASA 81 mg daily for one week/placebo for one week) or continuous (ASA 81 mg daily) for 12 weeks.

View Article and Find Full Text PDF

A large body of epidemiologic evidence has shown that use of progestin-containing preparations lowers ovarian cancer risk. The purpose of the current study was to gather further preclinical evidence supporting progestins as cancer chemopreventives by demonstrating progestin-activation of surrogate endpoint biomarkers pertinent to cancer prevention in the genital tract of women at increased risk of ovarian cancer. There were 64 women enrolled in a multi-institutional randomized trial who chose to undergo risk-reducing bilateral salpingo-oophorectomy (BSO) and to receive the progestin levonorgestrel or placebo for 4 to 6 weeks prior to undergoing BSO.

View Article and Find Full Text PDF