Publications by authors named "Yu-Zong Chen"

Modeling molecular activity and quantitative structure-activity relationships of chemical compounds is critical in drug design. Graph neural networks, which utilize molecular structures as frames, have shown success in assessing the biological activity of chemical compounds, guiding the selection and optimization of candidates for further development. However, current models often overlook activity cliffs (ACs)-cases where structurally similar molecules exhibit different bioactivities-due to latent spaces primarily optimized for structural features.

View Article and Find Full Text PDF

Feature representation is critical for data learning, particularly in learning spectroscopic data. Machine learning (ML) and deep learning (DL) models learn Raman spectra for rapid, nondestructive, and label-free cell phenotype identification, which facilitate diagnostic, therapeutic, forensic, and microbiological applications. But these are challenged by high-dimensional, unordered, and low-sample spectroscopic data.

View Article and Find Full Text PDF

Knowledge of the collective activities of individual plants together with the derived clinical effects and targeted disease associations is useful for plant-based biomedical research. To provide the information in complement to the established databases, we introduced a major update of CMAUP database, previously featured in NAR. This update includes (i) human transcriptomic changes overlapping with 1152 targets of 5765 individual plants, covering 74 diseases from 20 027 patient samples; (ii) clinical information for 185 individual plants in 691 clinical trials; (iii) drug development information for 4694 drug-producing plants with metabolites developed into approved or clinical trial drugs; (iv) plant and human disease associations (428 737 associations by target, 220 935 reversion of transcriptomic changes, 764 and 154121 associations by clinical trials of individual plants and plant ingredients); (v) the location of individual plants in the phylogenetic tree for navigating taxonomic neighbors, (vi) DNA barcodes of 3949 plants, (vii) predicted human oral bioavailability of plant ingredients by the established SwissADME and HobPre algorithm, (viii) 21-107% increase of CMAUP data over the previous version to cover 60 222 chemical ingredients, 7865 plants, 758 targets, 1399 diseases, 238 KEGG human pathways, 3013 gene ontologies and 1203 disease ontologies.

View Article and Find Full Text PDF

Extensive investigations in outer space have revealed not only how life adapts to the space environment, but also that interesting biophysical phenomena occur. These phenomena affect human health and other life forms (animals, plants, bacteria, and fungi), and to ensure the safety of future human space exploration need to be further investigated. This calls for joint research efforts between biologists and physicists, as these phenomena present cross-disciplinary barriers.

View Article and Find Full Text PDF

Infrared (IR) spectroscopy is a powerful and versatile tool for analyzing functional groups in organic compounds. A complex and time-consuming interpretation of massive unknown spectra usually requires knowledge of chemistry and spectroscopy. This paper presents a new deep learning method for transforming IR spectral features into intuitive imagelike feature maps and prediction of major functional groups.

View Article and Find Full Text PDF

Extensive spaceflight life investigations (SLIs) have revealed observable space effects on plants, particularly their growth, nutrition yield, and secondary metabolite production. Knowledge of these effects not only facilitates space agricultural and biopharmaceutical technology development but also provides unique perspectives to ground-based investigations. SLIs are specialized experimental protocols and notable biological phenomena.

View Article and Find Full Text PDF

Clinical outcome prediction is important for stratified therapeutics. Machine learning (ML) and deep learning (DL) methods facilitate therapeutic response prediction from transcriptomic profiles of cells and clinical samples. Clinical transcriptomic DL is challenged by the low-sample sizes (34-286 subjects), high-dimensionality (up to 21,653 genes) and unordered nature of clinical transcriptomic data.

View Article and Find Full Text PDF

Wan Xiang Shen, a postdoctoral researcher at National University of Singapore, and Yu Zong Chen, the PI of the Bioinformatics and Drug Design (BIDD) group, have developed an AI pipeline for enhanced deep learning of metagenomic data. Their paper highlights the advantages of unsupervised data restructuring in microbiome-based disease prediction and biomarker discovery. They talk about their view of data science and the backstory of the article published in .

View Article and Find Full Text PDF

Metagenomic analysis has been explored for disease diagnosis and biomarker discovery. Low sample sizes, high dimensionality, and sparsity of metagenomic data challenge metagenomic investigations. Here, an unsupervised microbial embedding, grouping, and mapping algorithm (MEGMA) was developed to transform metagenomic data into individualized multichannel microbiome 2D representation by manifold learning and clustering of microbial profiles (e.

View Article and Find Full Text PDF

Quantitative activity and species source data of natural products (NPs) are important for drug discovery, medicinal plant research, and microbial investigations. Activity values of NPs against specific targets are useful for discovering targeted therapeutic agents and investigating the mechanism of medicinal plants. Composition/concentration values of NPs in individual species facilitate the assessments and investigations of the therapeutic quality of herbs and phenotypes of microbes.

View Article and Find Full Text PDF

Biological experiments performed in space crafts like space stations, space shuttles, and recoverable satellites has enabled extensive spaceflight life investigations (SLIs). In particular, SLIs have revealed distinguished space effects on microbial growth, survival, metabolite production, biofilm formation, virulence development and drug resistant mutations. These provide unique perspectives to ground-based microbiology and new opportunities for industrial pharmaceutical and metabolite productions.

View Article and Find Full Text PDF

Phosphorylation plays a key role in the regulation of protein function. In addition to the extensively studied O-phosphorylation of serine, threonine, and tyrosine, emerging evidence suggests that the non-canonical phosphorylation of histidine, lysine, and arginine termed N-phosphorylation, exists widely in eukaryotes. At present, the study of N-phosphorylation is still in its infancy, and its regulatory role and specific biological functions in mammalian cells are still unknown.

View Article and Find Full Text PDF

Objective: To investigate how the National Health Commission of China (NHCC)-recommended Chinese medicines (CMs) modulate the major maladjustments of coronavirus disease 2019 (COVID-19), particularly the clinically observed complications and comorbidities.

Methods: By focusing on the potent targets in common with the conventional medicines, we investigated the mechanisms of 11 NHCC-recommended CMs in the modulation of the major COVID-19 pathophysiology (hyperinflammations, viral replication), complications (pain, headache) and comorbidities (hypertension, obesity, diabetes). The constituent herbs of these CMs and their chemical ingredients were from the Traditional Chinese Medicine Information Database.

View Article and Find Full Text PDF

Emerging liquid biopsy methods for investigating biomarkers in bodily fluids such as blood, saliva, or urine can be used to perform noninvasive cancer detection. However, the complexity and heterogeneity of exosomes require improved methods to achieve the desired sensitivity and accuracy. Herein, we report our study on developing a breast cancer liquid biopsy system, including a fluorescence sensor array and deep learning (DL) tool AggMapNet.

View Article and Find Full Text PDF

Omics-based biomedical learning frequently relies on data of high-dimensions (up to thousands) and low-sample sizes (dozens to hundreds), which challenges efficient deep learning (DL) algorithms, particularly for low-sample omics investigations. Here, an unsupervised novel feature aggregation tool AggMap was developed to Aggregate and Map omics features into multi-channel 2D spatial-correlated image-like feature maps (Fmaps) based on their intrinsic correlations. AggMap exhibits strong feature reconstruction capabilities on a randomized benchmark dataset, outperforming existing methods.

View Article and Find Full Text PDF

Inspired by the traceable analogies between protein sequences and music notes, protein music has been composed from amino acid sequences for popularizing science and sourcing melodies. Despite the continuous development of protein-to-music algorithms, the musicality of protein music lags far behind human music. Musicality may be enhanced by fine-tuned protein-to-music mapping to the features of a specific music style.

View Article and Find Full Text PDF

Xenobiotic and host active substances interact with gut microbiota to influence human health and therapeutics. Dietary, pharmaceutical, herbal and environmental substances are modified by microbiota with altered bioavailabilities, bioactivities and toxic effects. Xenobiotics also affect microbiota with health implications.

View Article and Find Full Text PDF

Cancers resist targeted therapeutics by drug-escape signaling. Multitarget drugs co-targeting cancer and drug-escape mediators (DEMs) are clinically advantageous. DEM coverage may be expanded by drug combinations.

View Article and Find Full Text PDF

Protein dynamics is central to all biological processes, including signal transduction, cellular regulation and biological catalysis. Among them, in-depth exploration of ligand-driven protein dynamics contributes to an optimal understanding of protein function, which is particularly relevant to drug discovery. Hence, a wide range of computational tools have been designed to investigate the important dynamic information in proteins.

View Article and Find Full Text PDF

Human A adenosine receptor hAAR has been implicated in gastrointestinal cancer, where its cellular expression has been found increased, thus suggesting its potential as a molecular target for novel anticancer compounds. Observation made in our previous work indicated the importance of the carbonyl group of amide in the indolylpyrimidylpiperazine (IPP) for its human A adenosine receptor (hAAR) subtype binding selectivity over the other AR subtypes. Taking this observation into account, we structurally modified an indolylpyrimidylpiperazine (IPP) scaffold, (a non-selective adenosine receptors' ligand) into a modified IPP (mIPP) scaffold by switching the position of the carbonyl group, resulting in the formation of both ketone and tertiary amine groups in the new scaffold.

View Article and Find Full Text PDF