Publications by authors named "Yonghui Wu"

Recent advancements in large language models (LLMs) like ChatGPT and LLaMA have shown significant potential in medical applications, but their effectiveness is limited by a lack of specialized medical knowledge due to general-domain training. In this study, we developed Me-LLaMA, a new family of open-source medical LLMs that uniquely integrate extensive domain-specific knowledge with robust instruction-following capabilities. Me-LLaMA comprises foundation models (Me-LLaMA 13B and 70B) and their chat-enhanced versions, developed through comprehensive continual pretraining and instruction tuning of LLaMA2 models using both biomedical literature and clinical notes.

View Article and Find Full Text PDF

Recent studies recommend sublobectomy as a surgical approach for non-small cell lung cancer (NSCLC) tumors that are 2 cm or smaller. However, it remains unclear whether NSCLC patients with squamous cell carcinoma (SCC) have comparable outcomes to those with adenocarcinoma (ADC) following sublobectomy. To that end, this study aims to compare the survival outcomes between SCC and ADC in patients with stage IA NSCLC (≤ 2 cm) who have undergone sublobectomy.

View Article and Find Full Text PDF

Delirium is an acute decline or fluctuation in attention, awareness, or other cognitive function that can lead to serious adverse outcomes. Despite the severe outcomes, delirium is frequently unrecognized and uncoded in patients' electronic health records (EHRs) due to its transient and diverse nature. Natural language processing (NLP), a key technology that extracts medical concepts from clinical narratives, has shown great potential in studies of delirium outcomes and symptoms.

View Article and Find Full Text PDF
Article Synopsis
  • Delayed healing in diabetic wounds is primarily due to a dysfunctional microenvironment caused by high blood sugar and ongoing inflammation.
  • Topical microenvironment modulation, particularly using microneedles, offers a promising solution to enhance healing by delivering therapeutic agents directly to the wound's surface.
  • A hybrid microneedle has been developed incorporating carvacrol, cyclodextrin, mesoporous ceria nanoparticles, and hyaluronate, which improves healing by providing antibacterial, antioxidant, and anti-inflammatory effects to accelerate tissue reconstruction processes like cell proliferation and angiogenesis.
View Article and Find Full Text PDF

Background: Chronic obstructive pulmonary disease (COPD) is closely linked to lung cancer (LC) development. The aim of this study is to identify the genetic and clinical risk factors for LC risk in COPD, according to which the prediction model for LC in COPD was constructed.

Methods: This is a case-control study in which patientis with COPD + LC as the case group, patientis with only COPD as the control group, and patientis with only LC as the second control group.

View Article and Find Full Text PDF

Background: Integrating advanced machine-learning (ML) algorithms into clinical practice is challenging and requires interdisciplinary collaboration to develop transparent, interpretable, and ethically sound clinical decision support (CDS) tools. We aimed to design a ML-driven CDS tool to predict opioid overdose risk and gather feedback for its integration into the University of Florida Health (UFHealth) electronic health record (EHR) system.

Methods: We used user-centered design methods to integrate the ML algorithm into the EHR system.

View Article and Find Full Text PDF

Irradiation-resistance presents a substantial challenge in the successful application of radiotherapy for non-small-cell lung cancer (NSCLC). However, the specific molecular mechanisms responsible for irradiation-resistance have yet to be completely understood. In this research, the DNA methylation and gene expression patterns resulting from irradiation treatment were produced using the DNA methylation BeadChip and RNA-Seq.

View Article and Find Full Text PDF

Background: Immunochemotherapy involving the combination of programmed cell death 1/programmed cell death ligand 1 inhibitors with chemotherapy has advanced the treatment of locally advanced esophageal squamous cell carcinoma (ESCC). The use of corticosteroids as pretreatment might reduce immunotherapy efficacy.

Aim: To investigate the impact of baseline corticosteroid use on neoadjuvant immunochemotherapy (nIC) outcomes in locally advanced ESCC patients.

View Article and Find Full Text PDF

Aim: To develop an automated computable phenotype (CP) algorithm for identifying diabetes cases in children and adolescents using electronic health records (EHRs) from the UF Health System.

Materials And Methods: The CP algorithm was iteratively derived based on structured data from EHRs (UF Health System 2012-2020). We randomly selected 536 presumed cases among individuals aged <18 years who had (1) glycated haemoglobin levels ≥ 6.

View Article and Find Full Text PDF
Article Synopsis
  • Immunotherapy is a new way to help fight cancer, especially lung cancer, but there are still some problems like not all patients responding well to treatment.
  • Researchers studied different samples from patients to see how their immune cells reacted to treatment.
  • They found important differences in immune cells between patients who responded well and those who didn't, which might help predict who will do better with immunotherapy in the future.
View Article and Find Full Text PDF

Background: We aim to use Natural Language Processing to automate the extraction and classification of thyroid cancer risk factors from pathology reports.

Methods: We analyzed 1410 surgical pathology reports from adult papillary thyroid cancer patients from 2010 to 2019. Structured and nonstructured reports were used to create a consensus-based ground truth dictionary and categorized them into modified recurrence risk levels.

View Article and Find Full Text PDF
Article Synopsis
  • * Exposure to ACR leads to the upregulation of eEF2K, which is linked to learning and memory deficits, and manipulating eEF2K levels improved these cognitive issues in experiments.
  • * The findings suggest that eEF2K could be a promising target for clinical interventions aimed at countering ACR-related cognitive impairments by influencing lipid metabolism in the brain.
View Article and Find Full Text PDF
Article Synopsis
  • * Researchers used a dataset of 3,080 patients and manually annotated 394 reports to establish a gold standard for training various transformer models, such as BERT and RoBERTa, for better accuracy in identifying and linking nodule characteristics.
  • * The results showed that the RoBERTa-mimic model excelled in nodule concept extraction with a score of 0.9279, while ALBERT-base and GatorTron were effective in linking characteristics, achieving a score of 0.9737;
View Article and Find Full Text PDF
Article Synopsis
  • CdS quantum dots are promising photocatalysts due to their effective visible light absorption, but their application is limited by photocorrosion issues.
  • Researchers developed a composite photocatalyst by integrating CoS nanotubes with CdS QDs, which helps to stabilize the CdS structure and prevent photocorrosion.
  • The best-performing composite, with 30% CoS, achieved a hydrogen production rate of 9642.7 μmol·g·h, significantly improving photocatalytic efficiency compared to the original CdS QDs.
View Article and Find Full Text PDF

Hospital-acquired falls are a continuing clinical concern. The emergence of advanced analytical methods, including NLP, has created opportunities to leverage nurse-generated data, such as clinical notes, to better address the problem of falls. In this nurse-driven study, we employed an iterative process for expert manual annotation of RNs clinical notes to enable the training and testing of an NLP pipeline to extract factors related to falls.

View Article and Find Full Text PDF

The abundant water wave energy on Earth stands as one of the most promising renewable blue energy sources, as it exhibits minimal dependence on weather, time and temperature. However, the low fluctuation frequency and extremely irregular nature of the wave energy restrict both the methods and efficiency of energy harvesting. In this study, a packed box-like hybrid nanogenerator was designed, comprising two single-electrode triboelectric nanogenerators (TENGs) and two electromagnetic generators (EMGs).

View Article and Find Full Text PDF

Transforming growth factor-β (TGF-β) signaling pathway serves a pivotal role in the pathogenesis of colorectal cancer (CRC). However, the specific molecular mechanisms by which the TGF-β signaling pathway regulates CRC are still not fully understood. In the present study, metabolomics and transcriptomics were used to screen for key metabolites and regulatory genes most related to the regulation of the TGF-β signaling pathway in CRC.

View Article and Find Full Text PDF

Background: Pediatric asthma is a heterogeneous disease; however, current characterizations of its subtypes are limited. Machine learning (ML) methods are well-suited for identifying subtypes. In particular, deep neural networks can learn patient representations by leveraging longitudinal information captured in electronic health records (EHRs) while considering future outcomes.

View Article and Find Full Text PDF

Introduction: Alzheimer's disease (AD) is often misclassified in electronic health records (EHRs) when relying solely on diagnosis codes. This study aimed to develop a more accurate, computable phenotype (CP) for identifying AD patients using structured and unstructured EHR data.

Methods: We used EHRs from the University of Florida Health (UFHealth) system and created rule-based CPs iteratively through manual chart reviews.

View Article and Find Full Text PDF

This study aimed to review the application of natural language processing (NLP) in thyroid-related conditions and to summarize current challenges and potential future directions. We performed a systematic search of databases for studies describing NLP applications in thyroid conditions published in English between January 1, 2012 and November 4, 2022. In addition, we used a snowballing technique to identify studies missed in the initial search or published after our search timeline until April 1, 2023.

View Article and Find Full Text PDF

There is growing evidence linking glutamine levels to the risk of gastrointestinal diseases, yet the presence of a causal relationship remains uncertain. In this study, we employed a Mendelian randomization (MR) approach to investigate potential causal associations between glutamine and colitis, inflammatory bowel disease (IBD), and digestive tumors. Genetic instrumental variables for glutamine exposure were identified from a genome-wide association study (GWAS) involving 114,751 participants.

View Article and Find Full Text PDF

Objectives: Radiomics has been demonstrated to be strongly associated with TNM stage and patient prognosis. We aimed to develop a model for predicting lymph node metastasis (LNM) and survival.

Methods: For radiomics texture selection, 3D Slicer 5.

View Article and Find Full Text PDF

Ulcerative colitis (UC) is a chronic and recurrent inflammatory disease that affects the colon and rectum. The response to treatment varies among individuals with UC. Therefore, the aim of this study was to identify and explore potential biomarkers for different subtypes of UC and examine their association with immune cell infiltration.

View Article and Find Full Text PDF

Recent advancements in large language models (LLMs) such as ChatGPT and LLaMA have hinted at their potential to revolutionize medical applications, yet their application in clinical settings often reveals limitations due to a lack of specialized training on medical-specific data. In response to this challenge, this study introduces Me-LLaMA, a novel medical LLM family that includes foundation models - Me-LLaMA 13/70B, along with their chat-enhanced versions - Me-LLaMA 13/70B-chat, developed through continual pre-training and instruction tuning of LLaMA2 using large medical datasets. Our methodology leverages a comprehensive domain-specific data suite, including a large-scale, continual pre-training dataset with 129B tokens, an instruction tuning dataset with 214k samples, and a new medical evaluation benchmark (MIBE) across six critical medical tasks with 12 datasets.

View Article and Find Full Text PDF