Ensuring reliability of Large Language Models (LLMs) in clinical tasks is crucial. Our study assesses two state-of-the-art LLMs (ChatGPT and LlaMA-2) for extracting clinical information, focusing on cognitive tests like MMSE and CDR. Our data consisted of 135,307 clinical notes (Jan 12th, 2010 to May 24th, 2023) mentioning MMSE, CDR, or MoCA.
View Article and Find Full Text PDFObjectives: The study aimed to assess the usage and impact of a private and secure instance of a generative artificial intelligence (GenAI) application in a large academic health center. The goal was to understand how employees interact with this technology and the influence on their perception of skill and work performance.
Materials And Methods: New York University Langone Health (NYULH) established a secure, private, and managed Azure OpenAI service (GenAI Studio) and granted widespread access to employees.
Background: Healthcare crowdsourcing events (e.g. hackathons) facilitate interdisciplinary collaboration and encourage innovation.
View Article and Find Full Text PDFImportance: Large language models (LLMs) are crucial for medical tasks. Ensuring their reliability is vital to avoid false results. Our study assesses two state-of-the-art LLMs (ChatGPT and LlaMA-2) for extracting clinical information, focusing on cognitive tests like MMSE and CDR.
View Article and Find Full Text PDFImprovements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material.
View Article and Find Full Text PDFPeptide and protein identification via tandem mass spectrometry (MS/MS) lies at the heart of proteomic characterization of biological samples. Several algorithms are able to search, score, and assign peptides to large MS/MS datasets. Most popular methods, however, underutilize the intensity information available in the tandem mass spectrum due to the complex nature of the peptide fragmentation process, thus contributing to loss of potential identifications.
View Article and Find Full Text PDFInt Conf Collab Comput
January 2012
Mass-spectrometry (MS) based proteomics has become a key enabling technology for the systems approach to biology, providing insights into the protein complement of an organism. Bioinformatics analyses play a critical role in interpretation of large, and often replicated, MS datasets generated across laboratories and institutions. A significant amount of computational effort in the workflow is spent on the identification of protein and peptide components of complex biological samples, and consists of a series of steps relying on large database searches and intricate scoring algorithms.
View Article and Find Full Text PDFIntroduction: Lung cancer remains the leading cause of cancer-related death with poor survival due to the late stage at which lung cancer is typically diagnosed. Given the clinical burden from lung cancer and the relatively favorable survival associated with early-stage lung cancer, biomarkers for early detection of lung cancer are of important potential clinical benefit.
Methods: We performed a global lung cancer serum biomarker discovery study using liquid chromatography-tandem mass spectrometry in a set of pooled non-small cell lung cancer case sera and matched controls.
This paper offers a potential measurement solution for assessing disaster impacts and subsequent recovery at the household level by using a modified domestic assets index (MDAI) approach. Assessment of the utility of the domestic assets index first proposed by Bates, Killian and Peacock (1984) has been confined to earthquake areas in the Americas and southern Europe. This paper modifies and extends the approach to the Indian sub-continent and to coastal surge hazards utilizing data collected from 1,000 households impacted by the Indian Ocean tsunami (2004) in the Nagapattinam district of south-eastern India.
View Article and Find Full Text PDFThe Empirical Proteomic Ontology Knowledge Base (EPO-KB) is an online database that represents current knowledge of biomarkers and contains associations between mass-to-charge (m/z) ratios of mass-spectrometry peaks to proteins. Such a database is a useful tool for identifying putative proteins associated with a m/z ratio. At present, EPO-KB contains data that have been extracted from 120 published research papers.
View Article and Find Full Text PDFDiscretization acts as a variable selection method in addition to transforming the continuous values of the variable to discrete ones. Machine learning algorithms such as Support Vector Machines and Random Forests have been used for classification in high-dimensional genomic and proteomic data due to their robustness to the dimensionality of the data. We show that discretization can help improve significantly the classification performance of these algorithms as well as algorithms like Naïve Bayes that are sensitive to the dimensionality of the data.
View Article and Find Full Text PDFStudies on the impacts of hurricanes, tropical storms, and tornados indicate that poor communities of colour suffer disproportionately in human death and injury.(2) Few quantitative studies have been conducted on the degree to which flood events affect socially vulnerable populations. We address this research void by analysing 832 countywide flood events in Texas from 1997-2001.
View Article and Find Full Text PDFFloods continue to pose the greatest threat to the property and safety of human communities among all natural hazards in the United States. This study examines the relationship between the built environment and flood impacts in Texas, which consistently sustains the most damage from flooding of any other state in the country. Specifically, we calculate property damage resulting from 423 flood events between 1997 and 2001 at the county level.
View Article and Find Full Text PDFRecent interest in expanding offshore oil production within waters of the United States has been met with opposition by groups concerned with recreational, environmental, and aesthetic values associated with the coastal zone. Although the proposition of new oil platforms off the coast has generated conflict over how coastal resources should be utilized, little research has been conducted on where these user conflicts might be most intense and which sites might be most suitable for locating oil production facilities in light of the multiple, and often times, competing interests. In this article, we develop a multiple-criteria spatial decision support tool that identifies the potential degree of conflict associated with oil and gas production activities for existing lease tracts in the coastal margin of Texas.
View Article and Find Full Text PDF