The study provides a comprehensive review of OpenAI's Generative Pre-trained Transformer 4 (GPT-4) technical report, with an emphasis on applications in high-risk settings like healthcare. A diverse team, including experts in artificial intelligence (AI), natural language processing, public health, law, policy, social science, healthcare research, and bioethics, analyzed the report against established peer review guidelines. The GPT-4 report shows a significant commitment to transparent AI research, particularly in creating a systems card for risk assessment and mitigation. However, it reveals limitations such as restricted access to training data, inadequate confidence and uncertainty estimations, and concerns over privacy and intellectual property rights. Key strengths identified include the considerable time and economic investment in transparent AI research and the creation of a comprehensive systems card. On the other hand, the lack of clarity in training processes and data raises concerns about encoded biases and interests in GPT-4. The report also lacks confidence and uncertainty estimations, crucial in high-risk areas like healthcare, and fails to address potential privacy and intellectual property issues. Furthermore, this study emphasizes the need for diverse, global involvement in developing and evaluating large language models (LLMs) to ensure broad societal benefits and mitigate risks. The paper presents recommendations such as improving data transparency, developing accountability frameworks, establishing confidence standards for LLM outputs in high-risk settings, and enhancing industry research review processes. It concludes that while GPT-4's report is a step towards open discussions on LLMs, more extensive interdisciplinary reviews are essential for addressing bias, harm, and risk concerns, especially in high-risk domains. The review aims to expand the understanding of LLMs in general and highlights the need for new reflection forms on how LLMs are reviewed, the data required for effective evaluation, and addressing critical issues like bias and risk.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10795998 | PMC |
http://dx.doi.org/10.1371/journal.pdig.0000417 | DOI Listing |
Microsurgery
January 2025
Department of Plastic and Reconstructive Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Thinning of anterolateral thigh flap is challenging. Anatomical studies have shown variations in arterial branching patterns in the subcutaneous layer, which were suspected to be the reason for the high frequency of thinning failures. We attempted to visualize subcutaneous arterial courses preoperatively and perform thinning of perforator flaps using this information appropriately.
View Article and Find Full Text PDFAlzheimers Dement
December 2024
Research Program on Cognition and Neuromodulation-Based Interventions, University of Michigan, Ann Arbor, MI, USA.
Background: The computerized NIH Toolbox Cognition Battery (NIHTB-CB) was designed to assess cognitive functioning across the lifespan. Previous studies demonstrated that NIHTB-CB measures discriminate between healthy controls (HCs), individuals with amnestic mild cognitive impairment (aMCI), and individuals with dementia of the Alzheimer's type (DAT). Scores on NIHTB-CB tasks also correspond with performance on well-validated neuropsychological measures of the same cognitive domains.
View Article and Find Full Text PDFAlzheimers Dement
December 2024
The Graduate Center, CUNY, New York, NY, USA.
Background: Judgment is an aspect of executive functioning that is critical to many aspects of daily functioning, and often affected in older adults with cognitive decline. The Test of Practical Judgement (TOP-J) evaluates judgment related real-world issues that may arise in aging populations. The current study investigates the incremental validity of the TOP-J-i.
View Article and Find Full Text PDFFront Plant Sci
December 2024
Center for Precision and Automated Agricultural Systems, Department of Biological Systems Engineering, Washington State University, Prosser, WA, United States.
Molecular-based detection of pathogens from potato tubers hold promise, but the initial sample extraction process is labor-intensive. Developing a robotic tuber sampling system, equipped with a fast and precise machine vision technique to identify optimal sampling locations on a potato tuber, offers a viable solution. However, detecting sampling locations such as eyes and stolon scar is challenging due to variability in their appearance, size, and shape, along with soil adhering to the tubers.
View Article and Find Full Text PDFZhongguo Shi Yan Xue Ye Xue Za Zhi
December 2024
Department of Blood Transfusion, The Fifth Medical Center of PLA General Hospital, Beijing 100071, China.
Objective: To analyze the diagnostic value of IgG anti-A/anti-B antibody titer in the serum of type O pregnant women after absorption of IgG anti-AB antibody for ABO hemolytic disease of fetus and newborn (ABO-HDFN).
Methods: From February 2020 to September 2020, 235 samples of neonatal hemolytic disease whose mother's blood type O from Beijing Blood Center were selected. The titer of IgG anti-A/anti-B antibody in mother's serum before and after absorption of IgG anti -AB antibody was detected by microcolumn gel card, and the incidence of ABO-HDFN was statistically analyzed.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!