The Artificial Intelligence (AI) research community has used ad-hoc benchmarks to measure the "" level of Large Language Models (LLMs). In humans, intelligence is closely linked to the functional integrity of the prefrontal lobes, which are essential for higher-order cognitive processes. Previous research has found that LLMs struggle with cognitive tasks that rely on these prefrontal functions, highlighting a significant challenge in replicating human-like intelligence. In December 2022, OpenAI released ChatGPT, a new chatbot based on the GPT-3.5 model that quickly gained popularity for its impressive ability to understand and respond to human instructions, suggesting a significant step towards intelligent behaviour in AI. Therefore, to rigorously investigate LLMs' level of "," we evaluated the GPT-3.5 and GPT-4 versions through a neuropsychological assessment using tests in the Italian language routinely employed to assess prefrontal functioning in humans. The same tests were also administered to Claude2 and Llama2 to verify whether similar language models perform similarly in prefrontal tests. When using human performance as a reference, GPT-3.5 showed inhomogeneous results on prefrontal tests, with some tests well above average, others in the lower range, and others frankly impaired. Specifically, we have identified poor planning abilities and difficulty in recognising semantic absurdities and understanding others' intentions and mental states. Claude2 exhibited a similar pattern to GPT-3.5, while Llama2 performed poorly in almost all tests. These inconsistent profiles highlight how LLMs' emergent abilities do not yet mimic human cognitive functioning. The sole exception was GPT-4, which performed within the normative range for all the tasks except planning. Furthermore, we showed how standardised neuropsychological batteries developed to assess human cognitive functions may be suitable for challenging LLMs' performance.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11490853 | PMC |
http://dx.doi.org/10.1016/j.heliyon.2024.e38911 | DOI Listing |
Alzheimers Dement
December 2024
University of California, San Francisco, San Francisco, CA, USA.
Background: Prior research shows that caregiving for people living with dementia (PLWD) varies with cultural, institutional, and social structural context, but less is known about the role of context in dementias of different etiologies. We compared experiences of caregiving in frontal-temporal dementia (FTD) versus non-FTD dementias using community-based comparative ethnography. We expected to find differences in caring for people living with FTD (PLWFTD) versus people living with other dementias (PLWOD).
View Article and Find Full Text PDFAlzheimers Dement
December 2024
NYU Langone Health, New York, NY, USA.
Background: Large language models (LLMs) provide powerful natural language processing capabilities in medical and clinical tasks. Evaluating LLM performance is crucial due to potential false results. In this study, we assessed ChatGPT and Llama2, two state-of-the-art LLMs, in extracting information from clinical notes, focusing on cognitive tests, specifically the Mini Mental State Exam (MMSE) and Cognitive Dementia Rating (CDR).
View Article and Find Full Text PDFAlzheimers Dement
December 2024
Mayo Clinic, Rochester, MN, USA.
Background: Discussion surrounding the nomenclature of the "nonfluent/agrammatic" spectrum of progressive speech-language disorders has largely focused on the clinical-pathological and neuroimaging correlations, with some attention paid to the prognostication afforded by differentiating clinical phenotypes. Progressive apraxia of speech (AOS), with or without agrammatic aphasia, is generally associated with an underlying tauopathy; however, patients have offered a unique perspective on the importance of distinguishing between difficulties with speech and language that extends beyond pathological specificity. This study aimed to provide insight into the experience of patients with primary progressive AOS (PPAOS), with particular attention to their diagnostic journey.
View Article and Find Full Text PDFAlzheimers Dement
December 2024
University of Saint Joseph, West Hartford, CT, USA.
Background: Many individuals with health problems and/or disabilities are largely dependent on the help of an informal caregiver, most often a family member with whom they live (CDC Report, 2018). A recent report by the Alzheimer's Association (2023) found that, compared with caregivers of people without dementia, twice as many caregivers of those with dementia have reported significant emotional, financial, and physical difficulties. Despite the important role that caregivers have in our society, research on potential factors that may buffer the negative impacts of caregiving has been lacking.
View Article and Find Full Text PDFAlzheimers Dement
December 2024
University of Sheffield, Sheffield, United Kingdom.
Background: Marital status and living status are components of social isolation (SI), a modifiable factor thought to impact cognitive resilience, which has the potential to impact cognition throughout the course of Alzheimer's and related dementia (ADRD) diagnosis. Electronic health records (EHRs) offer access to large scale clinical data, capable of longitudinal analyses.
Method: Cognitive function measurement - Montreal Cognitive Assessment (MoCA) - data, demographic (including marital and living status as SI proxies) data and ADRD diagnosis data from patients aged 50+ years from Oxford Health NHS Foundation Trust (UK) were extracted using natural language processing algorithms from EHRs dated 1995 to 2022.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!