AI assistants, such as ChatGPT, are being increasingly used by students in higher education institutions. While these tools provide opportunities for improved teaching and education, they also pose significant challenges for assessment and learning outcomes. We conceptualize these challenges through the lens of vulnerability, the potential for university assessments and learning outcomes to be impacted by student use of generative AI. We investigate the potential scale of this vulnerability by measuring the degree to which AI assistants can complete assessment questions in standard university-level Science, Technology, Engineering, and Mathematics (STEM) courses. Specifically, we compile a dataset of textual assessment questions from 50 courses at the École polytechnique fédérale de Lausanne (EPFL) and evaluate whether two AI assistants, GPT-3.5 and GPT-4 can adequately answer these questions. We use eight prompting strategies to produce responses and find that GPT-4 answers an average of 65.8% of questions correctly, and can even produce the correct answer across at least one prompting strategy for 85.1% of questions. When grouping courses in our dataset by degree program, these systems already pass the nonproject assessments of large numbers of core courses in various degree programs, posing risks to higher education accreditation that will be amplified as these models improve. Our results call for revising program-level assessment design in higher education in light of advances in generative AI.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11626143 | PMC |
http://dx.doi.org/10.1073/pnas.2414955121 | DOI Listing |
Allergol Immunopathol (Madr)
January 2025
Regional Center for Allergy and Clinical Immunology, University Hospital "Dr. José Eleuterio González," Universidad Autónoma de Nuevo León, Monterrey, Nuevo León, Mexico;
Background: Atopic dermatitis (AD) is a disease of multifactorial etiology that affects the quality of life of those afflicted.
Objective: The degree of control of patients with AD over 12 years of age was determined with the Atopic Dermatitis Control Instrument (ADCT).
Material And Methods: This observational, cross-sectional, descriptive study included patients with AD who were evaluated with a self-administered instrument, the ADCT.
Psychol Trauma
January 2025
Paul Baerwald School of Social Work and Social Welfare, Hebrew University of Jerusalem.
Objective: The Adult Scale/National Stressful Events Survey Short Scale (NSESSS) is an emerging brief screening measure for the severity of Acute Stress Symptoms based on the of acute stress disorder (ASD). Scant information is known about the NSESSS's psychometric properties among different cultures or populations exposed to an ongoing trauma and displacement. Therefore, the present study aimed to (a) assess for the first time the psychometric properties and construct validity of the Hebrew version of NSESSS in an internally displaced population following the massacre in Israel on October 7, 2023; and (b) assess the possible risk and protective predictors of ASD according to sociodemographic characteristics, types of trauma exposure, absence of basic needs, and social support.
View Article and Find Full Text PDFPsychol Addict Behav
January 2025
Department of Psychiatry, Center for Behavior Genetics of Aging, University of California, San Diego.
Objective: Alcohol use is common in older adults and linked to poor health and aging outcomes. Studies have demonstrated genetic and environmental contributions to the quantity of alcohol consumption in mid-to-late life, but less is known about whether these influences are moderated by sociodemographic factors such as age, sex, and educational attainment. This study sought to better understand sociodemographic trends in alcohol consumption across the second half of the life course and their underlying genetic and environmental influences.
View Article and Find Full Text PDFPsychol Addict Behav
January 2025
Department of Psychology, Center on Alcohol, Substance use, And Addictions, University of New Mexico.
Objective: Community characteristics (e.g., alcohol access, poverty) are associated with alcohol use disorder (AUD) at the population level, and person-level AUD severity indicators (e.
View Article and Find Full Text PDFChildren's high and increasing levels of screen time are of growing concern to parents, health professionals, and researchers. With the growing availability and use of devices such as smartphones and tablets, it is important to understand the impact of children's screen use on development. Prospective longitudinal data from 6,281 children (48.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!