Analysis of large-language model versus human performance for genetics questions.

medRxiv

Published: January 2023

Large-language models like ChatGPT have recently received a great deal of attention. To assess ChatGPT in the field of genetics, we compared its performance to human respondents in answering genetics questions (involving 13,636 responses) that had been posted on social media platforms starting in 2021. Overall, ChatGPT did not perform significantly differently than human respondents, but did significantly better on memorization-type questions versus critical thinking questions, frequently provided different answers when asked questions multiple times, and provided plausible explanations for both correct and incorrect answers.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9928145	PMC
http://dx.doi.org/10.1101/2023.01.27.23285115	DOI Listing

Publication Analysis

Top Keywords

genetics questions

human respondents

questions

analysis large-language

large-language model

model versus

versus human

human performance

performance genetics

questions large-language

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!