Large-language models like ChatGPT have recently received a great deal of attention. To assess ChatGPT in the field of genetics, we compared its performance to human respondents in answering genetics questions (involving 13,636 responses) that had been posted on social media platforms starting in 2021. Overall, ChatGPT did not perform significantly differently than human respondents, but did significantly better on memorization-type questions versus critical thinking questions, frequently provided different answers when asked questions multiple times, and provided plausible explanations for both correct and incorrect answers.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9928145PMC
http://dx.doi.org/10.1101/2023.01.27.23285115DOI Listing

Publication Analysis

Top Keywords

genetics questions
8
human respondents
8
questions
5
analysis large-language
4
large-language model
4
model versus
4
versus human
4
human performance
4
performance genetics
4
questions large-language
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!