Background: Qualitative methods are incredibly beneficial to the dissemination and implementation of new digital health interventions; however, these methods can be time intensive and slow down dissemination when timely knowledge from the data sources is needed in ever-changing health systems. Recent advancements in generative artificial intelligence (GenAI) and their underlying large language models (LLMs) may provide a promising opportunity to expedite the qualitative analysis of textual data, but their efficacy and reliability remain unknown.
Objective: The primary objectives of our study were to evaluate the consistency in themes, reliability of coding, and time needed for inductive and deductive thematic analyses between GenAI (ie, ChatGPT and Bard) and human coders.
Methods: The qualitative data for this study consisted of 40 brief SMS text message reminder prompts used in a digital health intervention for promoting antiretroviral medication adherence among people with HIV who use methamphetamine. Inductive and deductive thematic analyses of these SMS text messages were conducted by 2 independent teams of human coders. An independent human analyst conducted analyses following both approaches using ChatGPT and Bard. The consistency in themes (or the extent to which the themes were the same) and reliability (or agreement in coding of themes) between methods were compared.
Results: The themes generated by GenAI (both ChatGPT and Bard) were consistent with 71% (5/7) of the themes identified by human analysts following inductive thematic analysis. The consistency in themes was lower between humans and GenAI following a deductive thematic analysis procedure (ChatGPT: 6/12, 50%; Bard: 7/12, 58%). The percentage agreement (or intercoder reliability) for these congruent themes between human coders and GenAI ranged from fair to moderate (ChatGPT, inductive: 31/66, 47%; ChatGPT, deductive: 22/59, 37%; Bard, inductive: 20/54, 37%; Bard, deductive: 21/58, 36%). In general, ChatGPT and Bard performed similarly to each other across both types of qualitative analyses in terms of consistency of themes (inductive: 6/6, 100%; deductive: 5/6, 83%) and reliability of coding (inductive: 23/62, 37%; deductive: 22/47, 47%). On average, GenAI required significantly less overall time than human coders when conducting qualitative analysis (20, SD 3.5 min vs 567, SD 106.5 min).
Conclusions: The promising consistency in the themes generated by human coders and GenAI suggests that these technologies hold promise in reducing the resource intensiveness of qualitative thematic analysis; however, the relatively lower reliability in coding between them suggests that hybrid approaches are necessary. Human coders appeared to be better than GenAI at identifying nuanced and interpretative themes. Future studies should consider how these powerful technologies can be best used in collaboration with human coders to improve the efficiency of qualitative research in hybrid approaches while also mitigating potential ethical risks that they may pose.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11329846 | PMC |
http://dx.doi.org/10.2196/54482 | DOI Listing |
PLoS One
January 2025
Department of English and Communication, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China.
This study aims to provide an LLM (Large Language Model)-based method for the discourse analysis of media attitudes, and thereby investigate media attitudes towards China in a Hong Kong-based newspaper. Analysis of attitudes in large amounts of media data is crucial for understanding public opinions, market trends, social dynamics, etc. However, corpus-based approaches have traditionally focused on explicit linguistic expressions of attitudes, leaving implicit expressions unconsidered.
View Article and Find Full Text PDFDev Psychol
January 2025
Department of Global Health and Population, Harvard T.H. Chan School of Public Health, Harvard University.
A strong body of evidence has underscored the cross-cultural importance of nurturing parent-child relationships for promoting early child development outcomes. However, most research on parenting has predominantly relied on self-reported measures collected from mothers. Observational tools for assessing parent-child interactions from not only mothers but also fathers remains limited, especially in Majority World contexts.
View Article and Find Full Text PDFBMJ Open
January 2025
Adult and Child Center for Health Outcomes Research and Delivery Science (ACCORDS), University of Colorado School of Medicine, Aurora, Colorado, USA.
Introduction: The ability of healthcare, community and public health systems to effectively implement and disseminate research innovations depends on contextual factors at multiple interconnected levels of influence (eg, the innovation, individual, provider/implementor, organisation and health system). Recently, there has been an increase in the development of complex interventions designed to target multiple levels, designed for or adapted to the context in which they are delivered. Two concepts from complex systems thinking have been increasingly used to operationalise such interventions-core functions (theory and evidence-driven purposes of interventions) and forms (adaptable activities that perform each core function).
View Article and Find Full Text PDFJMIR Form Res
January 2025
Department of Health Administration, The College of Health Professions, Central Michigan University, Mt Pleasant, MI, United States.
Sensors (Basel)
December 2024
Department of Psychiatry, Penn Center for Mental Health, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
Many children on the autism spectrum engage in challenging behaviors, like aggression, due to difficulties communicating and regulating their stress. Identifying effective intervention strategies is often subjective and time-consuming. Utilizing unobservable internal physiological data to predict strategy effectiveness may help simplify this process for teachers and parents.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!