Stud Health Technol Inform
August 2024
We applied natural language processing (NLP) to a corpus extracted from 4 hours of expert panel discussion transcripts to determine the sustainability of a Stage II-III clinical trial of online social support interventions for Hispanic and African American dementia caregivers. Prominent topics included Technology/hard to reach populations, Training younger populations, Building trust, Privacy and security issues, Simplification of screening questions and recruitment procedures, Understanding participants' needs, Planning strategies and logistics, Potential recruitment places, Adjusting intervention size downwards to engage elderly participants, Targeting different generations, Internet-based interventions by age range, and Providing step-by-step instructions and an overview of the entire research process during recruitment. The application of NLP to qualitative data on a dementia caregiving clinical trial provides useful insights for recruitment, retention, and adherence to guidelines for such interventions serving Hispanic and African American dementia caregivers.
View Article and Find Full Text PDFWe applied natural language processing and topic modeling to publicly available abstracts and titles of 263 papers in the scientific literature mentioning AI and demographics (corpus 1 before Covid-19, corpus 2 after Covid-19) extracted from the MEDLINE database. We found exponential growth of AI studies mentioning demographics since the pandemic (Before Covid-19: N= 40 vs. After Covid-19: N= 223) [forecast model equation: ln(Number of Records) = 250.
View Article and Find Full Text PDFWe compared emotional valence scores as determined via machine learning approaches to human-coded scores of direct messages on Twitter from our 2,301 followers during a Twitter-based clinical trial screening for Hispanic and African American family caregivers of persons with dementia. We manually assigned emotional valence scores to 249 randomly selected direct Twitter messages from our followers (N=2,301), then we applied three machine learning sentiment analysis algorithms to extract emotional valence scores for each message and compared their mean scores to the human coding results. The aggregated mean emotional scores from the natural language processing were slightly positive, while the mean score from human coding as a gold standard was negative.
View Article and Find Full Text PDFWe applied social network analysis to compare Hispanic and Black dementia caregiving networks on Twitter that were established as part of a clinical trial from January 12, 2022, to October 31, 2022. We extracted Twitter data from our caregiver support communities (N=1980 followers, 811 enrollees) via the Twitter API and used social network analysis software to compare friend/follower interactions within each Hispanic and Black caregiving network. Analysis of the social networks revealed that enrolled family caregivers without prior social media competency had overall low connectedness compared to both enrolled and non-enrolled caregivers with social media competency, who were more integrated into the communities that developed through the clinical trial, partly due to their ties to external dementia caregiving groups.
View Article and Find Full Text PDFWe applied mixed-methods to refine our first version of the Twitter message library (English 400, translated into Spanish 400) for African Americans and Hispanic family caregivers for a person with dementia. We conducted a series of expert panels to collect quantitative and qualitative data using surveys and in-depth interviews. Using mixed methods to ensure unbiased results, the panelists first independently scored them (1 message/5 panelist) on a scale of 1 to 4 (1: lowest, 4: highest), followed by in-depth interviews and group discussions.
View Article and Find Full Text PDFWe randomly extracted Korean-language Tweets mentioning dementia/Alzheimer's disease (n= 12,413) from November 28 to December 9, 2020. We independently applied three machine learning algorithms (Afinn, Syuzhet, and Bing) using natural language processing (NLP) techniques and qualitative manual scoring to assign emotional valence scores to Tweets. We then compared the means and distributions of the four emotional valence scores.
View Article and Find Full Text PDFWe randomly examined Korean-language Tweets mentioning dementia/Alzheimer's disease (n= 12,413) posted from November 28 to December 9, 2020, without limiting geographical locations. We independently applied Latent Dirichlet Allocation (LDA) topic modeling and qualitative content analysis to the texts of the Tweets. We compared the themes extracted by LDA topic modeling to those identified via manual coding methods.
View Article and Find Full Text PDFWe applied social network analysis (SNA) on Tweets to compare Hispanic and Black dementia caregiving networks. We randomly extracted Tweets mentioning dementia caregiving and related terms from corpora collected daily via the Twitter API from September 1 to December 31, 2019 (initial corpus: n = 2,742,539 Tweets, random sample n = 549,380 English Tweets, n= 185,684 Spanish Tweets). After removing bot-generated Tweets, we first applied a lexicon-based demographic inference algorithm to automatically identify Tweets likely authored by Black and Hispanic individuals using Python (n = 114,511 English, n = 1,185 Spanish).
View Article and Find Full Text PDFWe randomly extracted Tweets mentioning dementia/Alzheimer's caregiving-related terms (n= 58,094) from Aug 23, 2019, to Sep 14, 2020, via an API. We applied a clustering algorithm and natural language processing (NLP) to publicly available English Tweets to detect topics and sentiment. We compared emotional valence scores of Tweets from before (through the end of 2019) and after the beginning of the COVID-19 pandemic (2020-).
View Article and Find Full Text PDFWe interviewed six clinicians to learn about their lived experience using electronic health records (EHR, Allscripts users) using a semi-structured interview guide in an academic medical center in New York City from October to November 2016. Each participant interview lasted approximately one to two hours. We applied a clustering algorithm to the interview transcript to detect topics, applying natural language processing (NLP).
View Article and Find Full Text PDFWe extracted 3,291,101 Tweets using hashtags associated with African American-related discourse (#BlackTwitter, #BlackLivesMatter, #StayWoke) and 1,382,441 Tweets from a control set (general or no hashtags) from September 1, 2019 to December 31, 2019 using the Twitter API. We also extracted a literary historical corpus of 14,692 poems and prose writings by African American authors and 66,083 items authored by others as a control, including poems, plays, short stories, novels and essays, using a cloud-based machine learning platform (Amazon SageMaker) via ProQuest TDM Studio. Lastly, we combined statistics from log likelihood and Fisher's exact tests as well as feature analysis of a batch-trained Naive Bayes classifier to select lexicons of terms most strongly associated with the target or control texts.
View Article and Find Full Text PDFImportance: Adults who belong to racial/ethnic minority groups are more likely than White adults to receive a diagnosis of chronic disease in the United States.
Objective: To evaluate which health indicators have improved or become worse among Black and Hispanic middle-aged and older adults since the Minority Health and Health Disparities Research and Education Act of 2000.
Design, Setting, And Participants: In this repeated cross-sectional study, a total of 4 856 326 records were extracted from the Behavioral Risk Factor Surveillance System from January 1999 through December 2018 of persons who self-identified as Black (non-Hispanic), Hispanic (non-White), or White and who were 45 years or older.
We applied artificial intelligence techniques to build correlate models that predict general poor health in a national sample of caregivers with mild cognitive impairment (MCI). Our application of deep learning identified age, duration of caregiving, amount of alcohol intake, weight, myocardial infarction (MI) and frequency of MCI symptoms for Blacks and Hispanics whereas frequency of MCI symptoms, income, weight, coronary heart disease (CHD), age, and use of e-cigarette for the others as the strongest correlates of poor health among 81 variables entered. The application of artificial intelligence efficiently provided intervention strategies for Black and Hispanic caregivers with MCI.
View Article and Find Full Text PDFWe randomly extracted publicly available Tweets mentioning COVID-19 related terms (n=2,558,474 Tweets) from Tweet corpora collected daily using an API from Jan 21st to May 3rd, 2020. We applied a clustering algorithm to publicly available Tweets authored by African Americans (n=1,763) to detect topics and sentiment applying natural language processing (NLP). We visualized fifteen topics (four themes) using network diagrams (Newman modularity 0.
View Article and Find Full Text PDFWe applied social network analysis (SNA) to Tweets mentioning cannabis or opioid-related terms to publicly available COVID-19 related Tweets collected from Jan 21st to May 3rd, 2020 (n= 2,558,474 Tweets). We randomly extracted 16,154 Tweets mentioning cannabis and 4,670 Tweets mentioning opioids from the COVID-19 Tweet corpora for our analysis. The cannabis related Tweets created by 6,144 users were disseminated to 280,042,783 users and retweeted 11 times the number of original messages while opioid-related Tweets created by 3,412 users were disseminated to smaller number of users.
View Article and Find Full Text PDFBackground: HIV/AIDS is a tremendous public health crisis, with a call for its eradication by 2030. A human rights response through civil society engagement is critical to support and sustain HIV eradication efforts. However, ongoing civil engagement is a challenge.
View Article and Find Full Text PDFWe present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process.
View Article and Find Full Text PDF