Background: Data collection and cleaning procedures to exclude bot-generated responses are used to maintain the data integrity of samples from online surveys. However, these procedures may be time-consuming and difficult to implement. Thus, we aim to evaluate the validity of a single-step geolocation algorithm for recruiting eligible gay, bisexual, and men who have sex with men in Philadelphia for an online study.

Methods: We used a 4-step approach, based on common practices for evaluating bot-generated and fraudulent responses, to assess the validity of participants' Qualtrics survey data as our referent standard. We then compared it to Qualtrics' single-step geolocation algorithm that used the MaxMind commercial database to map participants' Internet protocol address to their approximate location. We calculated the sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of the single-step geolocation approach relative to the 4-step approach.

Results: There were 826 respondents who completed the survey and 440 (53%) were eligible for enrollment based on the 4-step approach. The single-step geolocation approach yielded a sensitivity of 91% (95% CI = 88%, 93%), specificity of 79% (95% CI = 74%, 83%), PPV of 83% (95% CI = 80%, 86%), and NPV of 88% (95% CI = 85%, 91%).

Conclusions: Geolocation alone provided a moderately high level of agreement with the 4-step approach for identifying geographically eligible participants in the online sample, but both approaches may be subject to additional misclassification. Researchers may want to consider multiple procedures to ensure data integrity in online samples.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10316145PMC
http://dx.doi.org/10.1097/EDE.0000000000001607DOI Listing

Publication Analysis

Top Keywords

single-step geolocation
16
4-step approach
12
gay bisexual
8
bisexual men
8
men sex
8
sex men
8
men philadelphia
8
data integrity
8
geolocation algorithm
8
geolocation approach
8

Similar Publications

Background: Data collection and cleaning procedures to exclude bot-generated responses are used to maintain the data integrity of samples from online surveys. However, these procedures may be time-consuming and difficult to implement. Thus, we aim to evaluate the validity of a single-step geolocation algorithm for recruiting eligible gay, bisexual, and men who have sex with men in Philadelphia for an online study.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!