Objective: To evaluate tokens commonly used by clinical research consortia to aggregate clinical data across institutions.
Methods: This study compares the entity resolution performance of tokens alone and of token-based matching algorithms against manual annotation for 20,002 record pairs extracted from the University of Texas Houston's clinical data warehouse (CDW).
Results: The highest precision achieved was 99.