There is a great and growing need to ascertain what exactly is the state of a patient, in terms of disease progression, actual care practices, pathology, adverse events, and much more, beyond the paucity of data available in structured medical record data. Ascertaining these harder-to-reach data elements is now critical for the accurate phenotyping of complex traits, detection of adverse outcomes, efficacy of off-label drug use, and longitudinal patient surveillance. Clinical notes often contain the most detailed and relevant digital information about individual patients, the nuances of their diseases, the treatment strategies selected by physicians, and the resulting outcomes. However, notes remain largely unused for research because they contain Protected Health Information (PHI), which is synonymous with individually identifying data. Previous clinical note de-identification approaches have been rigid and still too inaccurate to see any substantial real-world use, primarily because they have been trained with too small medical text corpora. To build a new de-identification tool, we created the largest manually annotated clinical note corpus for PHI and develop a customizable open-source de-identification software called Philter ("Protected Health Information filter"). Here we describe the design and evaluation of Philter, and show how it offers substantial real-world improvements over prior methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156708PMC
http://dx.doi.org/10.1038/s41746-020-0258-yDOI Listing

Publication Analysis

Top Keywords

protected health
8
clinical notes
8
clinical note
8
substantial real-world
8
health filter
4
filter philter
4
philter accurately
4
accurately securely
4
securely de-identifying
4
de-identifying free-text
4

Similar Publications

Influencers are content creators who post online about their lives and can amass a significant following. Influencers can be dangerous by negatively affecting their followers' body image and marketing products in a deceptive way. The limited academic writings which consider influencer regulation note an incongruency between influencer conduct and the corresponding regulatory system.

View Article and Find Full Text PDF

Background: Cleft lip and/or palate is the most common congenital orofacial deformity, affecting 1/800 births. A thorough review of the literature has shown that children with cleft have poorer oral hygiene and dental health than other children, with higher levels of caries in both temporary and permanent teeth and poorer periodontal health. Cleft patients are treated by a multidisciplinary team that aims to provide comprehensive care from pre- or post-natal diagnosis to early adulthood and the end of growth.

View Article and Find Full Text PDF

Objective: The occurrence of hypofibrinogenemia after tocilizumab treatment has attracted increasing attention, which may cause bleeding and even life-threatening. This study aims to explore the risk factors for tocilizumab-induced hypofibrinogenemia (T-HFIB) and construct a risk prediction model.

Methods: A total of 221 inpatients that received tocilizumab from 2015 to 2023 were retrospectively collected and divided into T-HFIB group or control group.

View Article and Find Full Text PDF

Background: Hip morphology variations, particularly in femoral neck shaft angle (NSA) and iliac wing width (IWW), have been associated with gluteal tendinopathy. However, the biomechanical implications of these morphological differences on gluteal muscle function are not well understood. This study investigates how NSA and IWW influence gluteal muscle forces, moment arms, and estimated tendon loads during walking, aiming to provide insights into the potential biomechanical pathways that may contribute to altered lateral hip loading patterns.

View Article and Find Full Text PDF

Background: Patients with non-small cell lung cancer (NSCLC) are prone to developing brain metastases (BMs), particularly those with epidermal growth factor receptor (EGFR) mutations. In clinical practice, treatment-naïve EGFR-mutant NSCLC patients with asymptomatic BMs tend to choose EGFR-tyrosine kinase inhibitors (TKIs) as first-line therapy and defer intracranial radiotherapy (RT). However, the effectiveness of upfront intracranial RT remains unclear.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!