Precise phenotype information is needed to understand the effects of genetic and epigenetic changes on tumor behavior and responsiveness. Extraction and representation of cancer phenotypes is currently mostly performed manually, making it difficult to correlate phenotypic data to genomic data. In addition, genomic data are being produced at an increasingly faster pace, exacerbating the problem. The DeepPhe software enables automated extraction of detailed phenotype information from electronic medical records of cancer patients. The system implements advanced Natural Language Processing and knowledge engineering methods within a flexible modular architecture, and was evaluated using a manually annotated dataset of the University of Pittsburgh Medical Center breast cancer patients. The resulting platform provides critical and missing computational methods for computational phenotyping. Working in tandem with advanced analysis of high-throughput sequencing, these approaches will further accelerate the transition to precision cancer treatment. .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5690492PMC
http://dx.doi.org/10.1158/0008-5472.CAN-17-0615DOI Listing

Publication Analysis

Top Keywords

natural language
8
language processing
8
cancer phenotypes
8
genomic data
8
cancer patients
8
cancer
5
deepphe natural
4
processing system
4
system extracting
4
extracting cancer
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!