An annotated corpus with nanomedicine and pharmacokinetic parameters.

Int J Nanomedicine

Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.

Published: February 2018

A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration's Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5644562PMC
http://dx.doi.org/10.2147/IJN.S137117DOI Listing

Publication Analysis

Top Keywords

annotated corpus
8
food drug
8
entity extraction
8
annotated
5
nanomedicine
5
corpus nanomedicine
4
nanomedicine pharmacokinetic
4
pharmacokinetic parameters
4
parameters vast
4
vast amount
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!