A Hybrid Human-Computer Approach to the Extraction of Scientific Facts from the Literature.

Roselyne B Tchoua, Kyle Chard, Debra Audus, Jian Qin, Juan de Pablo, Ian Foster

Procedia Comput Sci

Department of Computer Science, The University of Chicago, Chicago, IL, USA.

Published: June 2016

A wealth of valuable data is locked within the millions of research articles published each year. Reading and extracting pertinent information from those articles has become an unmanageable task for scientists. This problem hinders scientific progress by making it hard to build on results buried in literature. Moreover, these data are loosely structured, encoded in manuscripts of various formats, embedded in different content types, and are, in general, not machine accessible. We present a hybrid human-computer solution for semi-automatically extracting scientific facts from literature. This solution combines an automated discovery, download, and extraction phase with a semi-expert crowd assembled from students to extract specific scientific facts. To evaluate our approach we apply it to a challenging molecular engineering scenario, extraction of a polymer property: the Flory-Huggins interaction parameter. We demonstrate useful contributions to a comprehensive database of polymer properties.

Full text (PMC): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5482373
DOI: http://dx.doi.org/10.1016/j.procs.2016.05.338
