Publications by authors named "Daniel Berleant"

Introduction: Data and information quality play a critical role in the managed healthcare sector, where accurate and reliable information is crucial for optimal decision-making, operations, and patient outcomes. However, managed care organizations face significant challenges in ensuring information quality due to the complexity of data sources, regulatory requirements, and the need for effective data management practices. The goal of this article is to develop and justify an information quality framework for managed healthcare, thereby enabling the sector to better meet its unique information quality challenges.

View Article and Find Full Text PDF

Background: The WikiHyperGlossary is an information literacy technology that was created to enhance reading comprehension of documents by connecting them to socially generated multimedia definitions as well as semantically relevant data. The WikiHyperGlossary enhances reading comprehension by using the lexicon of a discipline to generate dynamic links in a document to external resources that can provide implicit information the document did not explicitly provide. Currently, the most common method to acquire additional information when reading a document is to access a search engine and browse the web.

View Article and Find Full Text PDF

Background: We describe a method for extracting data about how biomolecule pairs interact from texts. This method relies on empirically determined characteristics of sentences. The characteristics are efficient to compute, making this approach to extraction of biomolecular interactions scalable.

View Article and Find Full Text PDF

We live in an age of access to more information than ever before. This can be a double-edged sword. Increased access to information allows for more informed and empowered researchers, while information overload becomes an increasingly serious risk.

View Article and Find Full Text PDF

Background: Analyzing global experimental data can be tedious and time-consuming. Thus, helping biologists see results as quickly and easily as possible can facilitate biological research, and is the purpose of the software we describe.

Results: We present BirdsEyeView, a software system for visualizing experimental transcriptomic data using different views that users can switch among and compare.

View Article and Find Full Text PDF

Background: Rapid growth in the scientific literature available on-line continues to motivate shifting data analysis from humans to computers. For example, greater knowledge of sentence characteristics indicative of interaction between two biological entities is needed to aid in the creation of better-performing information extraction tools for effectively using this rich body of information.

Findings: The Interaction Sentence Database (ISDB) allows users to retrieve sets of sentences fitting specified characteristics.

View Article and Find Full Text PDF

Motivation: The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not reported in the literature explicitly.

View Article and Find Full Text PDF

MEDLINE is one of the most important bibliographical information sources for biologists and medical workers. Its PubMed interface supports Boolean queries, which are potentially expressive and exact. However, PubMed is also designed to support simplicity of use at the expense of query expressiveness and exactness.

View Article and Find Full Text PDF

Unlabelled: MEDLINE/PubMed is one of the most important information sources for bioinformatics text mining. However, there remain limitations in working with MEDLINE/PubMed citations. For example, PubMed imposes an upper limit of 10,000 for downloading PMID list or citations; and MEDLINE files are too large for most off-the-shelf XML parsers.

View Article and Find Full Text PDF