Background: Diabetes has become one of the hot topics in life science researches. To support the analytical procedures, researchers and analysts expend a mass of labor cost to collect experimental data, which is also error-prone. To reduce the cost and to ensure the data quality, there is a growing trend of extracting clinical events in form of knowledge from electronic medical records (EMRs). To do so, we first need a high-coverage knowledge base (KB) of a specific disease to support the above extraction tasks called KB-based Extraction.

Methods: We propose an approach to build a diabetes-centric knowledge base (a.k.a. DKB) via mining the Web. In particular, we first extract knowledge from semi-structured contents of vertical portals, fuse individual knowledge from each site, and further map them to a unified KB. The target DKB is then extracted from the overall KB based on a distance-based Expectation-Maximization (EM) algorithm.

Results: During the experiments, we selected eight popular vertical portals in China as data sources to construct DKB. There are 7703 instances and 96,041 edges in the final diabetes KB covering diseases, symptoms, western medicines, traditional Chinese medicines, examinations, departments, and body structures. The accuracy of DKB is 95.91%. Besides the quality assessment of extracted knowledge from vertical portals, we also carried out detailed experiments for evaluating the knowledge fusion performance as well as the convergence of the distance-based EM algorithm with positive results.

Conclusions: In this paper, we introduced an approach to constructing DKB. A knowledge extraction and fusion pipeline was first used to extract semi-structured data from vertical portals and individual KBs were further fused into a unified knowledge base. After that, we develop a distance based Expectation Maximization algorithm to extract a subset from the overall knowledge base forming the target DKB. Experiments showed that the data in DKB are rich and of high-quality.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454670PMC
http://dx.doi.org/10.1186/s12911-019-0771-6DOI Listing

Publication Analysis

Top Keywords

knowledge base
20
vertical portals
16
knowledge
11
mining web
8
target dkb
8
dkb
7
base
5
data
5
building diabetes
4
diabetes centric
4

Similar Publications

Purpose: Speech-language Therapists (SLTs) are specialists in communication, feeding and swallowing as core members of the paediatric tracheostomy multidisciplinary team (MDT). Inconsistent tracheostomy care leads to staff and family frustration and delayed intervention. Little is known about international SLT tracheostomy practices.

View Article and Find Full Text PDF

Boriranes, highly strained three-membered cyclic organoboron heterocycles, have emerged as potential synthons for the synthesis of many organoboron species. However, the synthesis of boriranes with tricoordinate, sp-hybridised boron and tetracoordinate, sp-hybridised carbon atoms is very challenging owing to their high Lewis acidity. Herein we describe the isolation of base-free triaminoboriranes from the room-temperature reaction of diaminoalkynes with an aminodistannylborane.

View Article and Find Full Text PDF

Prostate cancer presents a major health issue, with its progression influenced by intricate molecular factors. Notably, the interplay between miRNAs and changes in transcriptomic patterns is not fully understood. Our study seeks to bridge this knowledge gap, employing computational techniques to explore how miRNAs and transcriptomic alterations jointly regulate the development of prostate cancer.

View Article and Find Full Text PDF

Purpose: The spine research within India has seen significant advancement, yet detailed examinations of its impact and evolution still need to be made sparse. To conduct a comprehensive scientometric review of the most frequently cited papers in Indian spine research from 1995 to 2024, aiming to map the field's evolution and its global impact.

Methods: Utilizing the Scopus database, a search was performed with keywords related to spine research, identifying 105 highly cited papers.

View Article and Find Full Text PDF

Contribution of rat insular cortex to stimulus-guided action.

J Neurosci

January 2025

Univ. Bordeaux, CNRS, INCIA, UMR 5287, F-33000 Bordeaux, France

Anticipating rewards is fundamental for decision-making. Animals often use cues to assess reward availability and to make predictions about future outcomes. The gustatory region of the insular cortex (IC), the so-called gustatory cortex, has a well-established role in the representation of predictive cues, such that IC neurons encode both a general form of outcome expectation as well as anticipatory outcome-specific knowledge.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!