Background: We participated in three of the protein-protein interaction subtasks of the Second BioCreative Challenge: classification of abstracts relevant for protein-protein interaction (interaction article subtask [IAS]), discovery of protein pairs (interaction pair subtask [IPS]), and identification of text passages characterizing protein interaction (interaction sentences subtask [ISS]) in full-text documents. We approached the abstract classification task with a novel, lightweight linear model inspired by spam detection techniques, as well as an uncertainty-based integration scheme. We also used a support vector machine and singular value decomposition on the same features for comparison purposes. Our approach to the full-text subtasks (protein pair and passage identification) includes a feature expansion method based on word proximity networks.

Results: Our approach to the abstract classification task (IAS) was among the top submissions for this task on the performance measures used in the challenge evaluation (accuracy, F-score, and area under the receiver operating characteristic curve). We also report on a web tool that we produced using our approach: the Protein Interaction Abstract Relevance Evaluator (PIARE). Our approach to the full-text tasks achieved one of the highest recall rates and one of the highest mean reciprocal ranks of correct passages.

Conclusion: Our approach to abstract classification shows that a simple linear model, using relatively few features, can generalize and uncover the conceptual nature of protein-protein interactions from the bibliome. Because the novel approach is based on a rather lightweight linear model, it can easily be ported and applied to similar problems. In full-text problems, the expansion of word features with word proximity networks is shown to be useful, although the need for some improvements is discussed.
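To make the idea of a lightweight linear classifier over relatively few word features concrete, here is a minimal sketch in Python. It is not the model used in the challenge submission. The scoring rule (difference in document frequency between relevant and irrelevant abstracts), the function names `tokenize`, `word_scores`, and `classify`, and the `top_k` and `threshold` parameters are all assumptions chosen for illustration.

```python
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, keep purely alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha()]


def word_scores(pos_docs, neg_docs, top_k=5):
    """Score words by how much more often they occur in relevant abstracts.

    Hypothetical scoring rule: fraction of positive documents containing the
    word minus fraction of negative documents containing it. Only the top_k
    words with the largest absolute score are kept as features.
    """
    pos_df = Counter(w for d in pos_docs for w in set(tokenize(d)))
    neg_df = Counter(w for d in neg_docs for w in set(tokenize(d)))
    vocab = set(pos_df) | set(neg_df)
    scores = {
        w: pos_df[w] / len(pos_docs) - neg_df[w] / len(neg_docs)
        for w in vocab
    }
    top = sorted(scores, key=lambda w: abs(scores[w]), reverse=True)[:top_k]
    return {w: scores[w] for w in top}


def classify(text, scores, threshold=0.0):
    """Linear decision: sum the scores of the distinct words present."""
    total = sum(scores.get(w, 0.0) for w in set(tokenize(text)))
    return total > threshold
```

Because the decision is a sum of per-word weights compared against a threshold, a model of this kind is easy to inspect and to port to other text classification problems, which is the property the conclusion emphasizes.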


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2559982
DOI: http://dx.doi.org/10.1186/gb-2008-9-s2-s11


