Multi-instance multi-label distance metric learning for genome-wide protein function prediction.

Comput Biol Chem

School of Software Engineering, South China University of Technology, Guangzhou 510006, China. Electronic address:

Published: August 2016

Multi-instance multi-label (MIML) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with not only multiple instances but also multiple class labels. To find an appropriate MIML learning method for genome-wide protein function prediction, many studies in the literature attempted to optimize objective functions in which dissimilarity between instances is measured using the Euclidean distance. But in many real applications, Euclidean distance may be unable to capture the intrinsic similarity/dissimilarity in feature space and label space. Unlike other previous approaches, in this paper, we propose to learn a multi-instance multi-label distance metric learning framework (MIMLDML) for genome-wide protein function prediction. Specifically, we learn a Mahalanobis distance to preserve and utilize the intrinsic geometric information of both feature space and label space for MIML learning. In addition, we try to deal with the sparsely labeled data by giving weight to the labeled data. Extensive experiments on seven real-world organisms covering the biological three-domain system (i.e., archaea, bacteria, and eukaryote; Woese et al., 1990) show that the MIMLDML algorithm is superior to most state-of-the-art MIML learning algorithms.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2016.02.011DOI Listing

Publication Analysis

Top Keywords

genome-wide protein
16
protein function
16
function prediction
16
miml learning
16
multi-instance multi-label
12
multi-label distance
8
distance metric
8
metric learning
8
euclidean distance
8
feature space
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!