Publications by Elon Portugaly

Hidden Markov model speed heuristic and iterative HMM search procedure.

L Steven Johnson, Sean R Eddy, Elon Portugaly

BMC Bioinformatics

August 2010

Background: Profile hidden Markov models (profile-HMMs) are sensitive tools for remote protein homology detection, but the main scoring algorithms, Viterbi or Forward, require considerable time to search large sequence databases.

Results: We have designed a series of database filtering steps, HMMERHEAD, that are applied prior to the scoring algorithms, as implemented in the HMMER package, in an effort to reduce search time. Using this heuristic, we obtain a 20-fold decrease in Forward and a 6-fold decrease in Viterbi search time with a minimal loss in sensitivity relative to the unfiltered approaches.

Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space.

Yaniv Loewenstein, Elon Portugaly, Menachem Fromer, Michal Linial

Bioinformatics

July 2008

Motivation: UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets.

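To make the memory requirement concrete, here is a minimal, naive UPGMA sketch in Python. It is an illustration only, not the algorithm proposed in the paper; the function name `upgma` and the list-of-lists input format are assumptions made for this example. It keeps every pairwise dissimilarity in memory, which is the O(n^2) cost the abstract describes.

```python
from itertools import combinations


def upgma(dist):
    """Naive UPGMA over a full symmetric dissimilarity matrix (list of lists).

    Returns merges as (cluster_a, cluster_b, distance) tuples. Leaves have
    ids 0..n-1; each merged cluster gets the next unused integer id.
    """
    n = len(dist)
    # Every pairwise dissimilarity is held in memory: O(n^2) space.
    d = {(i, j): dist[i][j] for i, j in combinations(range(n), 2)}
    size = {i: 1 for i in range(n)}
    merges = []
    next_id = n
    while len(size) > 1:
        # Pick the closest pair of active clusters (keys satisfy a < b).
        (a, b), distance = min(d.items(), key=lambda kv: kv[1])
        merges.append((a, b, distance))
        # Average-linkage update: size-weighted mean of distances to a and b.
        for c in size:
            if c in (a, b):
                continue
            dac = d.pop((min(a, c), max(a, c)))
            dbc = d.pop((min(b, c), max(b, c)))
            d[(c, next_id)] = (size[a] * dac + size[b] * dbc) / (size[a] + size[b])
        del d[(a, b)]
        size[next_id] = size.pop(a) + size.pop(b)
        next_id += 1
    return merges
```

For n items the dictionary `d` starts with n(n-1)/2 entries, so a dataset of a few million sequences would need trillions of stored dissimilarities under this naive scheme.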

EVEREST: a collection of evolutionary conserved protein domains.

Elon Portugaly, Nathan Linial, Michal Linial

Nucleic Acids Res

January 2007

Protein domains are subunits of proteins that recur throughout the protein world. There are many definitions attempting to capture the essence of a protein domain, and several systems that identify protein domains and classify them into families. EVEREST, recently described in Portugaly et al.

EVEREST: automatic identification and classification of protein domains in all protein sequences.

Elon Portugaly, Amir Harel, Nathan Linial, Michal Linial

BMC Bioinformatics

June 2006

Background: Proteins are composed of one or several building blocks, known as domains. Such domains can be classified into families according to their evolutionary origin. Whereas sequencing technologies have advanced immensely in recent years, there are no matching computational methodologies for large-scale determination of protein domains and their boundaries.

ProtoNet 4.0: a hierarchical classification of one million protein sequences.

Noam Kaplan, Ori Sasson, Uri Inbar, Moriah Friedlich, Menachem Fromer, Elon Portugaly

Nucleic Acids Res

January 2005

ProtoNet is an automatic hierarchical classification of the protein sequence space. In 2004, the ProtoNet (version 4.0) presents the analysis of over one million proteins merged from SwissProt and TrEMBL databases.

ProtoNet: hierarchical classification of the protein space.

Ori Sasson, Avishay Vaaknin, Hillel Fleischer, Elon Portugaly, Yonatan Bilu

Nucleic Acids Res

January 2003

The ProtoNet site provides an automatic hierarchical clustering of the SWISS-PROT protein database. The clustering is based on an all-against-all BLAST similarity search. The similarities' E-score is used to perform a continuous bottom-up clustering process by applying alternative rules for merging clusters.

Selecting targets for structural determination by navigating in a graph of protein families.

Elon Portugaly, Ilona Kifer, Michal Linial

Bioinformatics

July 2002

Motivation: A major goal in structural genomics is to enrich the catalogue of proteins whose 3D structures are known. In an attempt to address this problem we mapped over 10 000 proteins with solved structures onto a graph of all Swissprot protein sequences (release 36, approximately 73 000 proteins) provided by ProtoMap, with the goal of sorting proteins according to their likelihood of belonging to new superfamilies. We hypothesized that proteins within neighbouring clusters tend to share common structural superfamilies or folds.
