CD-HIT Suite: a web server for clustering and comparing biological sequences.

Bioinformatics

California Institute for Telecommunications and Information Technology, University of California San Diego, La Jolla, CA, USA.

Published: March 2010

Summary: CD-HIT is a widely used program for clustering and comparing large biological sequence datasets. To further assist CD-HIT users, we significantly improved the program with more functions and better accuracy, scalability and flexibility. Most importantly, we developed a new web server, CD-HIT Suite, which clusters a user-uploaded sequence dataset or compares it to another dataset at different identity levels. Users can now explore the resulting clusters interactively in a web browser. We also provide downloadable clusters for several public databases (NCBI NR, Swissprot and PDB) at different identity levels.

Availability: Free access at http://cd-hit.org

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2828112
DOI: http://dx.doi.org/10.1093/bioinformatics/btq003

