EPGD: a comprehensive web resource for integrating and displaying eukaryotic paralog/paralogon information.

Nucleic Acids Res

Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yueyang Road, P. R. China.

Published: January 2008

Gene duplication is common in all three domains of life, especially in eukaryotic genomes. The duplicates provide new material for the action of evolutionary forces such as selection or genetic drift. Here we describe a sophisticated procedure to extract duplicated genes (paralogs) from 26 available eukaryotic genomes, to pre-calculate several evolutionary indexes (evolutionary rate, synonymous distance/clock, transition redundant exchange clock, etc.) based on the paralog family, and to identify block or segmental duplications (paralogons). We also constructed an internet-accessible Eukaryotic Paralog Group Database (EPGD; http://epgd.biosino.org/EPGD/). The database is gene-centered and organized by paralog family. It focuses on paralogs and evolutionary duplication events. The paralog families and paralogons can be searched by text or sequence, and are downloadable from the website as plain text files. The database will be very useful for both experimentalists and bioinformaticians interested in the study of duplication events or paralog families.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238967PMC
http://dx.doi.org/10.1093/nar/gkm924DOI Listing

Publication Analysis

Top Keywords

eukaryotic genomes
8
paralog family
8
duplication events
8
events paralog
8
paralog families
8
paralog
5
epgd comprehensive
4
comprehensive web
4
web resource
4
resource integrating
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!