Submission of full-length HLA allele sequences presents a unique challenge, both for high-throughput sequencing laboratories and smaller diagnostic laboratories. HLA's extensive polymorphism means that accurate representation and annotation of allele sequence is of critical importance, and curators of nucleotide databases must establish submission formats to ensure high-quality data and prevent ambiguities. The IPD-IMGT/HLA database is established as the standard repository for HLA sequences, and it is a major goal of the 17th International HLA and Immunogenetics Workshop to fill the IPD-IMGT/HLA database with full-length HLA sequences. The process of preparing sequence annotation and metadata is cumbersome and error prone, and it is desirable to create a straightforward and concise method of preparing sequence submissions. We introduce Saddlebags, a software tool for rapid generation of HLA (novel) full-length allele sequence submissions. HLA allele sequences are submitted first to EMBL European Nucleotide Archive (EMBL-ENA), and metadata is gathered for subsequent preparation of an IPD-IMGT/HLA formatted submission. Combining these steps into a pipeline reduces effort and minimizes errors for submitting laboratories. This software has been used by Maastricht University Medical Center Transplantation Immunology Laboratory to submit 79 novel alleles to EMBL-ENA, and the tool is freely available for the HLA community.

Download full-text PDF

Source
http://dx.doi.org/10.1111/tan.13179DOI Listing

Publication Analysis

Top Keywords

full-length hla
12
hla allele
12
allele sequences
12
saddlebags software
8
hla
8
allele sequence
8
ipd-imgt/hla database
8
hla sequences
8
preparing sequence
8
sequence submissions
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!