The CSDB Linear notation for carbohydrate sequences utilized in the Carbohydrate Structure Database (CSDB) has been improved to meet modern requirements in glycoinformatics. The new features include: the possibility to combine repeating and nonrepeating moieties in one structure; support of carbon-carbon bonds; and usage of SMILES encodings for unambiguous chemical description of glycan structures, including aglycons and atypical components. The new capabilities of CSDB Linear, together with the older ones, allow efficient detection of errors in CSDB and, at the same time, ensure the absence of informatic problems common for human-readable notations. The CSDB Linear implementation provides translation to other carbohydrate notations and multiple procedures for content error checking.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1021/acs.jcim.9b00744 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!