Accurate representation of structural ambiguity is important for storing carbohydrate structures containing varying levels of ambiguity in the literature and databases. Although many representations for carbohydrates have been developed in the past, a generalized but discrete representation format did not exist. We had previously developed the Web3 Unique Representation of Carbohydrate Structures (WURCS) in an attempt to define a generalizable and unique linear representation for carbohydrate structures. However, it lacked sufficient rules to uniquely describe ambiguous structures. In this work, we updated WURCS to handle such ambiguous monosaccharide structures. In particular, to handle structural ambiguity around (potential) carbonyl groups incidental to the carbohydrate analysis, we defined a representation of backbone carbons containing atomic-level ambiguity. As a result, we show that WURCS 2.0 can represent a wider variety of carbohydrate structures containing ambiguous monosaccharides, such as those whose ring closure is undefined or whose anomeric information is only known. This new format provides a representation of carbohydrates that was not possible before, and it is currently being used by the International Glycan Structure Repository GlyTouCan.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1021/acs.jcim.6b00650 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!