A list of 147 tetralin- and indan-like compounds was compiled from the literature for investigating the relationship between molecular structure and musk odor. Each compound in the data set was represented by 374 CODESSA and 970 TAE descriptors. A genetic algorithm (GA) for pattern recognition analysis was used to identify a subset of molecular descriptors that could differentiate musks from nonmusks in a plot of the two largest principal components (PCs) of the data. A PC map of the 110 compounds in the training set using 45 molecular descriptors identified by the pattern recognition GA revealed an asymmetric data structure. Tetralin and indan musks were found to occupy a small, but well-defined region of the PC (descriptor) space, with the nonmusks randomly distributed in the PC plot. A three-layer feed-forward neural network trained by back propagation was used to develop a discriminant that correctly classified all the compounds in the training set as musk or nonmusk. The neural network was successfully validated using an external prediction of 37 compounds.

Download full-text PDF

Source
http://dx.doi.org/10.1093/chemse/bjs058DOI Listing

Publication Analysis

Top Keywords

tetralin indan
8
indan musks
8
pattern recognition
8
molecular descriptors
8
compounds training
8
training set
8
neural network
8
odor-structure relationship
4
relationship studies
4
studies tetralin
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!