In the past decade, macrocyclic peptides gained increasing interest as a new therapeutic modality to tackle intracellular and extracellular therapeutic targets that had been previously classified as "undruggable". Several technological advances have made discovering macrocyclic peptides against these targets possible: 1) the inclusion of noncanonical amino acids (NCAAs) into mRNA display, 2) increased availability of next generation sequencing (NGS), and 3) improvements in rapid peptide synthesis platforms. This type of directed-evolution based screening can produce large numbers of potential hit sequences given that DNA sequencing is the functional output of this platform.
View Article and Find Full Text PDFIn drug discovery, partition and distribution coefficients, logP and logD for octanol/water, are widely used as metrics of the lipophilicity of molecules, which in turn have a strong influence on the bioactivity and bioavailability of potential drugs. There are a variety of established methods, mostly fragment or atom-based, to calculate logP while logD prediction generally relies on calculated logP and pKa for the estimation of neutral and ionized populations at a given pH. Algorithms such as ClogP have limitations generally leading to systematic errors for chemically related molecules while pKa estimation is generally more difficult due to the interplay of electronic, inductive and conjugation effects for ionizable moieties.
View Article and Find Full Text PDFBackground: Analyzing files containing chemical information is at the core of cheminformatics. Each analysis may require a unique workflow. This paper describes the chemalot and chemalot_knime open source packages.
View Article and Find Full Text PDFStructure- and property-based drug design is an integral part of modern drug discovery, enabling the design of compounds aimed at improving potency and selectivity. However, building molecules using desktop modeling tools can easily lead to poor designs that appear to form many favorable interactions with the protein's active site. Although a proposed molecule looks good on screen and appears to fit into the protein site X-ray crystal structure or pharmacophore model, doing so might require a high-energy small molecule conformation, which would likely be inactive.
View Article and Find Full Text PDFBackground: After performing a fragment based screen the resulting hits need to be prioritized for follow-up structure elucidation and chemistry. This paper describes a new similarity metric, Atom-Atom-Path (AAP) similarity that is used in conjunction with the Directed Sphere Exclusion (DISE) clustering method to effectively organize and prioritize the fragment hits. The AAP similarity rewards common substructures and recognizes minimal structure differences.
View Article and Find Full Text PDFUsing data from the in vitro liver microsomes metabolic stability assay, we have developed QSAR models to predict in vitro human clearance. Models were trained using in house high-throughput assay data reported as the predicted human hepatic clearance by liver microsomes or pCLh. Machine learning regression methods were used to generate the models.
View Article and Find Full Text PDFJ Chem Inf Model
February 2012
Automated registration of compounds from external sources is necessitated by the numerous compound acquisitions from vendors and by the increasing number of collaborations with external partners. A prerequisite for automating compound registration is a robust module for determining the structural novelty of the input structures. Any such tool needs to be able to take uncertainty about stereochemistry into account and to identify tautomeric forms of the same compound.
View Article and Find Full Text PDFTo minimize the risk of failure in clinical trials, drug discovery teams must propose active and selective clinical candidates with good physicochemical properties. An additional challenge is that today drug discovery is often conducted by teams at different geographical locations. To improve the collaborative decision making on which compounds to synthesize, we have implemented DEGAS, an application which enables scientists from Genentech and from collaborating external partners to instantly access the same data.
View Article and Find Full Text PDFRelational databases are the current standard for storing and retrieving data in the pharmaceutical and biotech industries. However, retrieving data from a relational database requires specialized knowledge of the database schema and of the SQL query language. At Anadys, we have developed an easy-to-use system for searching and reporting data in a relational database to support our drug discovery project teams.
View Article and Find Full Text PDFWhile established pharmaceutical companies have chemical information systems in place to manage their compounds and the associated data, new startup companies need to implement these systems from scratch. Decisions made early in the design phase usually have long lasting effects on the expandability, maintenance effort, and costs associated with the information management system. Careful analysis of work and data flows, both inter- and intradepartmental, and identification of existing dependencies between activities are important.
View Article and Find Full Text PDFThe Sphere Exclusion algorithm is a well-known algorithm used to select diverse subsets from chemical-compound libraries or collections. It can be applied with any given distance measure between two structures. It is popular because of the intuitive geometrical interpretation of the method and its good performance on large data sets.
View Article and Find Full Text PDF