Chemical reactions serve as foundational building blocks for organic chemistry and drug design. In the era of large AI models, data-driven approaches have emerged to innovate the design of novel reactions, optimize existing ones for higher yields, and discover new pathways for synthesizing chemical structures comprehensively. To effectively address these challenges with machine learning models, it is imperative to derive robust and informative representations or engage in feature engineering using extensive data sets of reactions. This work aims to provide a comprehensive review of established reaction featurization approaches, offering insights into the selection of representations and the design of features for a wide array of tasks. The advantages and limitations of employing SMILES, molecular fingerprints, molecular graphs, and physics-based properties are meticulously elaborated. Solutions to bridge the gap between different representations will also be critically evaluated. Additionally, we introduce a new frontier in chemical reaction pretraining, holding promise as an innovative yet unexplored avenue.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1021/acs.jcim.4c00004 | DOI Listing |
J Am Chem Soc
January 2025
Key Laboratory of Colloid and Interface Chemistry, Ministry of Education, School of Chemistry and Chemical Engineering, Shandong University, Jinan 250100, P. R. China.
Chirality epitomizes the sophistication of chemistry, representing some of its most remarkable achievements. Yet, the precise synthesis of chiral structures from achiral building blocks remains a profound and enduring challenge in synthetic chemistry and materials science. Here, we demonstrate that achiral colloidal nanocrystals, including Au and Ag nanocrystals, can assemble into long-range-ordered helical assemblies with the assistance of chiral molecules.
View Article and Find Full Text PDFPostgrad Med J
January 2025
Department of Pediatric Metabolic Diseases, University of Health Sciences, Ankara Etlik City Hospital, Ankara 06170, Turkey.
Metabolism is the name given to all of the chemical reactions in the cell involving thousands of proteins, including enzymes, receptors, and transporters. Inborn errors of metabolism (IEM) are caused by defects in the production and breakdown of proteins, fats, and carbohydrates. Micro ribonucleic acids (miRNAs) are short non-coding RNA molecules, ⁓19-25 nucleotides long, hairpin-shaped, produced from DNA.
View Article and Find Full Text PDFEnviron Sci Technol
January 2025
Environmental Engineering and Science, Department of Chemical and Environmental Engineering (ChEE), University of Cincinnati, Cincinnati, Ohio 45221, United States.
Frequent and severe occurrences of harmful algal blooms increasingly threaten human health by the release of microcystins (MCs). Urgent attention is directed toward managing MCs, as evidenced by rising HAB-related do not drink/do not boil advisories due to unsafe MC levels in drinking water. UV/chlorine treatment, in which UV light is applied simultaneously with chlorine, showed early promise for effectively degrading MC-LR to values below the World Health Organization's guideline limits.
View Article and Find Full Text PDFBeilstein J Org Chem
December 2024
College of Chemistry and Material Science, Guangdong University of Education, Guangzhou 510303, China.
A novel series of D-A-D-type 9-phenyl-9-phosphafluorene oxide (PhFlOP) derivatives was prepared and is reported herein. The synthetic protocol involved 5 steps from commercially available 2-bromo-4-fluoro-1-nitrobenzene, featuring a noble-metal-free system, mild reaction conditions, and a good yield, especially for the final CsCO-facilitated nucleophilic substitution (77-91% yield). The characterization data obtained from IR and NMR spectroscopy (H, C, F, and P) as well as HRMS spectrometry were in full agreement with the expected structures, and single-crystal X-ray diffraction analysis was conducted to confirm the structure of compound .
View Article and Find Full Text PDFBiochemistry
January 2025
Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States.
Aspartimidylation is a post-translational modification found in multiple families of ribosomally synthesized and post-translationally modified peptides (RiPPs). We recently reported on the imiditides, a new RiPP family in which aspartimidylation is the class-defining modification. Imiditide biosynthetic gene clusters encode a precursor protein and a methyltransferase that methylates a specific Asp residue, converting it to aspartimide.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!