Optical structure recognition software to recover chemical information: OSRA, an open source solution.

J Chem Inf Model

Laboratory of Medicinal Chemistry, SAIC-Frederick, Inc., NCI-Frederick, Frederick, Maryland 21702, USA.

Published: March 2009

Until recently most scientific and patent documents dealing with chemistry have described molecular structures either with systematic names or with graphical images of Kekulé structures. The latter method poses inherent problems in the automated processing that is needed when the number of documents ranges in the hundreds of thousands or even millions since graphical representations cannot be directly interpreted by a computer. To recover this structural information, which is otherwise all but lost, we have built an optical structure recognition application based on modern advances in image processing implemented in open source tools, OSRA. OSRA can read documents in over 90 graphical formats including GIF, JPEG, PNG, TIFF, PDF, and PS, automatically recognizes and extracts the graphical information representing chemical structures in such documents, and generates the SMILES or SD representation of the encountered molecular structure images.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2889020PMC
http://dx.doi.org/10.1021/ci800067rDOI Listing

Publication Analysis

Top Keywords

optical structure
8
structure recognition
8
open source
8
recognition software
4
software recover
4
recover chemical
4
chemical osra
4
osra open
4
source solution
4
solution scientific
4

Similar Publications

The widespread reliance on single-use plastics (SUPs) has fostered a global throwaway culture, especially in the food packaging industry, where convenience and low cost have driven their adoption, posing serious environmental threats, particularly to marine ecosystems and biodiversity. Edible and ecofriendly packaging made from millet, specifically sorghum ( () Moench), is a promising solution to mitigate SUP consumption and promote sustainability. This study explores the development of edible sorghum bowls, enhanced through roasting and incorporating 3 g of hibiscus and rose flower powders.

View Article and Find Full Text PDF

The human body is an intricate system, where diverse and complex signaling among different organs sustains physiological activities. The eye, as a primary organ for information acquisition, not only plays a crucial role in visual perception but also, as increasing evidence suggests, exerts a broad influence on the entire body through complex circuits upon receiving light signals which is called non-image-forming vision. However, the extent and mechanisms of light's impact on the body through the eyes remain insufficiently explored.

View Article and Find Full Text PDF

The Effect of Antisolvent Treatment on the Growth of 2D/3D Tin Perovskite Films for Solar Cells.

ACS Energy Lett

January 2025

Department of Chemistry and Centre for Processable Electronics, Molecular Sciences Research Hub, Imperial College London, London W12 0BZ, U.K.

Antisolvent treatment is used in the fabrication of perovskite films to control grain growth during spin coating. We study widely incorporated aromatic hydrocarbons and aprotic ethers, discussing the origin of their performance differences in 2D/3D Sn perovskite (PEAFASnI) solar cells. Among the antisolvents that we screen, diisopropyl ether yields the highest power conversion efficiency in solar cells.

View Article and Find Full Text PDF

Convergent-beam attosecond x-ray crystallography.

Struct Dyn

January 2025

Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, Notkestr. 85, 22607 Hamburg, Germany.

Sub-ångström spatial resolution of electron density coupled with sub-femtosecond to few-femtosecond temporal resolution is required to directly observe the dynamics of the electronic structure of a molecule after photoinitiation or some other ultrafast perturbation, such as by soft X-rays. Meeting this challenge, pushing the field of quantum crystallography to attosecond timescales, would bring insights into how the electronic and nuclear degrees of freedom couple, enable the study of quantum coherences involved in molecular dynamics, and ultimately enable these dynamics to be controlled. Here, we propose to reach this realm by employing convergent-beam x-ray crystallography with high-power attosecond pulses from a hard-x-ray free-electron laser.

View Article and Find Full Text PDF

The degradation of methylene blue dye-contaminated wastewater via photocatalysis is an efficient approach towards environmental remediation. The SrZrO perovskite photocatalyst was synthesized using the modified Pechini sol-gel method, and characterized using XRD, FESEM, FTIR, and UV-visible spectrophotometer. Crystallite size obtained by the Scherrer and Williamson-Hall methods were 45.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!