We discover a robust self-supervised strategy tailored toward molecular representations for generative masked language models through a series of tailored, in-depth ablations. Using this pretraining strategy, we train BARTSmiles, a BART-like model with an order of magnitude more compute than previous self-supervised molecular representations. In-depth evaluations show that BARTSmiles consistently outperforms other self-supervised representations across classification, regression, and generation tasks, setting a new state-of-the-art on eight tasks.
View Article and Find Full Text PDFIn silico (quantitative) structure-activity relationship modeling is an approach that provides a fast and cost-effective alternative to assess the genotoxic potential of chemicals. However, one of the limiting factors for model development is the availability of consolidated experimental datasets. In the present study, we collected experimental data on micronuclei in vitro and in vivo, utilizing databases and conducting a PubMed search, aided by text mining using the BioBERT large language model.
View Article and Find Full Text PDFObjective: Leukemia represents a serious public health concern as the incidence is increasing worldwide. In this study we aimed to describe the epidemiological profile of acute lymphoblastic (ALL) and myeloid (AML) leukemia, identify disease clusters and find association with possible risk factors.
Methods: Data on leukemia cases were provided by the National Institute of Health of the Republic of Armenia for the period of 2012-2018.
Collecting labeled data for many important tasks in chemoinformatics is time consuming and requires expensive experiments. In recent years, machine learning has been used to learn rich representations of molecules using large scale unlabeled molecular datasets and transfer the knowledge to solve the more challenging tasks with limited datasets. Variational autoencoders are one of the tools that have been proposed to perform the transfer for both chemical property prediction and molecular generation tasks.
View Article and Find Full Text PDFRapidly evolving laser technologies have led to the development of laser-generated particle accelerators as an alternative to conventional facilities. However, the radiobiological characteristics need to be determined to enhance their applications in biology and medicine. In this study, the radiobiological effects of ultrashort pulsed electron beam (UPEB) and X-ray radiation in human lung fibroblasts (MRC-5 cell line) exposed to doses of 0.
View Article and Find Full Text PDFRecently, it was reported that ochratoxin A (OTA) mycotoxin, produced by a number of Aspergillus and Penicillium fungal species, may cause neuropsychological impairment or mental and emotional disorders but the mechanism of neurotoxicity remains unknown. Adverse effects of OTA in human (SHSY5Y) and mouse (HT22) neuronal cell lines were studied in vitro. OTA was found to be non-cytotoxic in both cell lines at concentrations 2.
View Article and Find Full Text PDFType 2 diabetes mellitus (T2DM) is a severe health problem worldwide, reaching epidemic levels. High susceptibility to infections of T2DM patients indicates dysregulated immune responses to pathogens. However, innate immune responses, including monocyte functions, in T2DM are poorly investigated.
View Article and Find Full Text PDFIntroduction: Autoinflammatory and autoimmune disorders are characterized by aberrant changes in innate and adaptive immunity that may lead from an initial inflammatory state to an organ specific damage. These disorders possess heterogeneity in terms of affected organs and clinical phenotypes. However, despite the differences in etiology and phenotypic variations, they share genetic associations, treatment responses and clinical manifestations.
View Article and Find Full Text PDF