Publications by authors named "David A Winkler"

Modelling Data (MODA) reporting guidelines have been proposed for common terminology and for recording metadata for physics-based materials modelling and simulations in a CEN Workshop Agreement (CWA 17284:2018). Their purpose is similar to that of the Quantitative Structure-Activity Relationship (QSAR) model report form (QMRF) that aims to increase industry and regulatory confidence in QSAR models, but for a wider range of model types. Recently, the WorldFAIR project's nanomaterials case study suggested that both QMRF and MODA templates are an important means to enhance compliance of nanoinformatics models, and their underpinning datasets, with the FAIR principles (Findable, Accessible, Interoperable, Reusable).

View Article and Find Full Text PDF

Secondary ion mass spectrometry (SIMS) is a powerful analytical technique for characterizing the molecular and elemental composition of surfaces. Individual mass spectra can provide information about the mean surface composition, while spatial mapping can elucidate the spatial distributions of molecular species in 2D and 3D with no prior labeling of molecular targets. The data sets produced by SIMS techniques are large and inherently complex, often containing subtle relationships between spatial and molecular features.

View Article and Find Full Text PDF

Extracellular vesicles (EVs) are potentially useful biomarkers for disease detection and monitoring. Development of a label-free technique for imaging and distinguishing small volumes of EVs from different cell types and cell states would be of great value. Here, we have designed a method to explore the chemical changes in EVs associated with neuroinflammation using Time-of-Flight Secondary Ion Mass spectrometry (ToF-SIMS) and machine learning (ML).

View Article and Find Full Text PDF

Multivariate statistical tools and machine learning (ML) techniques can deconvolute hyperspectral data and control the disparity between the number of samples and features in materials science. Nevertheless, the importance of generating sufficient high-quality sample replicates in training data cannot be overlooked, as it fundamentally affects the performance of ML models. Here, we present a quantitative analysis of time-of-flight secondary ion mass spectrometry (ToF-SIMS) spectra of a simple microarray system of two food dyes using partial least-squares (PLS, linear) and random forest (RF, nonlinear) algorithms.

View Article and Find Full Text PDF

Supervised and unsupervised machine learning algorithms are routinely applied to time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging data and, more broadly, to mass spectrometry imaging (MSI). These algorithms have accelerated large-scale, single-pixel analysis, classification, and regression. However, there is relatively little research on methods suited for so-called weakly supervised problems, where ground-truth class labels exist at the image level, but not at the individual pixel level.

View Article and Find Full Text PDF

There is strong interest to improve the therapeutic potential of gold nanoparticles (GNPs) while ensuring their safe development. The utility of GNPs in medicine requires a molecular-level understanding of how GNPs interact with biological systems. Despite considerable research efforts devoted to monitoring the internalisation of GNPs, there is still insufficient understanding of the factors responsible for the variability in GNP uptake in different cell types.

View Article and Find Full Text PDF

Inorganic lead-free halide perovskites, devoid of toxic or rare elements, have garnered considerable attention as photocatalysts for pollution control, CO reduction and hydrogen production. In the extensive perovskite design space, factors like substitution or doping level profoundly impact their performance. To address this complexity, a synergistic combination of machine learning models and theoretical calculations were used to efficiently screen substitution elements that enhanced the photoactivity of substituted Cs AgBiBr perovskites.

View Article and Find Full Text PDF

Decades of nanotoxicology research have generated extensive and diverse data sets. However, data is not equal to information. The question is how to extract critical information buried in vast data streams.

View Article and Find Full Text PDF

The self-organizing map with relational perspective mapping (SOM-RPM) is an unsupervised machine learning method that can be used to visualize and interpret high-dimensional hyperspectral data. We have previously used SOM-RPM for the analysis of time-of-flight secondary ion mass spectrometry (ToF-SIMS) hyperspectral images and three-dimensional (3D) depth profiles. This provides insightful visualization of features and trends of 3D depth profile data, using a slice-by-slice view, which can be useful for highlighting structural flaws including molecular characteristics and transport of contaminants to a buried interface and characterization of spectra.

View Article and Find Full Text PDF

Bacterial infections are increasingly problematic due to the rise of antimicrobial resistance. Consequently, the rational design of materials naturally resistant to biofilm formation is an important strategy for preventing medical device-associated infections. Machine learning (ML) is a powerful method to find useful patterns in complex data from a wide range of fields.

View Article and Find Full Text PDF

Drugs against novel targets are needed to treat COVID-19 patients, especially as SARS-CoV-2 is capable of rapid mutation. Structure-based de novo drug design and repurposing of drugs and natural products is a rational approach to discovering potentially effective therapies. These in silico simulations can quickly identify existing drugs with known safety profiles that can be repurposed for COVID-19 treatment.

View Article and Find Full Text PDF

Addressing climate change challenges by reducing greenhouse gas levels requires innovative adsorbent materials for clean energy applications. Recent progress in machine learning has stimulated technological breakthroughs in the discovery, design, and deployment of materials with potential for high-performance and low-cost clean energy applications. This review summarizes basic machine learning methods-data collection, featurization, model generation, and model evaluation-and reviews their use in the development of robust adsorbent materials.

View Article and Find Full Text PDF

There has been impressive growth in the use of radiopharmaceuticals for therapy, selective toxic payload delivery, and noninvasive diagnostic imaging of disease. The increasing timeframes and costs involved in the discovery and development of new radiopharmaceuticals have driven the development of more efficient strategies for this process. Computer-Aided Drug Design (CADD) methods and Machine Learning (ML) have become more effective over the last two decades for drug and materials discovery and optimization.

View Article and Find Full Text PDF

Management of nanomaterials and nanosafety data needs to operate under the FAIR (findability, accessibility, interoperability, and reusability) principles and this requires a unique, global identifier for each nanomaterial. Existing identifiers may not always be applicable or sufficient to definitively identify the specific nanomaterial used in a particular study, resulting in the use of textual descriptions in research project communications and reporting. To ensure that internal project documentation can later be linked to publicly released data and knowledge for the specific nanomaterials, or even to specific batches and variants of nanomaterials utilised in that project, a new identifier is proposed: the European Registry of Materials Identifier.

View Article and Find Full Text PDF

In a previous study, in silico screening of the binding of almost all proteins in the Protein Data Bank to each of the five noble gases xenon, krypton, argon, neon, and helium was reported. This massive and rich data set requires analysis to identify the gas-protein interactions that have the best binding strengths, those where the binding of the noble gas occurs at a site that can modulate the function of the protein, and where this modulation might generate clinically relevant effects. Here, we report a preliminary analysis of this data set using a rational, heuristic score based on binding strength and location.

View Article and Find Full Text PDF

Layered 2D crystals have unique properties and rich chemical and electronic diversity, with over 6000 2D crystals known and, in principle, millions of different stacked hybrid 2D crystals accessible. This diversity provides unique combinations of properties that can profoundly affect the future of energy conversion and harvesting devices. Notably, this includes catalysts, photovoltaics, superconductors, solar-fuel generators, and piezoelectric devices that will receive broad commercial uptake in the near future.

View Article and Find Full Text PDF

Repurposing of existing drugs is a rapid way to find potential new treatments for SARS-CoV-2. Here, we applied a virtual screening approach using Autodock Vina and molecular dynamic simulation in tandem to screen and calculate binding energies of repurposed drugs against the SARS-CoV-2 helicase protein (non-structural protein nsp13). Amongst the top hits from our study were antivirals, antihistamines, and antipsychotics, plus a range of other drugs.

View Article and Find Full Text PDF

Electrocatalysts and photocatalysts are key to a sustainable future, generating clean fuels, reducing the impact of global warming, and providing solutions to environmental pollution. Improved processes for catalyst design and a better understanding of electro/photocatalytic processes are essential for improving catalyst effectiveness. Recent advances in data science and artificial intelligence have great potential to accelerate electrocatalysis and photocatalysis research, particularly the rapid exploration of large materials chemistry spaces through machine learning.

View Article and Find Full Text PDF

Feature extraction algorithms are an important class of unsupervised methods used to reduce data dimensionality. They have been applied extensively for time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging─commonly, matrix factorization (MF) techniques such as principal component analysis have been used. A limitation of MF is the assumption of linearity, which is generally not accurate for ToF-SIMS data.

View Article and Find Full Text PDF

We urgently need to identify drugs to treat patients suffering from COVID-19 infection. Drugs rarely act at single molecular targets. Off-target effects are responsible for undesirable side effects and beneficial synergy between targets for specific illnesses.

View Article and Find Full Text PDF

Time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging offers a powerful, label-free method for exploring organic, bioorganic, and biological systems. The technique is capable of very high spatial resolution, while also producing an enormous amount of information about the chemical and molecular composition of a surface. However, this information is inherently complex, making interpretation and analysis of the vast amount of data produced by a single ToF-SIMS experiment a considerable challenge.

View Article and Find Full Text PDF

The piezoelectric effect, mechanical-to-electrical and electrical-to-mechanical energy conversion, is highly beneficial for functional and responsive electronic devices. To fully exploit this property, miniaturization of piezoelectric materials is the subject of intense research. Indeed, select atomically thin 2D materials strongly exhibit the piezoelectric effect.

View Article and Find Full Text PDF

Unlabelled: Repurposing of existing drugs and drug candidates is an ideal approach to identify new potential therapies for SARS-CoV-2 that can be tested without delay in human trials of infected patients. Here we applied a virtual screening approach using Autodock Vina and molecular dynamics simulation in tandem to calculate binding energies for repurposed drugs against the SARS-CoV-2 RNA-dependent RNA polymerase (RdRp). We thereby identified 80 promising compounds with potential activity against SARS-Cov2, consisting of a mixture of antiviral drugs, natural products and drugs with diverse modes of action.

View Article and Find Full Text PDF

New photocatalysts are traditionally identified through trial-and-error methods. Machine learning has shown considerable promise for improving the efficiency of photocatalyst discovery from a large potential pool. Here, we describe a multi-step, target-driven consensus method using a stacking meta-learning algorithm that robustly predicts bandgaps and H evolution activities of photocatalysts.

View Article and Find Full Text PDF