Motivation: Software plays a crucial and growing role in research. Unfortunately, the computational component in Life Sciences research is often challenging to reproduce and verify. It could be undocumented, opaque, contain unknown errors that affect the outcome, or be directly unavailable and impossible to use for others.
View Article and Find Full Text PDFInteractive Jupyter Notebooks in combination with Conda environments can be used to generate FAIR (Findable, Accessible, Interoperable and Reusable/Reproducible) biomolecular simulation workflows. The interactive programming code accompanied by documentation and the possibility to inspect intermediate results with versatile graphical charts and data visualization is very helpful, especially in iterative processes, where parameters might be adjusted to a particular system of interest. This work presents a collection of FAIR notebooks covering various areas of the biomolecular simulation field, such as molecular dynamics (MD), protein-ligand docking, molecular checking/modeling, molecular interactions, and free energy perturbations.
View Article and Find Full Text PDFThe COVID-19 pandemic has brought significant changes and advances in the field of vaccination, including the implementation and widespread use of encapsidated mRNA vaccines in general healthcare practice. Here, we present two new mRNAs expressing antigenic parts of the SARS-CoV-2 spike protein and provide data supporting their functionality. The first mRNA, called RBD-mRNA, encodes a trimeric form of the virus spike protein receptor binding domain (RBD).
View Article and Find Full Text PDFRegulonDB is a database that contains the most comprehensive corpus of knowledge of the regulation of transcription initiation of Escherichia coli K-12, including data from both classical molecular biology and high-throughput methodologies. Here, we describe biological advances since our last NAR paper of 2019. We explain the changes to satisfy FAIR requirements.
View Article and Find Full Text PDFMolecular dynamics (MD) simulations are keeping computers busy around the world, generating a huge amount of data that is typically not open to the scientific community. Pioneering efforts to ensure the safety and reusability of MD data have been based on the use of simple databases providing a limited set of standard analyses on single-short trajectories. Despite their value, these databases do not offer a true solution for the current community of MD users, who want a flexible analysis pipeline and the possibility to address huge non-Markovian ensembles of large systems.
View Article and Find Full Text PDFBiomaterials research output has experienced an exponential increase over the last three decades. The majority of research is published in the form of scientific articles and is therefore available as unstructured text, making it a challenging input for computational processing. Computational tools are becoming essential to overcome this information overload.
View Article and Find Full Text PDFArtificial intelligence (AI) is transforming the field of medical imaging and has the potential to bring medicine from the era of 'sick-care' to the era of healthcare and prevention. The development of AI requires access to large, complete, and harmonized real-world datasets, representative of the population, and disease diversity. However, to date, efforts are fragmented, based on single-institution, size-limited, and annotation-limited datasets.
View Article and Find Full Text PDFMutations in the kinase domain of the epidermal growth factor receptor (EGFR) can be drivers of cancer and also trigger drug resistance in patients receiving chemotherapy treatment based on kinase inhibitors. knowledge of the impact of EGFR variants on drug sensitivity would help to optimize chemotherapy and design new drugs that are effective against resistant variants before they emerge in clinical trials. To this end, we explored a variety of methods, from sequence-based to "state-of-the-art" atomistic simulations.
View Article and Find Full Text PDFWiley Interdiscip Rev Comput Mol Sci
May 2022
Exascale computing has been a dream for ages and is close to becoming a reality that will impact how molecular simulations are being performed, as well as the quantity and quality of the information derived for them. We review how the biomolecular simulations field is anticipating these new architectures, making emphasis on recent work from groups in the BioExcel Center of Excellence for High Performance Computing. We exemplified the power of these simulation strategies with the work done by the HPC simulation community to fight Covid-19 pandemics.
View Article and Find Full Text PDFWe present BioExcel Building Blocks Workflows, a web-based graphical user interface (GUI) offering access to a collection of transversal pre-configured biomolecular simulation workflows assembled with the BioExcel Building Blocks library. Available workflows include Molecular Dynamics setup, protein-ligand docking, trajectory analyses and small molecule parameterization. Workflows can be launched in the platform or downloaded to be run in the users' own premises.
View Article and Find Full Text PDFMotivation: The BioExcel Building Blocks (BioBB) library offers a broad collection of wrappers on top of common biomolecular simulation and bioinformatics tools. The possibility to access the library remotely and programmatically increases its usability, allowing individual and sporadic executions and enabling remote workflows.
Results: BioBB REST API extends and complements the BioBB library offering programmatic access to the collection of biomolecular simulation tools included in the BioExcel Building Blocks library.
Various data sharing platforms are being developed to enhance the sharing of cohort data by addressing the fragmented state of data storage and access systems. However, policy challenges in several domains remain unresolved. The euCanSHare workshop was organized to identify and discuss these challenges and to set the future research agenda.
View Article and Find Full Text PDFIntroduction: Depression, cardiovascular diseases and diabetes are among the major non-communicable diseases, leading to significant disability and mortality worldwide. These diseases may share environmental and genetic determinants associated with multimorbid patterns. Stressful early-life events are among the primary factors associated with the development of mental and physical diseases.
View Article and Find Full Text PDFModern high-throughput structure-based drug discovery algorithms consider ligand flexibility, but typically with low accuracy, which results in a loss of performance in the derived models. Here we present the bioactive conformational ensemble (BCE) server and its associated database. The server creates conformational ensembles of drug-like ligands and stores them in the BCE database, where a variety of analyses are offered to the user.
View Article and Find Full Text PDFThe identification of orthologs-genes in different species which descended from the same gene in their last common ancestor-is a prerequisite for many analyses in comparative genomics and molecular evolution. Numerous algorithms and resources have been conceived to address this problem, but benchmarking and interpreting them is fraught with difficulties (need to compare them on a common input dataset, absence of ground truth, computational cost of calling orthologs). To address this, the Quest for Orthologs consortium maintains a reference set of proteomes and provides a web server for continuous orthology benchmarking (http://orthology.
View Article and Find Full Text PDFPurpose: To develop recommendations for management of patients with breast cancer (BC) with germline mutations in BC susceptibility genes.
Methods: The American Society of Clinical Oncology, American Society for Radiation Oncology, and Society of Surgical Oncology convened an Expert Panel to develop recommendations based on a systematic review of the literature and a formal consensus process.
Results: Fifty-eight articles met eligibility criteria and formed the evidentiary basis for the local therapy recommendations; six randomized controlled trials of systemic therapy met eligibility criteria.
To understand the role of the extensive senescence-associated 3D genome reorganization, we generated genome-wide chromatin interaction maps, epigenome, replication-timing, whole-genome bisulfite sequencing, and gene expression profiles from cells entering replicative senescence (RS) or upon oncogene-induced senescence (OIS). We identify senescence-associated heterochromatin domains (SAHDs). Differential intra- versus inter-SAHD interactions lead to the formation of senescence-associated heterochromatin foci (SAHFs) in OIS but not in RS.
View Article and Find Full Text PDFIn the recent years, the improvement of software and hardware performance has made biomolecular simulations a mature tool for the study of biological processes. Simulation length and the size and complexity of the analyzed systems make simulations both complementary and compatible with other bioinformatics disciplines. However, the characteristics of the software packages used for simulation have prevented the adoption of the technologies accepted in other bioinformatics fields like automated deployment systems, workflow orchestration, or the use of software containers.
View Article and Find Full Text PDF