Publications by Alejandra Gonzalez-Beltran | LitMetric

Publications by authors named "Alejandra Gonzalez-Beltran"

Page 1 of 2

Machine actionable metadata models.

Dominique Batista Alejandra Gonzalez-Beltran Susanna-Assunta Sansone Philippe Rocca-Serra

Sci Data

September 2022

Community-developed minimum information checklists are designed to drive the rich and consistent reporting of metadata, underpinning the reproducibility and reuse of the data. These reporting guidelines, however, are usually in the form of narratives intended for human consumption. Modular and reusable machine-readable versions are also needed.

View Article and Find Full Text PDF

FAIR data pipeline: provenance-driven data management for traceable scientific workflows.

Sonia Natalie Mitchell Andrew Lahiff Nathan Cummings Jonathan Hollocombe Bram Boskamp Alejandra N Gonzalez-Beltran

Philos Trans A Math Phys Eng Sci

October 2022

Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used.

View Article and Find Full Text PDF

ISA API: An open platform for interoperable life science experimental metadata.

David Johnson Dominique Batista Keeva Cochrane Robert P Davey Anthony Etuk Alejandra Gonzalez-Beltran

Gigascience

September 2021

Background: The Investigation/Study/Assay (ISA) Metadata Framework is an established and widely used set of open source community specifications and software tools for enabling discovery, exchange, and publication of metadata from experiments in the life sciences. The original ISA software suite provided a set of user-facing Java tools for creating and manipulating the information structured in ISA-Tab-a now widely used tabular format. To make the ISA framework more accessible to machines and enable programmatic manipulation of experiment metadata, the JSON serialization ISA-JSON was developed.

View Article and Find Full Text PDF

Ten simple rules for making a vocabulary FAIR.

Simon J D Cox Alejandra N Gonzalez-Beltran Barbara Magagna Maria-Cristina Marinescu

PLoS Comput Biol

June 2021

We present ten simple rules that support converting a legacy vocabulary-a list of terms available in a print-based glossary or in a table not accessible using web standards-into a FAIR vocabulary. Various pathways may be followed to publish the FAIR vocabulary, but we emphasise particularly the goal of providing a globally unique resolvable identifier for each term or concept. A standard representation of the concept should be returned when the individual web identifier is resolved, using SKOS or OWL serialised in an RDF-based representation for machine-interchange and in a web-page for human consumption.

View Article and Find Full Text PDF

Radical collaboration during a global health emergency: development of the RDA COVID-19 data sharing recommendations and guidelines.

Brian Pickering Timea Biro Claire C Austin Alexander Bernier Louise Bezuidenhout Alejandra Gonzalez-Beltran

Open Res Eur

June 2021

The coronavirus disease 2019 (COVID-19) global pandemic required a rapid and effective response. This included ethical and legally appropriate sharing of data. The European Commission (EC) called upon the Research Data Alliance (RDA) to recruit experts worldwide to quickly develop recommendations and guidelines for COVID-related data sharing.

View Article and Find Full Text PDF

Fostering global data sharing: highlighting the recommendations of the Research Data Alliance COVID-19 working group.

Claire C Austin Alexander Bernier Louise Bezuidenhout Juan Bicarregui Timea Biro Alejandra Gonzalez-Beltran

Wellcome Open Res

May 2021

The systemic challenges of the COVID-19 pandemic require cross-disciplinary collaboration in a global and timely fashion. Such collaboration needs open research practices and the sharing of research outputs, such as data and code, thereby facilitating research and research reproducibility and timely collaboration beyond borders. The Research Data Alliance COVID-19 Working Group recently published a set of recommendations and guidelines on data sharing and related best practices for COVID-19 research.

View Article and Find Full Text PDF

Community standards for open cell migration data.

Alejandra N Gonzalez-Beltran Paola Masuzzo Christophe Ampe Gert-Jan Bakker Sébastien Besson

Gigascience

May 2020

Article Synopsis

Cell migration research is a rapidly growing field, but current datasets are underutilized due to varying experimental methods and formats that hinder data sharing and analysis.
Making these datasets findable, accessible, interoperable, and reusable (FAIR) would enhance opportunities for meta-analysis and data integration.
The Cell Migration Standardisation Organisation (CMSO) is working to establish standardized formats and vocabularies for cell migration data, which will improve algorithms, tools, and enable further exploration of this complex biological process.

View Article and Find Full Text PDF

Semantic concept schema of the linear mixed model of experimental observations.

Hanna Ćwiek-Kupczyńska Katarzyna Filipiak Augustyn Markiewicz Philippe Rocca-Serra Alejandra N Gonzalez-Beltran

Sci Data

February 2020

In the information age, smart data modelling and data management can be carried out to address the wealth of data produced in scientific experiments. In this paper, we propose a semantic model for the statistical analysis of datasets by linear mixed models. We tie together disparate statistical concepts in an interdisciplinary context through the application of ontologies, in particular the Statistics Ontology (STATO), to produce FAIR data summaries.

View Article and Find Full Text PDF

The Data Tags Suite (DATS) model for discovering data access and use requirements.

George Alter Alejandra Gonzalez-Beltran Lucila Ohno-Machado Philippe Rocca-Serra

Gigascience

February 2020

Background: Data reuse is often controlled to protect the privacy of subjects and patients. Data discovery tools need ways to inform researchers about restrictions on data access and re-use.

Results: We present elements in the Data Tags Suite (DATS) metadata schema describing data access, data use conditions, and consent information.

View Article and Find Full Text PDF

FAIRsharing as a community approach to standards, repositories and policies.

Susanna-Assunta Sansone Peter McQuilton Philippe Rocca-Serra Alejandra Gonzalez-Beltran Massimiliano Izzo

Nat Biotechnol

April 2019

View Article and Find Full Text PDF

Addendum: The FAIR Guiding Principles for scientific data management and stewardship.

Mark D Wilkinson Michel Dumontier Ijsbrand Jan Aalbersberg Gabrielle Appleton Myles Axton Alejandra Gonzalez-Beltran

Sci Data

March 2019

View Article and Find Full Text PDF

Interoperable and scalable data analysis with microservices: applications in metabolomics.

Payam Emami Khoonsari Pablo Moreno Sven Bergmann Joachim Burman Marco Capuccini Alejandra N Gonzalez-Beltran

Bioinformatics

October 2019

Motivation: Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed using the Kubernetes container orchestrator.

Results: We developed a Virtual Research Environment (VRE) which facilitates rapid integration of new tools and developing scalable and interoperable workflows for performing metabolomics data analysis.

View Article and Find Full Text PDF

PhenoMeNal: processing and analysis of metabolomics data in the cloud.

Kristian Peters James Bradbury Sven Bergmann Marco Capuccini Marta Cascante Alejandra Gonzalez-Beltran

Gigascience

February 2019

Background: Metabolomics is the comprehensive study of a multitude of small molecules to gain insight into an organism's metabolism. The research field is dynamic and expanding with applications across biomedical, biotechnological, and many other applied biological domains. Its computationally intensive nature has driven requirements for open data formats, data repositories, and data analysis tools.

View Article and Find Full Text PDF

DataMed - an open source discovery index for finding biomedical datasets.

Xiaoling Chen Anupama E Gururaj Burak Ozyurt Ruiling Liu Ergin Soysal Alejandra Gonzalez-Beltran

J Am Med Inform Assoc

March 2018

Article Synopsis

DataMed is an open source biomedical data discovery system developed to help users find relevant datasets easily within the complex biomedical data landscape.
The system includes a data ingestion pipeline that standardizes dataset metadata and a search engine that utilizes user queries to locate relevant data, achieving a 90% accuracy rate in data processing.
Evaluations showed the search engine's performance metrics, including an average precision of 0.2033, with efforts towards increasing data accessibility for the biomedical community through open source availability.

View Article and Find Full Text PDF

Data discovery with DATS: exemplar adoptions and lessons learned.

Alejandra N Gonzalez-Beltran John Campbell Patrick Dunn Diana Guijarro Sanda Ionescu

J Am Med Inform Assoc

January 2018

The DAta Tag Suite (DATS) is a model supporting dataset description, indexing, and discovery. It is available as an annotated serialization with schema.org, a vocabulary used by major search engines, thus making the datasets discoverable on the web.

View Article and Find Full Text PDF

The future of metabolomics in ELIXIR.

Merlijn van Rijswijk Charlie Beirnaert Christophe Caron Marta Cascante Victoria Dominguez Alejandra Gonzalez-Beltran

F1000Res

September 2017

Metabolomics, the youngest of the major omics technologies, is supported by an active community of researchers and infrastructure developers across Europe. To coordinate and focus efforts around infrastructure building for metabolomics within Europe, a workshop on the "Future of metabolomics in ELIXIR" was organised at Frankfurt Airport in Germany. This one-day strategic workshop involved representatives of ELIXIR Nodes, members of the PhenoMeNal consortium developing an e-infrastructure that supports workflow-based metabolomics analysis pipelines, and experts from the international metabolomics community.

View Article and Find Full Text PDF

Four simple recommendations to encourage best practices in research software.

Rafael C Jiménez Mateusz Kuzak Monther Alhamdoosh Michelle Barker Bérénice Batut Alejandra Gonzalez-Beltran

F1000Res

June 2017

Scientific research relies on computer software, yet software is not always developed following practices that ensure its quality and sustainability. This manuscript does not aim to propose new software development best practices, but rather to provide simple recommendations that encourage the adoption of existing best practices. Software development best practices promote better quality software, and better quality software improves the reproducibility and reusability of research.

View Article and Find Full Text PDF

Identifiers for the 21st century: How to design, provision, and reuse persistent identifiers to maximize utility and impact of life science data.

Julie A McMurry Nick Juty Niklas Blomberg Tony Burdett Tom Conlin Alejandra Gonzalez-Beltran

PLoS Biol

June 2017

In many disciplines, data are highly decentralized across thousands of online databases (repositories, registries, and knowledgebases). Wringing value from such databases depends on the discipline of data science and on the humble bricks and mortar that make integration possible; identifiers are a core component of this integration infrastructure. Drawing on our experience and on work by other groups, we outline 10 lessons we have learned about the identifier qualities and best practices that facilitate large-scale data integration.

View Article and Find Full Text PDF

DATS, the data tag suite to enable discoverability of datasets.

Susanna-Assunta Sansone Alejandra Gonzalez-Beltran Philippe Rocca-Serra George Alter Jeffrey S Grethe

Sci Data

June 2017

Today's science increasingly requires effective ways to find and access existing datasets that are distributed across a range of repositories. For researchers in the life sciences, discoverability of datasets may soon become as essential as identifying the latest publications via PubMed. Through an international collaborative effort funded by the National Institutes of Health (NIH)'s Big Data to Knowledge (BD2K) initiative, we have designed and implemented the DAta Tag Suite (DATS) model to support the DataMed data discovery index.

View Article and Find Full Text PDF

Finding useful data across multiple biomedical data repositories using DataMed.

Lucila Ohno-Machado Susanna-Assunta Sansone George Alter Ian Fore Jeffrey Grethe Alejandra Gonzalez-Beltran

Nat Genet

May 2017

The value of broadening searches for data across multiple repositories has been identified by the biomedical research community. As part of the US National Institutes of Health (NIH) Big Data to Knowledge initiative, we work with an international community of researchers, service providers and knowledge experts to develop and test a data index and search engine, which are based on metadata extracted from various data sets in a range of repositories. DataMed is designed to be, for data, what PubMed has been for the scientific literature.

View Article and Find Full Text PDF

The health care and life sciences community profile for dataset descriptions.

Michel Dumontier Alasdair J G Gray M Scott Marshall Vladimir Alexiev Peter Ansell Alejandra N Gonzalez-Beltran

PeerJ

September 2016

Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. However, while there are many relevant vocabularies for the annotation of a dataset, none sufficiently captures all the necessary metadata. This prevents uniform indexing and querying of dataset repositories.

View Article and Find Full Text PDF

BioSharing: curated and crowd-sourced metadata standards, databases and data policies in the life sciences.

Peter McQuilton Alejandra Gonzalez-Beltran Philippe Rocca-Serra Milo Thurston Allyson Lister

Database (Oxford)

January 2017

BioSharing (http://www.biosharing.org) is a manually curated, searchable portal of three linked registries.

View Article and Find Full Text PDF

The Ontology for Biomedical Investigations.

Anita Bandrowski Ryan Brinkman Mathias Brochhausen Matthew H Brush Bill Bug Alejandra Gonzalez-Beltran

PLoS One

April 2017

The Ontology for Biomedical Investigations (OBI) is an ontology that provides terms with precisely defined meanings to describe all aspects of how investigations in the biological and medical domains are conducted. OBI re-uses ontologies that provide a representation of biomedical knowledge from the Open Biological and Biomedical Ontologies (OBO) project and adds the ability to describe how this knowledge was derived. We here describe the state of OBI and several applications that are using it, such as adding semantic expressivity to existing databases, building data entry forms, and enabling interoperability between knowledge resources.

View Article and Find Full Text PDF

The FAIR Guiding Principles for scientific data management and stewardship.

Mark D Wilkinson Michel Dumontier I Jsbrand Jan Aalbersberg Gabrielle Appleton Myles Axton Alejandra Gonzalez-Beltran

Sci Data

March 2016

Article Synopsis

There is a strong need to make it easier to share and reuse scientific data among different groups like schools, companies, and publishers.
A new set of rules called the FAIR Data Principles has been created to help people make their data easier for both machines and humans to find and use.
This document is the first official introduction to the FAIR Principles and explains why they are important and gives examples of how they are being used.

View Article and Find Full Text PDF

Data standards can boost metabolomics research, and if there is a will, there is a way.

Philippe Rocca-Serra Reza M Salek Masanori Arita Elon Correa Saravanan Dayalan Alejandra Gonzalez-Beltran

Metabolomics

November 2015

Thousands of articles using metabolomics approaches are published every year. With the increasing amounts of data being produced, mere description of investigations as text in manuscripts is not sufficient to enable re-use anymore: the underlying data needs to be published together with the findings in the literature to maximise the benefit from public and private expenditure and to take advantage of an enormous opportunity to improve scientific reproducibility in metabolomics and cognate disciplines. Reporting recommendations in metabolomics started to emerge about a decade ago and were mostly concerned with inventories of the information that had to be reported in the literature for consistency.

View Article and Find Full Text PDF

A PHP Error was encountered

Severity: Warning

Message: fopen(/var/lib/php/sessions/ci_sessionl0pcund2vi8381vm8fj7eeirr4musskc): Failed to open stream: No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 177

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once

A PHP Error was encountered

Severity: Warning

Message: session_start(): Failed to read session data: user (path: /var/lib/php/sessions)

Filename: Session/Session.php

Line Number: 137

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once