Provenance is information describing the lineage of an object, such as a dataset or biological material. Since these objects can be passed between organizations, each organization can document only parts of the objects life cycle. As a result, interconnection of distributed provenance parts forms distributed provenance chains. Dependant on the actual provenance content, complete provenance chains can provide traceability and contribute to reproducibility and FAIRness of research objects. In this paper, we define a lightweight provenance model based on W3C PROV that enables generation of distributed provenance chains in complex, multi-organizational environments. The application of the model is demonstrated with a use case spanning several steps of a real-world research pipeline - starting with the acquisition of a specimen, its processing and storage, histological examination, and the generation/collection of associated data (images, annotations, clinical data), ending with training an AI model for the detection of tumor in the images. The proposed model has become an open conceptual foundation of the currently developed ISO 23494 standard on provenance for biotechnology domain.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9383664PMC
http://dx.doi.org/10.1038/s41597-022-01537-6DOI Listing

Publication Analysis

Top Keywords

distributed provenance
16
provenance chains
12
provenance
9
provenance model
8
model
5
lightweight distributed
4
model complex
4
complex real-world
4
real-world environments
4
environments provenance
4

Similar Publications

Effects of aluminum on metabolism of reactive oxygen species and reactive nitrogen species in root tips of different Eucalyptus species.

BMC Plant Biol

January 2025

Guangxi Colleges and Universities Key Laboratory for Cultivation and Utilization of Subtropical Forest Plantation, Guangxi Key Laboratory of Forest Ecology and Conservation, College of Forestry, Guangxi University, Nanning, 530004, China.

On acidified soil, the growth of Eucalyptus is seriously restricted by aluminum (Al) stress. Therefore, breeding Eucalyptus species with excellent Al tolerance, developing the genetic potential of species, and improving tolerance to Al stress are important for the sustainable development of artificial Eucalyptus forests. By observing the occurrence and distribution of the main reactive oxygen species (ROS) and reactive nitrogen species (RNS) in root tips of Eucalyptus seedlings under Al stress, this study analyzed change in the growth and physiological indexes of Eucalyptus seedlings under Al stress.

View Article and Find Full Text PDF

Motivation: Artificial intelligence (AI) applications require explainability (XAI) for FAIR, ethical deployment, whether in the clinic or in the laboratory. Richly descriptive XAI metadata representing how pre-model data were obtained, characterized, transformed, and distributed, should be available along with the data prior to training and application of AI models.

Results: The FAIRSCAPE framework generates, packages, and integrates critical pre-model XAI descriptive metadata, including deep provenance graphs and data dictionaries with feature validation on uploaded data, software, and computations, with special reference to biomedical datasets.

View Article and Find Full Text PDF

Considering the substantial inaccuracies inherent in the traditional manual identification of ceramic categories and the issues associated with analyzing ceramics based on chemical or spectral features, which may lead to the destruction of ceramics, this paper introduces a novel provenance classification of archaeological ceramics which relies on microscopic features and an ensemble deep learning model, overcoming the time consuming and require costly equipment limitations of current standard methods, and without compromising the structural integrity and artistic value of ceramics. The proposed model includes the following: the construction of a dataset for ancient ceramic microscopic images, image preprocessing methods based on Gamma correction and CLAHE equalization algorithms, extraction of image features based on three deep learning architectures-VGG-16, Inception-v3 and GoogLeNet, and optimal fusion. This latter is based on stochastic gradient descent (SGD) algorithm, which allows optimal fitting of the fusion model parameters by freezing and unfreezing model layers.

View Article and Find Full Text PDF

Sediment provenance of Late Carboniferous-Early Triassic in the Puyang area, Eastern North China Craton.

iScience

December 2024

State Key Laboratory of Oil and Gas Reservoir Geology and Exploitation, Chengdu University of Technology, Chengdu 610059, Sichuan, China.

To reveal the stratigraphic age of the Shiqianfeng Formation in the eastern continental basin of the North China Craton and the provenance of its sediments from the Late Carboniferous to the Early Triassic, six sandstone samples from the Puyang area were selected for zircon U-Pb dating. The result show that the Shiqianfeng Formation in the eastern North China Craton belongs to the Early Triassic. According to the age clusters of six samples, considering the regional geological setting and the distribution of zircon ages in the potential provenance area, it can be inferred that the Inner Mongolia Paleo-uplift provided continuous provenance supply for the study area during the Late Carboniferous-Early Triassic.

View Article and Find Full Text PDF

In temperate and boreal ecosystems, trees undergo dormancy to avoid cold temperatures during the unfavorable season. This phase includes changes in frost hardiness, which is minimal during the growing season and reaches its maximum in winter. Quantifying frost hardiness is important to assess the frost risk and shifts of species distribution under a changing climate.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!