A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3122
Function: getPubMedXML

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

A resource for automated search and collation of geochemical datasets from journal supplements. | LitMetric

A resource for automated search and collation of geochemical datasets from journal supplements.

Sci Data

School of Earth, Atmosphere and Environment, Monash University, Clayton, Victoria, 3800, Australia.

Published: November 2022

AI Article Synopsis

  • The article introduces a web scraping resource that automates the search, extraction, and collation of geochemical and geochronological data from the Figshare repository.
  • Researchers can use this tool to efficiently update and curate their own databases, addressing the challenge of quickly outdated global geochemical datasets.
  • An example demonstrates the tool’s capability by compiling a zircon geochronology and chemistry database with over 150,000 analyses, supporting data sharing and reuse within the scientific community.

Article Abstract

This article presents a resource for automated search, extraction and collation of geochemical and geochronological data from the Figshare repository using web scraping code. To answer fundamental questions about the Earth's evolution, such as spatial and temporal evolution and interrelationships between the planet's solid and surficial reservoirs, researchers must utilize global geochemical datasets. Due to the volume of data being published, these datasets become quickly outdated. We present a resource that allows researchers to rapidly curate and update their own databases from existing published data. We use open-source Python code to web scrape the Figshare repository for journal supplementary files using the application programming interface, allowing for the collection and download of hundreds of supplementary files and metadata in minutes. Use of this web scraping tool is demonstrated here by collation of a zircon geochronology and chemistry database of >150,000 analyses. The database is consistent in reproducing trends in other published zircon compilations. Providing a resource for automated collection of Figshare data files will encourage data sharing and reuse.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9700723PMC
http://dx.doi.org/10.1038/s41597-022-01730-7DOI Listing

Publication Analysis

Top Keywords

resource automated
12
automated search
8
collation geochemical
8
geochemical datasets
8
figshare repository
8
web scraping
8
supplementary files
8
data
5
resource
4
search collation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!