As DNA sequencing has become more popular, the public genetic repositories where sequences are archived have experienced explosive growth. These repositories now hold invaluable collections of sequences, e.g., for microbial ecology, but whether these data are reusable has not been evaluated. We assessed the availability and state of 16S rRNA gene amplicon sequences archived in public genetic repositories (SRA, EBI, and DDBJ). We screened 26,927 publications in 17 microbiology journals, identifying 2,015 16S rRNA gene sequencing studies. Of these, 7.2% had not made their data public at the time of analysis. Among a subset of 635 studies sequencing the same gene region, 40.3% contained data that were not available or not reusable, and an additional 25.5% contained faults in data formatting or data labeling, creating obstacles for data reuse. Our study reveals gaps in data availability, identifies major contributors to data loss, and offers suggestions for improving data archiving practices.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7455719
DOI: http://dx.doi.org/10.1038/s42003-020-01204-9
