The rapid advancement of sequencing technologies poses challenges in managing the large volume and exponential growth of sequence data efficiently and on time. To address this issue, we present GenBase (https://ngdc.cncb.ac.cn/genbase), an open-access data repository that follows the International Nucleotide Sequence Database Collaboration (INSDC) data standards and structures, for efficient nucleotide sequence archiving, searching, and sharing. As a core resource within the National Genomics Data Center (NGDC) of the China National Center for Bioinformation (CNCB; https://ngdc.cncb.ac.cn), GenBase offers bilingual submission pipeline and services, as well as local submission assistance in China. GenBase also provides a unique Excel format for metadata description and feature annotation of nucleotide sequences, along with a real-time data validation system to streamline sequence submissions. As of April 23, 2024, GenBase received 68,251 nucleotide sequences and 689,574 annotated protein sequences across 414 species from 2319 submissions. Out of these, 63,614 (93%) nucleotide sequences and 620,640 (90%) annotated protein sequences have been released and are publicly accessible through GenBase's web search system, File Transfer Protocol (FTP), and Application Programming Interface (API). Additionally, in collaboration with INSDC, GenBase has constructed an effective data exchange mechanism with GenBank and started sharing released nucleotide sequences. Furthermore, GenBase integrates all sequences from GenBank with daily updates, demonstrating its commitment to actively contributing to global sequence data management and sharing.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11434157PMC
http://dx.doi.org/10.1093/gpbjnl/qzae047DOI Listing

Publication Analysis

Top Keywords

nucleotide sequences
16
nucleotide sequence
12
sequence database
8
sequence data
8
collaboration insdc
8
annotated protein
8
protein sequences
8
genbase
7
data
7
sequences
7

Similar Publications

Background: The endangered Kashmir musk deer (Moschus cupreus), native to high-altitude Himalayas, is an ecological significant and endangered ungulate, threatened by habitat loss and poaching for musk pod distributed in western Himalayan ranges of India, Nepal and Afghanistan. Despite its critical conservation status and ecological importance in regulating vegetation dynamics, knowledge gaps persist regarding its population structure and genetic diversity, hindering effective management strategies.

Methods And Results: We aimed to understand the population genetics of Kashmir musk deer in north-western Himalayas using two mitochondrial DNA (mtDNA) regions and 11 microsatellite loci.

View Article and Find Full Text PDF

Interleukin-10 (IL-10) is an immunomodulatory molecule that may play an immunosuppressive role in nonmelanoma skin cancer (NMSC), specifically basal cell carcinoma (BCC). We analyzed the role of IL10 promoter variants in genetic determinants of BCC susceptibility and their association with IL10 mRNA and IL-10 serum levels. Three promoter variants (- 1082 A > G, - 819 T > C, and - 592 A > C) were examined in 250 BCC patients and 250 reference group (RG) individuals.

View Article and Find Full Text PDF

An aerobic, Gram-stain-positive, motile, coccus-shaped actinomycete, designated strain LSe6-4, was isolated from leaves of sea purslane (Sesuvium portulacastrum L.) in Thailand and subjected to a polyphasic taxonomic studies. Growth of the strain occurred at temperatures between 15 and 38 °C, and with NaCl concentrations 0-13%.

View Article and Find Full Text PDF

Robust discrimination between closely related species of salmon based on DNA fragments.

Anal Bioanal Chem

January 2025

Statistical Engineering Division, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD, 20899-8980, USA.

Closely related species of Salmonidae, including Pacific and Atlantic salmon, can be distinguished from one another based on nucleotide sequences from the cytochrome c oxidase sub-unit 1 mitochondrial gene (COI), using ensembles of fragments aligned to genetic barcodes that serve as digital proxies for the relevant species. This is accomplished by exploiting both the nucleotide sequences and their quality scores recorded in a FASTQ file obtained via Next Generation (NextGen) Sequencing of mitochondrial DNA extracted from Coho salmon caught with hook and line in the Gulf of Alaska. The alignment is done using MUSCLE (Muscle 5.

View Article and Find Full Text PDF

Perceived discrimination, recognized as a chronic psychosocial stressor, has adverse consequences on health. DNA methylation (DNAm) may be a potential mechanism by which stressors get embedded into the human body at the molecular level and subsequently affect health outcomes. However, relatively little is known about the effects of perceived discrimination on DNAm.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!