As more of the human genome draft sequence is finished, and genomes from other organisms begin to be sequenced, the demand for accurate and reliable genome annotation will increase significantly. To facilitate this industrial-scale genome annotation, automated bioinformatics solutions are increasingly required. As a result, automatic genome annotation systems have become more important in gene discovery within recent years. The design of such large-scale bioinformatics systems is an evolving and dynamic field, based on central cores of bioinformatics software tools and relational databases. Not only must these systems efficiently manage and integrate large volumes of genomic data, but they must also deliver accurate gene predictions and effectively distribute annotation data to the biosciences community.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/s1359-6446(02)02289-4 | DOI Listing |
BioData Min
January 2025
The Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, 90069, USA.
Background: With recent advances in single cell technology, high-throughput methods provide unique insight into disease mechanisms and more importantly, cell type origin. Here, we used multi-omics data to understand how genetic variants from genome-wide association studies influence development of disease. We show in principle how to use genetic algorithms with normal, matching pairs of single-nucleus RNA- and ATAC-seq, genome annotations, and protein-protein interaction data to describe the genes and cell types collectively and their contribution to increased risk.
View Article and Find Full Text PDFBMC Cancer
January 2025
Department of Otorhinolaryngology, Shenzhen Key Laboratory of Otorhinolaryngology, Longgang Otorhinolaryngology Hospital, Shenzhen Institute of Otorhinolaryngology, No. 3004 Longgang Avenue, Shenzhen, Guangdong, China.
Background: To investigate the role of the translocase of the outer mitochondrial membrane 40 (TOM40) in oral squamous cell carcinoma (OSCC) with the aim of identifying new biomarkers or potential therapeutic targets.
Methods: TOM40 expression level in OSCC was evaluated using datasets downloaded from The Cancer Genome Atlas (TCGA), as well as clinical data. The correlation between TOM40 expression level and the clinicopathological parameters and survival were analyzed in TCGA.
Sci Rep
January 2025
Department of Biological Sciences, California State University Los Angeles, 5151 State University Dr, Los Angeles, CA, 90032, USA.
The moss Syntrichia caninervis Mitt. is distributed throughout drylands globally, and often anchors ecologically significant communities known as biological soil crusts (biocrusts). The species occupies a variety of dryland habitats with varying levels of drought and temperature stress, suggesting the potential for ecological specialization within S.
View Article and Find Full Text PDFNature
January 2025
Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA.
The human genome contains millions of candidate cis-regulatory elements (cCREs) with cell-type-specific activities that shape both health and many disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these cCREs. Here we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of more than 680,000 sequences, representing an extensive set of annotated cCREs among three cell types (HepG2, K562 and WTC11), and found that 41.
View Article and Find Full Text PDFSci Data
January 2025
Key Laboratory of Freshwater Biodiversity Conservation, Ministry of Agriculture and Rural Affairs, Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China.
Coreius guichenoti, mainly distributed in upstream regions of the Yangtze River China, is currently on the brink of extinction and listed as national secondary protected animal. In this study, we aimed to obtain the chromosome-level genome of C. guichenoti using PacBio and Hi-C techniques.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!