Glioblastoma is an incurable brain malignancy. By the time of clinical diagnosis, these tumours exhibit a degree of genetic and cellular heterogeneity that provides few clues to the mechanisms that initiate and drive gliomagenesis. Here, to explore the early steps in gliomagenesis, we utilized conditional gene deletion and lineage tracing in tumour mouse models, coupled with serial magnetic resonance imaging, to initiate and then closely track tumour formation.
View Article and Find Full Text PDFBMJ Open
November 2024
The characterization of somatic genomic variation associated with the biology of tumors is fundamental for cancer research and personalized medicine, as it guides the reliability and impact of cancer studies and genomic-based decisions in clinical oncology. However, the quality and scope of tumor genome analysis across cancer research centers and hospitals are currently highly heterogeneous, limiting the consistency of tumor diagnoses across hospitals and the possibilities of data sharing and data integration across studies. With the aim of providing users with actionable and personalized recommendations for the overall enhancement and harmonization of somatic variant identification across research and clinical environments, we have developed ONCOLINER.
View Article and Find Full Text PDFJBrowse 2 is a modular genome browser that can visualize many common genomic file formats. While JBrowse 2 supports a variety of different usages, it is particularly suited for deployment on websites, such as model organism databases or other web-based genomic data resources. This protocol provides detailed instructions for setting up JBrowse 2 on an Ubuntu Linux web server, loading a reference genome from a FASTA format file, and adding a gene annotation track from a GFF3 format file.
View Article and Find Full Text PDFWe identify a population of Protogenin-positive (PRTG) MYC NESTIN stem cells in the four-week-old human embryonic hindbrain that subsequently localizes to the ventricular zone of the rhombic lip (RL). Oncogenic transformation of early Prtg rhombic lip stem cells initiates group 3 medulloblastoma (Gr3-MB)-like tumors. PRTG stem cells grow adjacent to a human-specific interposed vascular plexus in the RL, a phenotype that is recapitulated in Gr3-MB but not in other types of medulloblastoma.
View Article and Find Full Text PDFThe COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN - VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts.
View Article and Find Full Text PDFGermline and somatic mutations can give rise to proteins with altered activity, including both gain and loss-of-function. The effects of these variants can be captured in disease-specific reactions and pathways that highlight the resulting changes to normal biology. A disease reaction is defined as an aberrant reaction in which a variant protein participates.
View Article and Find Full Text PDFWormBase has been the major repository and knowledgebase of information about the genome and genetics of Caenorhabditis elegans and other nematodes of experimental interest for over 2 decades. We have 3 goals: to keep current with the fast-paced C. elegans research, to provide better integration with other resources, and to be sustainable.
View Article and Find Full Text PDFThe lack of interoperable data standards among reference genome data-sharing platforms inhibits cross-platform analysis while increasing the risk of data provenance loss. Here, we describe the FAIR bioHeaders Reference genome (FHR), a metadata standard guided by the principles of Findability, Accessibility, Interoperability and Reuse (FAIR) in addition to the principles of Transparency, Responsibility, User focus, Sustainability and Technology. The objective of FHR is to provide an extensive set of data serialisation methods and minimum data field requirements while still maintaining extensibility, flexibility and expressivity in an increasingly decentralised genomic data ecosystem.
View Article and Find Full Text PDFAberrant expression of the E26 transformation-specific (ETS) transcription factors characterizes numerous human malignancies. Many of these proteins, including EWS:FLI1 and EWS:ERG fusions in Ewing sarcoma (EwS) and TMPRSS2:ERG in prostate cancer (PCa), drive oncogenic programs via binding to GGAA repeats. We report here that both EWS:FLI1 and ERG bind and transcriptionally activate GGAA-rich pericentromeric heterochromatin.
View Article and Find Full Text PDFMotivation: A major challenge in cancer care is that patients with similar demographics, tumor types, and medical histories can respond quite differently to the same drug regimens. This difference is largely explained by genetic and other molecular variabilities among the patients and their cancers. Efforts in the pharmacogenomics field are underway to understand better the relationship between the genome of the patient's healthy and tumor cells and their response to therapy.
View Article and Find Full Text PDFThe lack of interoperable data standards among reference genome data-sharing platforms inhibits cross-platform analysis while increasing the risk of data provenance loss. Here, we describe the FAIR-bioHeaders Reference genome (FHR), a metadata standard guided by the principles of Findability, Accessibility, Interoperability, and Reuse (FAIR) in addition to the principles of Transparency, Responsibility, User focus, Sustainability, and Technology (TRUST). The objective of FHR is to provide an extensive set of data serialisation methods and minimum data field requirements while still maintaining extensibility, flexibility, and expressivity in an increasingly decentralised genomic data ecosystem.
View Article and Find Full Text PDFAppreciating the rapid advancement and ubiquity of generative AI, particularly ChatGPT, a chatbot using large language models like GPT, we endeavour to explore the potential application of ChatGPT in the data collection and annotation stages within the Reactome curation process. This exploration aimed to create an automated or semi-automated framework to mitigate the extensive manual effort traditionally required for gathering and annotating information pertaining to biological pathways, adopting a Reactome "reaction-centric" approach. In this pilot study, we used ChatGPT/GPT4 to address gaps in the pathway annotation and enrichment in parallel with the conventional manual curation process.
View Article and Find Full Text PDFThe Reactome Knowledgebase (https://reactome.org), an Elixir and GCBR core biological data resource, provides manually curated molecular details of a broad range of normal and disease-related biological processes. Processes are annotated as an ordered network of molecular transformations in a single consistent data model.
View Article and Find Full Text PDFDisease variant annotation in the context of biological reactions and pathways can provide a standardized overview of molecular phenotypes of pathogenic mutations that is amenable to computational mining and mathematical modeling. Reactome, an open source, manually curated, peer-reviewed database of human biological pathways, provides annotations for over 4000 disease variants of close to 400 genes in the context of ∼800 disease reactions constituting ∼400 disease pathways. Functional annotation of disease variants proceeds from normal gene functions, through disease variants whose divergence from normal molecular behaviors has been experimentally verified, to extrapolation from molecular phenotypes of characterized variants to variants of unknown significance using criteria of the American College of Medical Genetics and Genomics (ACMG).
View Article and Find Full Text PDFThis article presents 14 quick tips to build a team to crowdsource data for public health advocacy. It includes tips around team building and logistics, infrastructure setup, media and industry outreach, and project wrap-up and archival for posterity.
View Article and Find Full Text PDFLimited knowledge about a substantial portion of protein coding genes, known as "dark" proteins, hinders our understanding of their functions and potential therapeutic applications. To address this, we leveraged Reactome, the most comprehensive, open source, open-access pathway knowledgebase, to contextualize dark proteins within biological pathways. By integrating multiple resources and employing a random forest classifier trained on 106 protein/gene pairwise features, we predicted functional interactions between dark proteins and Reactome-annotated proteins.
View Article and Find Full Text PDFWe present JBrowse 2, a general-purpose genome annotation browser offering enhanced visualization of complex structural variation and evolutionary relationships. It retains core features of JBrowse while adding new views for synteny, dotplots, breakpoints, gene fusions, and whole-genome overviews. It allows users to share sessions, open multiple genomes, and navigate between views.
View Article and Find Full Text PDFPathway databases provide descriptions of the roles of proteins, nucleic acids, lipids, carbohydrates, and other molecular entities within their biological cellular contexts. Pathway-centric views of these roles may allow for the discovery of unexpected functional relationships in data such as gene expression profiles and somatic mutation catalogues from tumor cells. For this reason, there is a high demand for high-quality pathway databases and their associated tools.
View Article and Find Full Text PDFMotivation: JBrowse Jupyter is a package that aims to close the gap between Python programming and genomic visualization. Web-based genome browsers are routinely used for publishing and inspecting genome annotations. Historically they have been deployed at the end of bioinformatics pipelines, typically decoupled from the analysis itself.
View Article and Find Full Text PDF