The Solve-RD project brings together clinicians, scientists, and patient representatives from 51 institutes spanning 15 countries to collaborate on genetically diagnosing ("solving") rare diseases (RDs). The project aims to significantly increase the diagnostic success rate by co-analyzing data from thousands of RD cases, including phenotypes, pedigrees, exome/genome sequencing, and multiomics data. Here we report on the data infrastructure devised and created to support this co-analysis.
View Article and Find Full Text PDFImproving patient care and advancing scientific discovery requires responsible sharing of research data, healthcare records, biosamples, and biomedical resources that must also respect applicable use conditions. Defining a standard to structure and manage these use conditions is a complex and challenging task. This is exemplified by a near unlimited range of asset types, a high variability of applicable conditions, and differing applications at the individual or collective level.
View Article and Find Full Text PDFMyriad policy, ethical and legal considerations underpin the sharing of biological resources, implying the need for standardised and yet flexible ways to digitally represent diverse 'use conditions'. We report a core lexicon of terms that are atomic, non-directional 'concepts of use', called Common Conditions of use Elements. This work engaged biobanks and registries relevant to the European Joint Programme for Rare Diseases and aimed to produce a lexicon that would have generalised utility.
View Article and Find Full Text PDFEfficient data sharing is hampered by an array of organizational, ethical, behavioral, and technical challenges, slowing research progress and reducing the utility of data generated by clinical research studies on neurodegenerative diseases. There is a particular need to address differences between public and private sector environments for research and data sharing, which have varying standards, expectations, motivations, and interests. The Neuronet data sharing Working Group was set up to understand the existing barriers to data sharing in public-private partnership projects, and to provide guidance to overcome these barriers, by convening data sharing experts from diverse projects in the IMI neurodegeneration portfolio.
View Article and Find Full Text PDFBiomarker discovery, development, and validation are reliant on large-scale analyses of high-quality samples and data. Currently, significant quantities of data and samples have been generated by European studies on Alzheimer's disease (AD) and other neurodegenerative diseases (NDD), representing a valuable resource for developing biomarkers to support early detection of disease, treatment monitoring, and patient stratification. However, discovery of, access to, and sharing of data and samples from AD and NDD research are hindered both by silos that limit collaboration, and by the array of complex requirements for secure, legal, and ethical sharing.
View Article and Find Full Text PDFThe GWAS Central resource gathers and curates extensive summary-level genome-wide association study (GWAS) data and puts a range of user-friendly but powerful website tools for the comparison and visualisation of GWAS data at the fingertips of researchers. Through our continued efforts to harmonise and import data received from GWAS authors and consortia, and data sets actively collected from public sources, the database now contains over 72.5 million P-values for over 5000 studies testing over 7.
View Article and Find Full Text PDFBeacon is a basic data discovery protocol issued by the Global Alliance for Genomics and Health (GA4GH). The main goal addressed by version 1 of the Beacon protocol was to test the feasibility of broadly sharing human genomic data, through providing simple "yes" or "no" responses to queries about the presence of a given variant in datasets hosted by Beacon providers. The popularity of this concept has fostered the design of a version 2, that better serves real-world requirements and addresses the needs of clinical genomics research and healthcare, as assessed by several contributing projects and organizations.
View Article and Find Full Text PDFThe Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution.
View Article and Find Full Text PDFHuman biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard.
View Article and Find Full Text PDFFor the first time in Europe hundreds of rare disease (RD) experts team up to actively share and jointly analyse existing patient's data. Solve-RD is a Horizon 2020-supported EU flagship project bringing together >300 clinicians, scientists, and patient representatives of 51 sites from 15 countries. Solve-RD is built upon a core group of four European Reference Networks (ERNs; ERN-ITHACA, ERN-RND, ERN-Euro NMD, ERN-GENTURIS) which annually see more than 270,000 RD patients with respective pathologies.
View Article and Find Full Text PDFWe report proof-of-principle experiments regarding a dynamic microarray protocol enabling accurate and semi-quantitative DNA analysis for re-sequencing, fingerprinting and genotyping. Single-stranded target molecules hybridise to surface-bound probes during initial gradual cooling with high-fidelity. Real-time tracking of target denaturation (via fluorescence) during a 'dynamic' gradual heating phase permits 'melt-curve' analysis.
View Article and Find Full Text PDFIntroduction: Multimorbidity is widely recognised as the presence of two or more concurrent long-term conditions, yet remains a poorly understood global issue despite increasing in prevalence.We have created the Wales Multimorbidity e-Cohort (WMC) to provide an accessible research ready data asset to further the understanding of multimorbidity. Our objectives are to create a platform to support research which would help to understand prevalence, trajectories and determinants in multimorbidity, characterise clusters that lead to highest burden on individuals and healthcare services, and evaluate and provide new multimorbidity phenotypes and algorithms to the National Health Service and research communities to support prevention, healthcare planning and the management of individuals with multimorbidity.
View Article and Find Full Text PDFBackground: Airway bacterial dysbiosis is a feature of chronic obstructive pulmonary disease (COPD). However, there is limited comparative data of the lung microbiome between healthy smokers, non-smokers and COPD.
Methods: We compared the 16S rRNA gene-based sputum microbiome generated from pair-ended Illumina sequencing of 124 healthy subjects (28 smokers and 96 non-smokers with normal lung function), with single stable samples from 218 COPD subjects collected from three UK clinical centres as part of the COPDMAP consortium.
The GWAS Central resource provides a toolkit for integrative access and visualization of a uniquely extensive collection of genome-wide association study data, while ensuring safe open access to prevent research participant identification. GWAS Central is the world's most comprehensive openly accessible repository of summary-level GWAS association information, providing over 70 million P-values for over 3800 studies investigating over 1400 unique phenotypes. The database content comprises direct submissions received from GWAS authors and consortia, in addition to actively gathered data sets from various public sources.
View Article and Find Full Text PDFIn the version of this article initially published, Lena Dolman's second affiliation was given as Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK. The correct second affiliation is Ontario Institute for Cancer Research, Toronto, Ontario, Canada. The error has been corrected in the HTML and PDF versions of the article.
View Article and Find Full Text PDFEur J Hum Genet
December 2018
The Global Alliance for Genomics and Health (GA4GH) proposes a data access policy model-"registered access"-to increase and improve access to data requiring an agreement to basic terms and conditions, such as the use of DNA sequence and health data in research. A registered access policy would enable a range of categories of users to gain access, starting with researchers and clinical care professionals. It would also facilitate general use and reuse of data but within the bounds of consent restrictions and other ethical obligations.
View Article and Find Full Text PDFGiven the data-rich nature of modern biomedical research, there is a pressing need for a systematic, structured, computer-readable way to capture, communicate, and manage sharing rules that apply to biomedical resources. This is essential for responsible recording, versioning, communication, querying, and actioning of resource sharing plans. However, lack of a common "information model" for rules and conditions that govern the sharing of materials, methods, software, data, and knowledge creates a fundamental barrier.
View Article and Find Full Text PDFIntroduction: It is a challenge to find participants for Alzheimer's disease (AD) prevention trials within a short period of time. The European Prevention of Alzheimer's Dementia Registry (EPAD) aims to facilitate recruitment by preselecting subjects from ongoing cohort studies. This article introduces this novel approach.
View Article and Find Full Text PDF