Introduction: Open science initiatives have enabled sharing of large amounts of already collected data. However, significant gaps remain regarding how to find appropriate data, including underutilized data that exist in the long tail of science. We demonstrate the NeuroBridge prototype and its ability to search PubMed Central full-text papers for information relevant to neuroimaging data collected from schizophrenia and addiction studies.
View Article and Find Full Text PDFBackground: Despite the efforts of the neuroscience community, there are many published neuroimaging studies with data that are still not or . Users face significant challenges in neuroimaging data due to the lack of provenance metadata, such as experimental protocols, study instruments, and details about the study participants, which is also required for To implement the FAIR guidelines for neuroimaging data, we have developed an iterative ontology engineering process and used it to create the NeuroBridge ontology. The NeuroBridge ontology is a computable model of provenance terms to implement FAIR principles and together with an international effort to annotate full text articles with ontology terms, the ontology enables users to locate relevant neuroimaging datasets.
View Article and Find Full Text PDFScientific reproducibility that effectively leverages existing study data is critical to the advancement of research in many disciplines including neuroscience, which uses imaging and electrophysiology modalities as primary endpoints or key dependency in studies. We are developing an integrated search platform called NeuroBridge to enable researchers to search for relevant study datasets that can be used to test a hypothesis or replicate a published finding without having to perform a difficult search from scratch, including contacting individual study authors and locating the site to download the data. In this paper, we describe the development of a metadata ontology based on the World Wide Web Consortium (W3C) PROV specifications to create a corpus of semantically annotated published papers.
View Article and Find Full Text PDFThe scale of the human exposome, which covers all environmental exposures encountered from conception to death, presents major challenges in managing, sharing, and integrating a myriad of relevant data types and available data sets for the benefit of exposomics research and public health. By addressing these challenges, the exposomics research community will be able to greatly expand on its ability to aggregate study data for new discoveries, construct and update novel exposomics data sets for building artificial intelligence and machine learning-based models, rapidly survey emerging issues, and advance the application of data-driven science. The diversity of the field, which spans multiple subfields of science disciplines and different environmental contexts, necessitates adopting data federation approaches to bridge between numerous geographically and administratively separated data resources that have varying usage, privacy, access, analysis, and discoverability capabilities and constraints.
View Article and Find Full Text PDFEnvironmental factors affecting health and vulnerability far outweigh genetics in accounting for disparities in health status and longevity in US communities. The concept of the exposome, the totality of exposure from conception onwards, provides a paradigm for researchers to investigate the complex role of the environment on the health of individuals. We propose a complementary framework, community-level exposomics, for population-level exposome assessment.
View Article and Find Full Text PDFThe era of "big data" has radically altered the way scientific research is conducted and new knowledge is discovered. Indeed, the scientific method is rapidly being complemented and even replaced in some fields by data-driven approaches to knowledge discovery. This paradigm shift is sometimes referred to as the "fourth paradigm" of data-intensive and data-enabled scientific discovery.
View Article and Find Full Text PDFThrough support from the National Institutes of Health's National Center for Research Resources, the Biomedical Informatics Research Network (BIRN) is pioneering the use of advanced cyberinfrastructure for medical research. By synchronizing developments in advanced wide area networking, distributed computing, distributed database federation, and other emerging capabilities of e-science, the BIRN has created a collaborative environment that is paving the way for biomedical research and clinical information management. The BIRN Coordinating Center (BIRN-CC) is orchestrating the development and deployment of key infrastructure components for immediate and long-range support of biomedical and clinical research being pursued by domain scientists in three neuroimaging test beds.
View Article and Find Full Text PDF