Background: There is great interest in and excitement about the concept of personalized or precision medicine and, in particular, advancing this vision via various 'big data' efforts. While these methods are necessary, they are insufficient to achieve the full personalized medicine promise. A rigorous, complementary 'small data' paradigm that can function both autonomously from and in collaboration with big data is also needed. By 'small data' we build on Estrin's formulation and refer to the rigorous use of data by and for a specific N-of-1 unit (i.e., a single person, clinic, hospital, healthcare system, community, city, etc.) to facilitate improved individual-level description, prediction and, ultimately, control for that specific unit.

Main Body: The purpose of this piece is to articulate why a small data paradigm is needed and is valuable in itself, and to provide initial directions for future work that can advance study designs and data analytic techniques for a small data approach to precision health. Scientifically, the central value of a small data approach is that it can uniquely manage complex, dynamic, multi-causal, idiosyncratically manifesting phenomena, such as chronic diseases, in comparison to big data. Beyond this, a small data approach better aligns the goals of science and practice, which can result in more rapid agile learning with less data. There is also, feasibly, a unique pathway towards transportable knowledge from a small data approach, which is complementary to a big data approach. Future work should (1) further refine appropriate methods for a small data approach; (2) advance strategies for better integrating a small data approach into real-world practices; and (3) advance ways of actively integrating the strengths and limitations from both small and big data approaches into a unified scientific knowledge base that is linked via a robust science of causality.

Conclusion: Small data is valuable in its own right. That said, small and big data paradigms can and should be combined via a foundational science of causality. With these approaches combined, the vision of precision health can be achieved.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636023PMC
http://dx.doi.org/10.1186/s12916-019-1366-xDOI Listing

Publication Analysis

Top Keywords

small data
36
data approach
28
big data
20
data
16
small
11
data paradigm
8
'small data'
8
future work
8
precision health
8
small big
8

Similar Publications

Climate change impact on green spaces planning in an urban area using a hybrid approach.

Environ Sci Pollut Res Int

January 2025

Department of Geomatics Engineering, Hacettepe University, 06800, Beytepe, Ankara, Türkiye.

This study presents a hybrid methodology for planning green spaces to enhance urban sustainability and livability, evaluating the impacts of climate change on cities. Cities, once accommodating a small population, have become major centers of migration and development since the eighteenth century. Rapid urban growth intensifies infrastructure, environmental, and social challenges.

View Article and Find Full Text PDF

Purpose: The study explores the role of multimodal imaging techniques, such as [F]F-PSMA-1007 PET/CT and multiparametric MRI (mpMRI), in predicting the ISUP (International Society of Urological Pathology) grading of prostate cancer. The goal is to enhance diagnostic accuracy and improve clinical decision-making by integrating these advanced imaging modalities with clinical variables. In particular, the study investigates the application of few-shot learning to address the challenge of limited data in prostate cancer imaging, which is often a common issue in medical research.

View Article and Find Full Text PDF

Mitochondrial function is crucial for hepatic lipid metabolism. Current research identifies two types of mitochondria based on their contact with lipid droplets: peridroplet mitochondria (PDM) and cytoplasmic mitochondria (CM). This work aimed to investigate the alterations of CM and PDM in metabolic dysfunction-associated steatotic liver disease (MASLD) induced by spontaneous type-2 diabetes mellitus (T2DM) in db/db mice.

View Article and Find Full Text PDF

In cybersecurity, anomaly detection in tabular data is essential for ensuring information security. While traditional machine learning and deep learning methods have shown some success, they continue to face significant challenges in terms of generalization. To address these limitations, this paper presents an innovative method for tabular data anomaly detection based on large language models, called "Tabular Anomaly Detection via Guided Prompts" (TAD-GP).

View Article and Find Full Text PDF

TP53 mutations are recognized to correlate with a worse prognosis in individuals with non-small cell lung cancer (NSCLC). There exists an immediate necessity to pinpoint selective treatment for patients carrying TP53 mutations. Potential drugs were identified by comparing drug sensitivity differences, represented by the half-maximal inhibitory concentration (IC50), between TP53 mutant and wild-type NSCLC cell lines using database analysis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!