The creation of big clinical data cohorts for machine learning and data analysis require a number of steps from the beginning to successful completion. Similar to data set preprocessing in other fields, there is an initial need to complete data quality evaluation; however, with large heterogeneous clinical data sets, it is important to standardize the data in order to facilitate dimensionality reduction. This is particularly important for clinical data sets including medications as a core data component due to the complexity of coded medication data. Data integration at the individual subject level is essential with medication-related machine learning applications since it can be difficult to accurately identify drug exposures, therapeutic effects, and adverse drug events without having high-quality data integration of insurance, medication, and medical data. Successful data integration and standardization efforts can substantially improve the ability to identify and replicate personalized treatment pathways to optimize drug therapy.

Download full-text PDF

Source
http://dx.doi.org/10.1007/978-1-4939-9089-4_14DOI Listing

Publication Analysis

Top Keywords

data
13
machine learning
12
clinical data
12
data integration
12
data sets
8
big data
4
data cohort
4
cohort extraction
4
extraction personalized
4
personalized statin
4

Similar Publications

Triple-negative breast cancer (TNBC) remains a significant global health challenge, emphasizing the need for precise identification of patients with specific therapeutic targets and those at high risk of metastasis. This study aimed to identify novel therapeutic targets for personalized treatment of TNBC patients by elucidating their roles in cell cycle regulation. Using weighted gene co-expression network analysis (WGCNA), we identified 83 hub genes by integrating gene expression profiles with clinical pathological grades.

View Article and Find Full Text PDF

The fungal genus Fusarium is a treasure-trove of structurally diverse secondary metabolites, contributed greatly by marine-derived strains. A new cedrane sesquiterpene, fusacedrol (1), and a new fusarin member, fusarin M (2), were isolated from F. graminearum 12Ⅱ2N that was isolated as an endophyte from the marine brown alga Sargassum sp.

View Article and Find Full Text PDF

Objectives: Cardiac biomarkers are useful for the diagnostic and prognostic assessment of myocardial injury (MI) and heart failure. By measuring specific proteins released into the bloodstream during heart stress or damage, these biomarkers help clinicians detect the presence and extent of heart injury and tailor appropriate treatment plans. This study aims to provide robust biological variation (BV) data for cardiac biomarkers in athletes, specifically focusing on those applied to detect or exclude MI, such as myoglobin, creatine kinase-myocardial band (CK-MB) and cardiac troponins (cTn), and those related to heart failure and cardiac dysfunction, brain natriuretic peptide (BNP) and N-terminal brain natriuretic pro-peptide (NT-proBNP).

View Article and Find Full Text PDF

Background: Major mutations (e.g., KRAS, GNAS, TP53, SMAD4) in pancreatic cyst fluid (PCF) are useful for classifying and risk stratifying certain cyst types, particularly in cases with nondiagnostic cytology.

View Article and Find Full Text PDF

Hepatitis B virus (HBV) infects cells by attaching to heparan sulfate proteoglycans (HSPG) and Na/taurocholate cotransporting polypeptide (NTCP). The endothelial lipase LIPG bridges HSPG and HBV, facilitating HBV attachment. From a randomized peptide expression library, we identified a short sequence binding to LIPG.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!