Clinical and pharmacogenomic data mining: 4. The FANO program and command set as an example of tools for biomedical discovery and evidence based medicine.

J Proteome Res

IBM Global Pharmaceutical and Life Sciences, Somers, NY 10589, USA.

Published: September 2008

The culmination of the methodology explored and developed in the preceding three papers is described in terms of the FANO program (also known as CliniMiner), and specifically in terms of its contemporary command set for data mining. This provides a more detailed account of how strategies were implemented in applications described elsewhere, in the previous papers in the series and in a paper on the analysis of 667 000 patient records. Although it is not customary to think of a command set as the output of research, it represents the elements and strategies for data mining biomedical and clinical data with many parameters, that is, in a high dimensional space that requires skilful navigation. The intent is not to promote FANO per se, but to report its science and methodologies. Typical example rules from traditional data mining are that A and B and C associate, or IF A & B THEN C. Clinical data need rules of much higher complexity, especially with the inclusion of proteomics and genomics. FANO's specific goal is to be able routinely not only to extract from clinical record repositories and other data the complex rules required for biomedical research and the clinical practice of evidence based medicine, but also to quantify their uncertainty, that is, their essentially probabilistic nature. The underlying information and number theoretic basis previously described is less of an issue here, being "under the hood", although the fundamental role and use of the Incomplete (generalized) Riemann Zeta Function as a general surprise measure is highlighted, along with its covariance or multivariance analogue, as it appears to be a unique and powerful feature. Another characteristic described is the very general tactic of the metadata operator ':='. It allows decomposition of diverse data types such as trees, spreadsheets, biosequences, sets of objects, amorphous data collections with repeating items, XML structures, and so forth into universally atomic data items with or without metadata, and assists in reconstruction of ontology from the associations and numerical correlations so data mined.
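The two ideas named above, atomic items written with ':=' and a zeta-based surprise measure, can be illustrated with a minimal Python sketch. It is not FANO's implementation or command set. All function names here (`atomize`, `incomplete_zeta`, `surprise`, `pair_surprises`) are hypothetical, and the sketch assumes that the surprise for an association is the difference between incomplete zeta partial sums taken at the observed and at the expected co-occurrence counts, with the expected count rounded to an integer for simplicity. The abstract itself does not state the formula.

```python
from collections import Counter
from itertools import combinations


def atomize(record):
    """Flatten one record into atomic items.

    A (metadata, value) tuple becomes the string 'metadata:=value';
    anything else is kept as a bare item without metadata.
    """
    items = []
    for field in record:
        if isinstance(field, tuple):
            meta, value = field
            items.append(f"{meta}:={value}")
        else:
            items.append(str(field))
    return items


def incomplete_zeta(s, n):
    """Partial sum 1 + 2**-s + ... + n**-s; returns 0 when n is 0."""
    return sum(k ** -s for k in range(1, n + 1))


def surprise(observed, expected, s=1.0):
    """Assumed surprise measure: zeta(s, observed) - zeta(s, expected).

    The expected count is rounded to an integer here, which is a
    simplification made for this sketch only.
    """
    return incomplete_zeta(s, observed) - incomplete_zeta(s, round(expected))


def pair_surprises(records, s=1.0):
    """Score every co-occurring pair of atomic items across the records."""
    n = len(records)
    singles = Counter()
    pairs = Counter()
    for record in records:
        items = sorted(set(atomize(record)))
        singles.update(items)
        pairs.update(combinations(items, 2))
    return {
        pair: surprise(count, singles[pair[0]] * singles[pair[1]] / n, s)
        for pair, count in pairs.items()
    }
```

Under these assumptions, a positive score means a pair of items co-occurs more often than independence would predict, and a negative score means less often. Extending the pair counting to triples and beyond would correspond to the higher-complexity rules the abstract calls for.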

Source
http://dx.doi.org/10.1021/pr800204f
