The complexity of breast cancer biology makes it challenging to analyze large datasets of clinicopathologic and molecular attributes, toward identifying the key prognostic features and producing systems capable of predicting which patients are likely to relapse. We applied machine-learning techniques to analyze a set of well-characterized primary breast cancers, which specified the abundance and localization of various junctional proteins. We hypothesized that disruption of junctional complexes would lead to the cytoplasmic/nuclear redistribution of the protein components and their potential interactions with growth-regulating molecules, which would promote relapse, and that machine-learning techniques could use the subcellular locations of these proteins, together with standard clinicopathological data, to produce an efficient prognostic classifier.
View Article and Find Full Text PDF