Cancer is one of the leading causes of death worldwide and can be caused by environmental factors (for example, exposure to asbestos), by human behavior (such as smoking), or by genetic factors. To understand which genes might be involved in patients' survival, researchers have developed prognostic genetic signatures: lists of genes that can be used in scientific analyses to predict whether a patient will survive. In this study, we merged five prognostic signatures, each related to a specific cancer type, into a single pan-cancer prognostic signature that contains 207 unique probesets related to 187 unique gene symbols, with one probeset (203072_at, related to the MYO1E gene) present in two cancer type-specific signatures.
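The merging step described above can be illustrated with a minimal sketch. It assumes that each signature is available as a plain list of probeset identifiers. The signature names and the `probe_*` identifiers below are hypothetical placeholders, not data from the study; only 203072_at (MYO1E) is taken from the text, and which two signatures contain it is an assumption made for illustration.

```python
from collections import Counter

# Toy input: placeholder names and probesets (hypothetical), except 203072_at,
# which the abstract reports as shared by two cancer type-specific signatures.
signatures = {
    "cancer_type_A": ["probe_A1", "probe_A2", "203072_at"],
    "cancer_type_B": ["probe_B1", "203072_at"],
    "cancer_type_C": ["probe_C1", "probe_C2"],
}


def merge_signatures(signatures):
    """Return (unique probesets, probesets found in more than one signature)."""
    counts = Counter()
    for probesets in signatures.values():
        # set() ensures a probeset is counted once per signature
        counts.update(set(probesets))
    merged = sorted(counts)
    shared = sorted(p for p, n in counts.items() if n > 1)
    return merged, shared


merged, shared = merge_signatures(signatures)
print(len(merged), "unique probesets; shared:", shared)
```

Counting each probeset once per signature makes it straightforward to report both the size of the merged list and any probesets that overlap between signatures.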
Summary: Having multiple datasets is a key aspect of robust bioinformatics analyses, because it allows researchers to confirm discoveries across multiple cohorts. For this purpose, Gene Expression Omnibus (GEO) can be a useful database, since it provides hundreds of thousands of microarray gene expression datasets freely available for download and use. Despite this availability, collecting prognostic datasets of a specific cancer type from GEO can be a long and time-consuming task for any bioinformatician, who must do it manually by first searching the GEO website and then checking the datasets found one by one.