In recent years, due to an increase in the incidence of different cancers, various data sources are available in this field. Consequently, many researchers have become interested in the discovery of useful knowledge from available data to assist faster decision-making by doctors and reduce the negative consequences of such diseases. Data mining includes a set of useful techniques in the discovery of knowledge from the data: detecting hidden patterns and finding unknown relations. However, these techniques face several challenges with real-world data. Particularly, dealing with inconsistencies, errors, noise, and missing values requires appropriate preprocessing and data preparation procedures. In this article, we investigate the impact of preprocessing to provide high-quality data for classification techniques. A wide range of preprocessing and data preparation methods are studied, and a set of preprocessing steps was leveraged to obtain appropriate classification results. The preprocessing is done on a real-world breast cancer dataset of the Reza Radiation Oncology Center in Mashhad with various features and a great percentage of null values, and the results are reported in this article. To evaluate the impact of the preprocessing steps on the results of classification algorithms, this case study was divided into the following 3 experiments: Breast cancer recurrence prediction without data preprocessing Breast cancer recurrence prediction by error removal Breast cancer recurrence prediction by error removal and filling null values Then, in each experiment, dimensionality reduction techniques are used to select a suitable subset of features for the problem at hand. Breast cancer recurrence prediction models are constructed using the 3 widely used classification algorithms, namely, naïve Bayes, -nearest neighbor, and sequential minimal optimization. The evaluation of the experiments is done in terms of accuracy, sensitivity, F-measure, precision, and G-mean measures. Our results show that recurrence prediction is significantly improved after data preprocessing, especially in terms of sensitivity, F-measure, precision, and G-mean measures.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7262833 | PMC |
http://dx.doi.org/10.1177/1176935120917955 | DOI Listing |
Jpn J Clin Oncol
January 2025
Division of Molecular and Cellular Oncology, Miyagi Cancer Center Research Institute, 47-1 Nodayama, Medeshima-Shiode, Natori, Miyagi 981-1293, Japan.
A Japanese woman with Li-Fraumeni syndrome in her 40s underwent comprehensive genetic profiling accompanied by germline data using the Oncoguide NCC Oncopanel, but no germline pathogenic variants in the tumor suppressor gene TP53 were detected. However, careful examination of additional data in the report suggested the presence of a large TP53 deletion. Custom targeting next-generation sequencing and nanopore sequencing revealed a 3.
View Article and Find Full Text PDFInt J Radiat Oncol Biol Phys
January 2025
Providence Swedish Cancer Institute, Seattle, Washington.
Purpose: Standard therapy for breast cancer after breast-conserving surgery is radiation therapy (RT) plus hormone therapy (HT). For patients with a low-risk of recurrence, there is an interest in deescalating therapy.
Methods And Materials: A retrospective study was carried out for patients treated at the Swedish Cancer Institute from 2000 to 2015, aged 70 years or older, with pT1N0 or pT1NX estrogen receptor-positive and ERBB2-negative unifocal breast cancer without positive surgical margins, high nuclear grade, or lymphovascular invasion.
Acta Oncol
January 2025
Psychological Aspects of Cancer, Cancer Survivorship, The Danish Cancer Institute, Copenhagen, Denmark.
Introduction: To target psychological support to cancer patients most in need of support, screening for psychological distress has been advocated and, in some settings, also implemented. Still, no prior studies have examined the appropriate 'dosage' and whether screening for distress before cancer treatment may be sufficient or if further screenings during treatment are necessary. We examined the development in symptom trajectories for breast cancer patients with low distress before surgery and explored potential risk factors for developing burdensome symptoms at a later point in time.
View Article and Find Full Text PDFCell Mol Biol Lett
January 2025
Department of Pharmacy, Health and Nutritional Sciences, University of Calabria, Arcavacata Di Rende, 87036, Cosenza, Italy.
Breast cancer is the most commonly diagnosed type of cancer and the leading cause of cancer-related death in women worldwide. Highly targeted therapies have been developed for different subtypes of breast cancer, including hormone receptor (HR)-positive and human epidermal growth factor receptor 2 (HER2)-positive breast cancer. However, triple-negative breast cancer (TNBC) and metastatic breast cancer disease are primarily treated with chemotherapy, which improves disease-free and overall survival, but does not offer a curative solution for these aggressive forms of breast cancer.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Genetics, The University of Alabama at Birmingham, Birmingham, AL, USA.
Nowadays, chemotherapy and immunotherapy remain the major treatment strategies for Triple-Negative Breast Cancer (TNBC). Identifying biomarkers to pre-select and subclassify TNBC patients with distinct chemotherapy responses is essential. In the current study, we performed an unbiased Reverse Phase Protein Array (RPPA) on TNBC cells treated with chemotherapy compounds and found a leading significant increase of phosphor-AURKA/B/C, AURKA, AURKB, and PLK1, which fall into the mitotic kinase group.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!