Many prediction models and approaches have been introduced over the past decades that try to forecast buggy code elements based on static source code metrics, change and history metrics, or both. However, there is still no universally best solution to this problem, as the most suitable features and models vary from dataset to dataset and depend on the context in which they are used. Therefore, novel approaches and further studies on this topic are needed. In this paper, we employ a chemometric approach, Partial Least Squares with Discriminant Analysis (PLS-DA), for predicting bug-prone classes in Java programs using static source code metrics. PLS-DA has been applied successfully within the field of chemometrics, but to the best of our knowledge, it has never been used before in the software maintenance domain for predicting bugs. In addition, we use rigorous statistical treatment and evaluation to present the software engineering results. We show that our PLS-DA based prediction model achieves superior performance compared to the state-of-the-art approaches (i.e., an F-measure of 0.44-0.47 at the 90% confidence level) when no data re-sampling is applied, and performance comparable to the others when up-sampling is applied, on the largest open bug dataset. Training the model is also significantly faster, which makes finding optimal parameters much easier. In terms of completeness, which measures the share of bugs contained in the Java classes predicted to be defective, PLS-DA outperforms every other algorithm: it found 69.3% and 79.4% of the total bugs with no re-sampling and with up-sampling, respectively.
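The two evaluation measures named in the abstract can be illustrated with a small sketch. It assumes the usual definition of the F-measure (harmonic mean of precision and recall on the defective class) and reads completeness as the fraction of all known bugs that fall inside classes predicted to be defective; the abstract does not give the exact formulas, so both are assumptions. The function names and the toy data are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def f_measure(actual: Sequence[bool], predicted: Sequence[bool]) -> float:
    """Harmonic mean of precision and recall for the 'defective' label."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a and p)        # defective, flagged
    fp = sum(1 for a, p in pairs if not a and p)    # clean, flagged
    fn = sum(1 for a, p in pairs if a and not p)    # defective, missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def completeness(bug_counts: Sequence[int], predicted: Sequence[bool]) -> float:
    """Share of all bugs located in classes predicted to be defective."""
    total = sum(bug_counts)
    if total == 0:
        return 0.0
    found = sum(n for n, p in zip(bug_counts, predicted) if p)
    return found / total


# Hypothetical toy data: number of known bugs per Java class,
# and whether a model flagged each class as defective.
bug_counts = [3, 0, 1, 0, 2, 0]
predicted = [True, False, False, True, True, False]
actual = [n > 0 for n in bug_counts]

print(f"F-measure:    {f_measure(actual, predicted):.3f}")
print(f"Completeness: {completeness(bug_counts, predicted):.3f}")
```

Under this reading, completeness weights each class by how many bugs it contains, so it can be high even when the F-measure is modest: a model that catches the few classes holding most of the bugs scores well on completeness regardless of how many single-bug classes it misses.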

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11334662
DOI: http://dx.doi.org/10.1016/j.heliyon.2024.e35045
