Software defect prediction plays a vital role in SDLC testing by identifying modules prone to defects. However, imbalanced class distributions, where defect (minority) samples are outnumbered by non-defect ones, can hinder model performance. This study investigates the impact of oversampling techniques (SMOTE, ADASYN) on Naive Bayes classification for defect prediction. While the base Naive Bayes model achieved good overall accuracy (83%), it struggled with defect class recall (30%). Applying SMOTE and ADASYN improved recall (40% and 38%, respectively) but slightly lowered accuracy (77% and 80%). Future work will explore feature selection and deep learning approaches for potentially better performance.
DHARMA, I Gusti Agung Ramananda Wira; SANTIYASA, I Wayan.
Pengaruh Penanganan Ketidakseimbangan Kelas pada Prediksi Cacat Perangkat Lunak dengan Teknik Oversampling.
Jurnal Nasional Teknologi Informasi dan Aplikasnya, [S.l.], v. 3, n. 1, p. 143-152, nov. 2024.
ISSN 3032-1948.
Available at: <http://103.29.196.112/index.php/jnatia/article/view/116083>. Date accessed: 04 mar. 2026.
doi: https://doi.org/10.24843/JNATIA.2024.v03.i01.p17.
Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.