Deep Hybrid Intelligence: CNN-LSTM for Accurate Software Bug Prediction
DOI:
https://doi.org/10.69968//ijisem.2024v3i426-33Keywords:
Software bug prediction, deep learning, computer vision, Convolutional neural networksAbstract
This research aimed to explore the potential of applying deep learning to software bug prediction. The study utilized various data preprocessing techniques that were essential in preparing the data for analysis, using a set of commonly available software bug reports and related metrics. In the data collection and preprocessing phase, the dataset was filtered to focus on critical software metrics, scaled for consistency, and additional techniques such as feature engineering and standardization were employed to enhance data variability. In order to analyze the effectiveness of the model in predicting software faults, the dataset was split so that it could be used for testing and training purposes. Several deep learning models, include CNN and LSTM architectures, were developed utilizing the preprocessed dataset in order to enhance the performance of the models. Subsequently, a hybrid ensemble technique was employed, combining the prediction outcomes of the best-performing individual models to form an ensemble model. Using test datasets, each model's performance was assessed using common assessment measures including precision, F1 score, accuracy, and recall. The ensemble models outperformed individual models in bug prediction, as demonstrated by higher accuracy and F1 scores. The final model achieved an accuracy of 96%, which was considered highly satisfactory for predicting software defects.
References
[1] Whitten, Neal, -Managing software development projects: formula for success, ‖ p. 384, 1995.
[2] Galin, Daniel, Software Quality Assurance From theory to implementation Software Quality Assurance From theory to implementation, 1st ed. Essex, England: Pearson Education Limited, 2004. [Online]. Available: www.pearsoned.co.uk
[3] Ran, Yan, Shen Xiaomei, and Xu Zhaowei. "Research and Application of Software Defect Prediction Model based on Data Mining." 2022 IEEE International Conference on Sensing, Diagnostics, Prognostics, and Control (SDPC). IEEE, 2022. https://doi.org/10.1109/SDPC55702.2022.9915822
[4] Mahfoodh, Hussain, and Qasem Obediat. "Software risk estimation through bug reports analysis and bug-fix time predictions." 2020 International Conference on Innovation and Intelligence for Informatics, Computing and Technologies (3ICT). IEEE, 2020. https://doi.org/10.1109/3ICT51146.2020.9312003
[5] Yaojing Wang, Yuan Yao, Hanghang Tong, Xuan Huo, Ming Li, Feng Xu, and Jian Lu. Enhancing supervised bag localization with metadata and stack-trace. Knowledge and Information Systems, 62:2461-2484, 2020 https://doi.org/10.1007/s10115-019-01426-2
[6] Kai Yang, Yi Cai, Ho-fung Leung, Raymond YK Lau, and Qing Li. Itwf: A framework to apply term weighting schemes in topic model. Neurocomputing, 350:248-260, 2019. https://doi.org/10.1016/j.neucom.2019.02.048
[7] Shivaji, Shivkumar, et al. "Reducing features to improve bug prediction." 2009 IEEE/ACM International Conference on Automated Software Engineering. IEEE, 2009. https://doi.org/10.1109/ASE.2009.76
[8] A. Okutan and O. T. Yildiz, "Software defect prediction using Bayesian networks," Empir. Softw. Eng., vol. 19, no. 1, pp. 154-181, 2014. https://doi.org/10.1007/s10664-012-9218-8
[9] Shailee, Nowrin Muhaimin, et al. "Software bug prediction using machine learning on jm1 dataset." 2024 International Conference on Advances in Computing, Communication, Electrical, and Smart Systems (iCACCESS). IEEE, 2024. https://doi.org/10.1109/iCACCESS61735.2024.10499572
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Nasim Uddin Ansari, Pankaj Richhariya

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Re-users must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. This license allows for redistribution, commercial and non-commercial, as long as the original work is properly credited.