ML-Based Detection of Credit Card Fraud Using Synthetic Minority Oversampling
DOI:
https://doi.org/10.69968/ijisem.2023v2i255-62Keywords:
Credit Card Fraud, Imbalanced Learning, SMOTE, XGBoost, Resampling, Supervised LearningAbstract
Credit card fraud detection is an essential and classic but very difficult problem consisting of imbalanced classification where fraud transactions are almost nonexistent as compared with legitimate ones. This paper proposes an approach which main feature is a combination of an imbalanced data technique based on the Synthetic Minority Oversampling Technique (SMOTE) with an XGBoost classifier for credit card fraud detection. We delineate the dataset preparation, feature preprocessing, SMOTE resampling, model configuration and training, and evaluation metrics (Accuracy, Precision, Recall, F1, AUC, and confusion matrix). Additionally, a comparative experiment plan is defined that includes baseline classical models (Logistic Regression, Random Forest, Artificial Neural Network) thereby allowing practitioners to perform benchmarking against other models' performances. The complete code necessary for conducting the experiments is accessible (the user-supplied Colab script was used as the foundation). The results show that if XGBoost is used in combination with careful preprocessing and SMOTE it will acquire a strong recall very important properties for fraud detection while still to an extent retaining high precision. We elaborate on the limitations (synthetic oversampling risks, concept drift) and plan ahead for the future inventiveness (cost-sensitive learning, streaming models, explainability).
References
Get persistent links for your reference list or bibliography.
Copy and paste the list, we’ll match with our metadata and return the links.
Members may also deposit reference lists here too.
[1] E. Ileberi, Y. Sun, and Z. Wang, "Performance evaluation of machine learning methods for credit card fraud detection using SMOTE and AdaBoost," IEEE Access, vol. 9, pp. 165286-165294, 2021.
https://doi.org/10.1109/ACCESS.2021.3134330
[2] Hashemi, Seyedeh Khadijeh, Seyedeh Leili Mirtaheri, and Sergio Greco. "Fraud detection in banking data by machine learning techniques." Ieee Access 11 (2022): 3034-3043.
https://doi.org/10.1109/ACCESS.2022.3232287
[3] Ileberi, Emmanuel. Improved machine learning methods for enhanced credit card fraud detection. University of Johannesburg (South Africa), 2023.
[4] Btoush, Eyad Abdel Latif Marazqah, et al. "A systematic review of literature on credit card cyber fraud detection using machine and deep learning." PeerJ Computer Science 9 (2023): e1278.
https://doi.org/10.7717/peerj-cs.1278
[5] Mahmood, Tariq, et al. "Machine Learning Techniques for Detecting Fraud in Credit Card Transactions." SEBD. 2023.
[6] Khalid, Saba. "MACHINE LEARNING FOR FRAUD DETECTION IN BANKING SYSTEMS." Computer Science Bulletin 6.02 (2023): 256-271.
[7] Chatterjee, Agniv, S. Sahand Mohammadi Ziabari, and Amr Elsherbini. Payments Fraud Detection using ML methods: Exploring Performance, Ethical and Real-World Considerations in Machine Learning-based Fraud Detection for Secure Payments. Technical Report. University of Amsterdam & Deloitte. Retrieved via ResearchGate, 2023.
[8] Ileberi, Emmanuel, Yanxia Sun, and Zenghui Wang. "A machine learning based credit card fraud detection using the GA algorithm for feature selection." Journal of Big Data 9.1 (2022): 24.
https://doi.org/10.1186/s40537-022-00573-8
[9] Ali, Abdulalem, et al. "Financial fraud detection based on machine learning: a systematic literature review." Applied Sciences 12.19 (2022): 9637.
https://doi.org/10.3390/app12199637
[10] Kaddi, Shweta S., and Malini M. Patil. "Ensemble learning based health care claim fraud detection in an imbalance data environment." Indonesian Journal of Electrical Engineering and Computer Science 32.3 (2023): 1686-1694.
https://doi.org/10.11591/ijeecs.v32.i3.pp1686-1694
[11] Mathew, DR TINA ELIZABETH. "An Ensemble Machine Learning Model for Classification of Credit Card Fradulent Transactions." Journal of Theoretical and Applied Information Technology 101.9 (2023): 3530-3546.
[12] Al-Faqir, Shumukh, and O. S. A. M. A. Ouda. "Credit card frauds scoring model based on deep learning ensemble." J. Theor. Appl. Inf. Technol 100.14 (2022): 5223-5234.
[13] Alonge, Enoch Oluwabusayo, et al. "Enhancing data security with machine learning: A study on fraud detection algorithms." Journal of Data Security and Fraud Prevention 7.2 (2021): 105-118.
https://doi.org/10.54660/.IJFMR.2021.2.1.19-31
[14] Iscan, Can, et al. "Wallet-based transaction fraud prevention through lightgbm with the focus on minimizing false alarms." IEEE Access 11 (2023): 131465-131474.
https://doi.org/10.1109/ACCESS.2023.3321666
[15] Ogundokun, Roseline Oluwaseun, et al. "Machine learning classification based techniques for fraud discovery in credit card datasets." International Conference on Applied Informatics. Cham: Springer International Publishing, 2021.
https://doi.org/10.1007/978-3-030-89654-6_3
[16] Abdel Messih, George Ibrahim. RESONANT: Reinforcement Learning Based Moving Target Defense for Detecting Credit Card Fraud. Diss. Virginia Tech, 2023.
https://doi.org/10.1145/3689935.3690395
[17] Naaz, Hena, and Tanweer Farooki. "Credit Card Fraud Detection: Survey and Discussion."
[18] Zalavadia, Jayesh N., and Jaydeep R. Ramani. "Credit Card Fraud Detection using Hybrid Machine Learning Algorithm." (2023).
[19] Kelly, Philip, et al. "Cost Efficient Machine Learning Models for Credit Card Fraud Detection." (2022).
[20] Tang, Zhixin. "Assessing the feasibility of machine learning-based modelling and prediction of credit fraud outcomes using hyperparameter tuning." Advances in Computer, Signals and Systems 7.2 (2023): 84-92.
https://doi.org/10.23977/acss.2023.070212
[21] Kelly, Philip, et al. "Optimizing Machine Learning Algorithms for Cost Effective Credit Card Fraud Detection Systems." (2022).
[22] Sultana, Shirin, Md Saifur Rahman, and Maharin Afroj. "An efficient fraud detection mechanism based on machine learning and blockchain technology." 2023 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT). IEEE, 2023.
https://doi.org/10.1109/3ICT60104.2023.10391306
[23] Fukas, Philipp, Lukas Menzel, and Oliver Thomas. "Augmenting data with generative adversarial networks to improve machine learning-based fraud detection." (2022).
[24] Karthikeyan, T., M. Govindarajan, and V. Vijayakumar. "An effective fraud detection using competitive swarm optimization based deep neural network." Measurement: Sensors 27 (2023): 100793.
https://doi.org/10.1016/j.measen.2023.100793
[25] Fisher, Raymond, et al. "Real Time Credit Card Fraud Detection Using Lightweight Machine Learning Architectures." (2023). [26] Wang, Jiao, and Norhashidah Awang. "A novel synthetic minority oversampling technique for multiclass imbalance problems."
[26] Alkhawaldeh, Ibraheem M., Ibrahem Albalkhi, and Abdulqadir Jeprel Naswhan. "Challenges and limitations of synthetic minority oversampling techniques in machine learning." World journal of methodology 13.5 (2023): 373.
https://doi.org/10.5662/wjm.v13.i5.373
[27] Nuthalapati, Aravind. "Smart fraud detection leveraging machine learning for credit card security." Educational Administration: Theory and Practice 29.2 (2023): 433-443.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Elavarasi Kesavan

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Re-users must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. This license allows for redistribution, commercial and non-commercial, as long as the original work is properly credited.





