Indian Journal of Science and Technology
Year: 2021, Volume: 14, Issue: 32, Pages: 2607-2615
Original Article
Karen Charly Veigas1, Durga Srilekha Regulagadda1, Sujatha Arun Kokatnoor2*
1Student, Department of Computer Science and Engineering, School of Engineering and
Technology, CHRIST (Deemed to be University), Bengaluru, 560074, India
2Assistant Professor, Department of Computer Science and Engineering, School of
Engineering and Technology, CHRIST (Deemed to be University), Bengaluru, 560074, India
*Corresponding Author
Email: [email protected]
Received Date:10 May 2021, Accepted Date:03 September 2021, Published Date:01 October 2021
Objectives: Credit fraud is a global threat to financial institutions due to specific challenges like imbalanced datasets and hidden patterns in real-life scenarios. The objective of this study is to propose a model that effectively identifies fraudulent transactions. Methods: Methods such as Synthetic Minority Oversampling Technique (SMOTE) and Generative Adversarial Networks (GAN) that artificially generate synthetic data are used in this paper to approximate the distribution of data among the two classes in the original dataset. After balancing the dataset, the individual models Multi-Layer Perceptron (MLP), k- Nearest Neighbors algorithm (kNN) and Support Vector Machine (SVM) are trained on the augmented dataset to establish an initial improvement at the data level. These base-classifiers are further incorporated into the Optimized Stacked Ensemble (OSE) learning process to fit the meta-classifier which creates an effective predictive model for fraud detection. All base-classifiers and the final Optimized Stacked Ensemble (OSE) have been implemented to critically assess and evaluate their performances. Findings: Empirical results obtained in this paper show that the quality of the final dataset is considerably improved when Synthetic Minority Oversampling Technique (SMOTE) and Generative Adversarial Networks (GAN) are used as oversampling algorithms. The Multi-Layer Perceptron model showed an increase of 10% in the F1 Score while kNN and SVM showed an increase of 3% each. The optimized model is built using a Stacking Classifier that combines the GAN-improved Multi-Perceptron Model with the other standard classification models such as KNN and SVM. This ensemble outperforms the existing enhanced Multi-Layer Perceptron with near-perfect accuracy (99.86%) and an increase of 16% in F1 Score, resulting in an effective fraud detection mechanism. Novelty: For the current dataset, the Optimized Stacked Ensemble model shows an increase of 16% in F1 Score as compared to the existing Multi-Perceptron model.
Keywords: Ensemble; Credit Card; Fraud Detection; GAN; SMOTE; MLP
© 2021 Veigas et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)
Subscribe now for latest articles and news.