scholarly journals Credit Card Fraud Detection and Prevention using Machine Learning

This research focused mainly on detecting credit card fraud in real world. We must collect the credit card data sets initially for qualified data set. Then provide queries on the user's credit card to test the data set. After random forest algorithm classification method using the already evaluated data set and providing current data set[1]. Finally, the accuracy of the results data is optimised. Then the processing of a number of attributes will be implemented, so that affecting fraud detection can be found in viewing the representation of the graphical model. The techniques efficiency is measured based on accuracy, flexibility, and specificity, precision. The results obtained with the use of the Random Forest Algorithm have proved much more effective

Author(s):  
Prof. Teena Varma ◽  
Mahesh Poojari ◽  
Jobin Joseph ◽  
Ainsley Cardozo

In today's economy, credit card (CC) plays a major role. It is an inevitable part of a household, business & global business. While using CCs can offer huge advantages if used cautiously and safely, significant credit & financial damage can be incurred by fraudulent activity. Several methods to deal with the rising credit card fraud (CCF) have been suggested. Both such strategies, though, are meant to prevent CCFs; each of them has its own drawbacks, benefits, and functions. CCF has become a significant global concern because of the huge growth of e-commerce and the proliferation of payment online. Machine learning (ML) algo as a data mining technology (DM) was recently very involved in the detection of CCF. There are however several challenges, including the absence of publicly available data sets, high unbalanced size, and different confusing behavior. In this paper, we discuss the state of the art in credit card fraud detection (CCFD), dataset and assessment standards after analyzing issues with the CCFD. Dataset is publicly available in the CCFD data set used in experiments. Here, we compare two ML algos of performance: Logistic Regression (LR) and XGBoost in detecting CCF Transactions Real Life Data. XGBoosthas an inherent ability to handle missing values. When XGBoost encounters node at lost value, it tries to split left & right hands & learn all ways to the highest loss. This is when the test runs on the data. The experimental results show an effective use of the XGBoost classifier. Technique of performance is widely accepted metric based on exclusion: accuracy & recall. Also, the comparison between both approaches displayed based on the ROC curve


Author(s):  
Aman .

It is important that companies are able to identify fraudulent credit card transactions so that customers are not charged for items that they did not purchase. These problems can be handled with Data Science and its importance, along with Machine Learning. This project aim is to illustrate the modelling of a data set using machine learning with Credit Card. Our objective is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications. Credit Card Fraud Detection is a sample of classification. In this process, we have focused on analysing and pre-processing data sets as well as the deployment of multiple anomaly detection algorithms such as Local Outlier Factor and Isolation Forest algorithm on the PCA transformed Credit Card Transaction data.


Author(s):  
Upasana Mukherjee ◽  
Vandana Thakkar ◽  
Shawni Dutta ◽  
Utsab Mukherjee ◽  
Samir Kumar Bandyopadhyay

The growth of regularly generated data from many financial activities has significant implications for every corner of financial modelling. This study has investigated the utilization of these continuous growing data by a means of an automated process. The automated process can be developed by using Machine learning based techniques that analyze the data and gain experience from the underlying data. Different important domains of financial fields such as Credit card fraud detection, bankruptcy detection, loan default prediction, investment prediction, marketing and many more can be modelled by implementing machine learning methods. Among several machine learning based techniques, the use of parametric and non-parametric based methods are approached by this research. Two parametric models namely Logistic Regression, Gaussian Naive Bayes models and two non-parametric methods such as Random Forest, Decision Tree are implemented in this paper. All the mentioned models are developed and implemented in the field of Credit card fraud detection, bankruptcy detection, loan default prediction. In each of the aforementioned cases, the comparative study among the classification techniques is drawn and the best model is identified. The performance of each classifier on each considered domain is evaluated by various performance metrics such as accuracy, F1-score and mean squared error. In the credit card fraud detection model the decision tree classifier performs the best with an accuracy of 99.1% and, in the loan default prediction and bankruptcy detection model, the random forest classifier gives the best accuracy of  97% and 96.84% respectively.


2021 ◽  
Vol 23 (06) ◽  
pp. 318-344
Author(s):  
Amit Pundir ◽  
◽  
Rajesh Pandey ◽  

Misrepresentation of money is a developing issue in monetary business with far-reaching consequences and keeping in mind that many processes have been found. Data quality management with data mining has been effectively applied to data sets to mechanize the investigation of massive amounts of complex information. Data mining has likewise played a notable role in identifying credit card fraud in online exchanges. Fraud detection in credit cards is a data quality management issue that considered under data mining, tested for two important reasons — first, the profiles of ordinary and false practices habitually change, and also because of the explanation that charge card fraud information is exceptionally slow. This research paper examines the performance of Decision Trees, Logistics Regression, and Random Forest rely strategically on profoundly skewed credit card fraud data. The dataset of credit card transactions is sourced from Kaggle (a publically accessible dataset repository) with 284,807 transactions. These methods are applied to raw data values and data preprocessing techniques. Assessment of the performance of techniques depends on accuracy, sensitivity, specificity, precision, and recall. Results indicate the optimal accuracy for the decision trees, logistics regression, and random forest classifiers with 90.8%, 98.5%, and 99.1% respectively.


Sign in / Sign up

Export Citation Format

Share Document