scholarly journals Credit Card Fraud Detection Using Random Forest and Local Outlier Factor

Author(s):  
Abhilasha Kulkarni
Author(s):  
Shashank Singh and Meenu Garg

It is essential that Visa organizations can distinguish false Mastercard exchanges so clients are not charged for things that they didn't buy. Such issues can be handled with Data Science and its significance, alongside Machine Learning, couldn't be more important. This undertaking expects to outline the demonstrating of an informational collection utilizing AI with Credit Card Fraud Detection. The Credit Card Fraud Detection Problem incorporates demonstrating past Visa exchanges with the information of the ones that ended up being extortion. This model is then used to perceive if another exchange is fake. Our target here is to identify 100% of the fake exchanges while limiting the off base misrepresentation arrangements. Charge card Fraud Detection is an average example of arrangement. In this cycle, we have zeroed in on examining and pre- preparing informational indexes just as the sending of numerous irregularity discovery calculations, for example, Local Outlier Factor and Isolation Forest calculation on the PCA changed Credit Card Transaction


2021 ◽  
Vol 5 (1) ◽  
pp. 56
Author(s):  
Giulia Moschini ◽  
Régis Houssou ◽  
Jérôme Bovay ◽  
Stephan Robert-Nicoud

This paper addresses the problem of the unsupervised approach of credit card fraud detection in unbalanced datasets using the ARIMA model. The ARIMA model is fitted to the regular spending behaviour of the customer and is used to detect fraud if some deviations or discrepancies appear. Our model is applied to credit card datasets and is compared to four anomaly detection approaches, namely, the K-means, box plot, local outlier factor and isolation forest approaches. The results show that the ARIMA model presents better detecting power than that of the benchmark models.


Author(s):  
Upasana Mukherjee ◽  
Vandana Thakkar ◽  
Shawni Dutta ◽  
Utsab Mukherjee ◽  
Samir Kumar Bandyopadhyay

The growth of regularly generated data from many financial activities has significant implications for every corner of financial modelling. This study has investigated the utilization of these continuous growing data by a means of an automated process. The automated process can be developed by using Machine learning based techniques that analyze the data and gain experience from the underlying data. Different important domains of financial fields such as Credit card fraud detection, bankruptcy detection, loan default prediction, investment prediction, marketing and many more can be modelled by implementing machine learning methods. Among several machine learning based techniques, the use of parametric and non-parametric based methods are approached by this research. Two parametric models namely Logistic Regression, Gaussian Naive Bayes models and two non-parametric methods such as Random Forest, Decision Tree are implemented in this paper. All the mentioned models are developed and implemented in the field of Credit card fraud detection, bankruptcy detection, loan default prediction. In each of the aforementioned cases, the comparative study among the classification techniques is drawn and the best model is identified. The performance of each classifier on each considered domain is evaluated by various performance metrics such as accuracy, F1-score and mean squared error. In the credit card fraud detection model the decision tree classifier performs the best with an accuracy of 99.1% and, in the loan default prediction and bankruptcy detection model, the random forest classifier gives the best accuracy of  97% and 96.84% respectively.


2021 ◽  
Vol 23 (06) ◽  
pp. 318-344
Author(s):  
Amit Pundir ◽  
◽  
Rajesh Pandey ◽  

Misrepresentation of money is a developing issue in monetary business with far-reaching consequences and keeping in mind that many processes have been found. Data quality management with data mining has been effectively applied to data sets to mechanize the investigation of massive amounts of complex information. Data mining has likewise played a notable role in identifying credit card fraud in online exchanges. Fraud detection in credit cards is a data quality management issue that considered under data mining, tested for two important reasons — first, the profiles of ordinary and false practices habitually change, and also because of the explanation that charge card fraud information is exceptionally slow. This research paper examines the performance of Decision Trees, Logistics Regression, and Random Forest rely strategically on profoundly skewed credit card fraud data. The dataset of credit card transactions is sourced from Kaggle (a publically accessible dataset repository) with 284,807 transactions. These methods are applied to raw data values and data preprocessing techniques. Assessment of the performance of techniques depends on accuracy, sensitivity, specificity, precision, and recall. Results indicate the optimal accuracy for the decision trees, logistics regression, and random forest classifiers with 90.8%, 98.5%, and 99.1% respectively.


Sign in / Sign up

Export Citation Format

Share Document