Recommender System for Term Deposit Likelihood Prediction Using Cross-Validated Neural Network

Author(s):  
Shawni Dutta ◽  
Samir Kumar Bandyopadhyay

Term deposits can boost the finance sector while maximizing profit from both the bank's and the customer's perspective. This paper focuses on the likelihood that a customer will subscribe to a term deposit. Bank campaign efforts and customer details are influential factors when assessing the chances of a subscription. This paper presents an automated system that predicts term deposit investment possibilities in advance. A neural network (NN) combined with stratified 10-fold cross-validation is proposed as the predictive model and is compared with benchmark classifiers such as the k-Nearest Neighbor (k-NN), Decision Tree (DT), and Multi-layer Perceptron (MLP) classifiers. The experimental study concludes that the proposed model delivers better prediction results than the baseline models, with an accuracy of 88.32% and a Mean Squared Error (MSE) of 0.1168.
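
The evaluation protocol described here, a neural-network classifier scored with stratified 10-fold cross-validation on accuracy and MSE, can be sketched as follows. This is an illustrative outline only; the choice of scikit-learn, the network size, and the preprocessing are assumptions rather than the authors' exact pipeline.

```python
# Sketch of stratified 10-fold cross-validation for a neural-network classifier,
# reporting mean accuracy and mean squared error. Illustrative assumptions only.
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.neural_network import MLPClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.metrics import accuracy_score, mean_squared_error

def evaluate_term_deposit_model(X, y, n_splits=10, seed=42):
    """X: numeric feature matrix (NumPy array); y: binary 'subscribed' label."""
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    accs, mses = [], []
    for train_idx, test_idx in skf.split(X, y):
        model = make_pipeline(StandardScaler(),
                              MLPClassifier(hidden_layer_sizes=(64, 32),
                                            max_iter=500, random_state=seed))
        model.fit(X[train_idx], y[train_idx])
        pred = model.predict(X[test_idx])
        accs.append(accuracy_score(y[test_idx], pred))
        mses.append(mean_squared_error(y[test_idx], pred))
    return np.mean(accs), np.mean(mses)
```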


Author(s):  
Shawni Dutta ◽  
Samir Bandyopadhyay

Banks normally offer two kinds of deposit accounts: deposits such as current/savings accounts and term deposits such as fixed or recurring deposits. Term deposits can uplift the finance sector while maximizing profit from both the bank's and the customer's perspective. This paper focuses on the likelihood that a customer will subscribe to a term deposit. Bank campaign efforts and analysis of customer details can influence the chances of a subscription. This paper presents an automated system that predicts term deposit investment possibilities in advance. It proposes a deep-learning-based hybrid model that stacks convolutional layers and Recurrent Neural Network (RNN) layers as the predictive model; a Gated Recurrent Unit (GRU) is employed as the RNN. The proposed predictive model is compared with benchmark classifiers such as the k-Nearest Neighbor (k-NN), Decision Tree (DT), and Multi-layer Perceptron (MLP) classifiers. The experimental study concludes that the proposed model attains an accuracy of 89.59% and an MSE of 0.1041, outperforming the baseline models.
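
A hybrid network that stacks convolutional layers with a GRU layer, as described above, might be assembled along the following lines. The Keras framework, the treatment of the tabular customer features as a one-dimensional sequence, and all layer sizes are assumptions for illustration, not the authors' published architecture.

```python
# Illustrative Conv1D + GRU hybrid classifier for a binary subscription label.
from tensorflow.keras import layers, models

def build_conv_gru_classifier(n_features):
    inputs = layers.Input(shape=(n_features, 1))        # each feature as one "timestep"
    x = layers.Conv1D(32, kernel_size=3, padding="same", activation="relu")(inputs)
    x = layers.MaxPooling1D(pool_size=2)(x)
    x = layers.GRU(32)(x)                                # recurrent (GRU) block
    x = layers.Dense(16, activation="relu")(x)
    outputs = layers.Dense(1, activation="sigmoid")(x)   # subscribe / not subscribe
    model = models.Model(inputs, outputs)
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy", "mse"])
    return model
```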


Author(s):  
Shawni Dutta ◽  
Payal Bose ◽  
Vishal Goyal ◽  
Samir Kumar Bandyopadhyay

Banks normally offer two kinds of deposit accounts: deposits such as current/savings accounts and term deposits such as fixed or recurring deposits. Term deposits can uplift the finance sector while maximizing profit from both the bank's and the customer's perspective. This paper focuses on the likelihood that a customer will subscribe to a term deposit. Bank campaign efforts and analysis of customer details can influence the chances of a subscription. This paper presents an automated system that predicts term deposit investment possibilities in advance. It proposes a deep-learning-based hybrid model that stacks convolutional layers and Recurrent Neural Network (RNN) layers as the predictive model; a Gated Recurrent Unit (GRU) is employed as the RNN. The proposed predictive model is compared with benchmark classifiers such as the k-Nearest Neighbor (k-NN), Decision Tree (DT), and Multi-layer Perceptron (MLP) classifiers. The experimental study concludes that the proposed model attains an accuracy of 89.59% and an MSE of 0.1041, outperforming the baseline models.


2021 ◽  
Author(s):  
Gothai E ◽  
Usha Moorthy ◽  
Sathishkumar V E ◽  
Abeer Ali Alnuaim ◽  
Wesam Atef Hatamleh ◽  
...  

With the evolution of Internet standards and advancements in various Internet and mobile technologies, especially since Web 4.0, more and more web and mobile applications have emerged, such as e-commerce, social networks, online gaming, and Internet of Things based applications. Due to the deployment and concurrent access of these applications on the Internet and mobile devices, the amount and variety of data generated increase exponentially, and the new era of Big Data has come into existence. Presently available data structures and data-analysis algorithms are not capable of handling such Big Data. Hence, there is a need for scalable, flexible, parallel, and intelligent data-analysis algorithms to handle and analyze this complex, massive data. In this article, we propose a novel distributed supervised machine learning algorithm, called MR-DWkNN, based on the MapReduce programming model and the Distance Weighted k-Nearest Neighbor algorithm, to process and analyze Big Data in a Hadoop cluster environment. The proposed distributed algorithm is based on supervised learning and performs both regression and classification tasks on large-volume Big Data applications. Three performance metrics are used to evaluate the proposed MR-DWkNN algorithm: Root Mean Squared Error (RMSE) and the coefficient of determination (R2) for the regression task, and accuracy for the classification task. The extensive experimental results show an average increase of 3–4.5% in prediction and classification performance compared to the standard distributed k-NN algorithm, a considerable decrease in RMSE, and good parallelism characteristics of scalability and speedup, which prove its effectiveness in Big Data prediction and classification applications.
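
The distance-weighted k-NN logic behind MR-DWkNN can be illustrated in plain Python: a "mapper" finds the k nearest neighbours of a query within its data block, and a "reducer" merges the per-block candidates into a distance-weighted vote. The Hadoop/MapReduce plumbing is omitted, and the inverse-distance weighting below is an assumption rather than the authors' exact scheme.

```python
# Plain-Python sketch of the map/reduce decomposition of distance-weighted k-NN.
import heapq
import math
from collections import defaultdict

def map_block(block, query, k):
    """Return the k nearest (distance, label) pairs within one data block."""
    dists = [(math.dist(x, query), y) for x, y in block]
    return heapq.nsmallest(k, dists, key=lambda t: t[0])

def reduce_candidates(candidate_lists, k, eps=1e-9):
    """Merge per-block candidates and take a distance-weighted majority vote."""
    merged = heapq.nsmallest(k, (c for lst in candidate_lists for c in lst),
                             key=lambda t: t[0])
    votes = defaultdict(float)
    for d, y in merged:
        votes[y] += 1.0 / (d + eps)          # closer neighbours weigh more
    return max(votes, key=votes.get)

# usage: blocks is an iterable of [(feature_vector, label), ...] partitions
# label = reduce_candidates([map_block(b, query, k=5) for b in blocks], k=5)
```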


2018 ◽  
Vol 7 (3.6) ◽  
pp. 154
Author(s):  
S K. Sajan ◽  
M Germanus Alex

Breast cancer is a major threat to humans irrespective of geographical limits. Awareness of breast cancer has increased during the last decade, and many preventive measures are in practice to detect breast cancer before symptoms are felt. Mammography is a screening methodology currently in practice. In this paper, the mammogram image is analyzed using an automated system designed to classify a mammogram image as normal or malignant. The process involves image enhancement and image segmentation at the preprocessing level. Histogram equalization is used to transform low-contrast regions of the mammogram into regions of higher contrast, and the Fuzzy C-Means (FCM) algorithm is used to segment the mammogram into regions suitable for further analysis. After enhancement and segmentation, classification is performed using three algorithms: a decision tree classifier, a neural network classifier, and a Support Vector Machine (SVM). The performance of the classification algorithms is evaluated using criteria such as speed, flexibility, robustness, scalability, interpretability, and time complexity, as well as accuracy, sensitivity, and specificity. The classification results are compared across the algorithms. The neural network classifier is found to produce better results than the other classifiers, with an average diagnostic accuracy of around 91%. The decision tree approach is also found to be more flexible and easier to use than the other approaches.
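
The preprocessing stage (histogram equalization followed by Fuzzy C-Means segmentation) could look roughly like the sketch below. The use of OpenCV and scikit-fuzzy, and all parameter values, are assumptions for illustration rather than the paper's implementation.

```python
# Rough sketch: contrast enhancement by histogram equalization, then FCM
# segmentation of the mammogram pixels into a small number of regions.
import cv2
import numpy as np
import skfuzzy as fuzz

def preprocess_mammogram(path, n_clusters=3):
    img = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    eq = cv2.equalizeHist(img)                        # lift low-contrast regions
    pixels = eq.reshape(1, -1).astype(np.float64)     # FCM expects (features, samples)
    cntr, u, *_ = fuzz.cluster.cmeans(pixels, c=n_clusters, m=2.0,
                                      error=0.005, maxiter=100)
    labels = np.argmax(u, axis=0).reshape(eq.shape)   # hard label per pixel
    return eq, labels                                 # enhanced image + segmentation map
```

In practice the image would likely be downsampled before clustering; running FCM on every pixel of a full-resolution mammogram is slow.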


Sensors ◽  
2021 ◽  
Vol 21 (7) ◽  
pp. 2515
Author(s):  
Bubryur Kim ◽  
N. Yuvaraj ◽  
K. R. Sri Preethaa ◽  
Gang Hu ◽  
Dong-Eun Lee

Wind tunnel testing techniques are the main research tools for evaluating the wind loadings of buildings. They are significant in designing structurally safe and comfortable buildings. The wind tunnel pressure measurement technique using pressure sensors is significant for assessing the cladding pressures of buildings. However, pressure sensors sometimes fail, causing losses of data that are difficult to restore. Numerous techniques have been implemented in the literature for imputing single-instance data values, but accurate data imputation over multiple instantaneous time intervals still needs to be addressed. Thus, this study uses the data imputation capacity of machine learning models to predict missing wind pressure data for tall buildings. A generative adversarial imputation network (GAIN) is proposed to predict the pressure coefficients at various instantaneous time intervals on tall buildings. The proposed model is validated by comparing the performance of GAIN with that of the k-nearest neighbor and multiple imputation by chained equations (MICE) models. The experimental results show that the GAIN model provides the best fit, achieving more accurate predictions with the minimum average variance and minimum average standard deviation. The average mean squared error for all four sides of the building was the minimum (0.016), and the average R-squared value was the maximum (0.961). The proposed model can help ensure the health and longevity of a structure under its wind environment.
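
The two baseline imputers the GAIN model is compared against, k-nearest-neighbour imputation and MICE-style chained-equation imputation, have readily available scikit-learn implementations; a minimal sketch is given below. The layout of the pressure-coefficient matrix (time steps by pressure taps, with NaNs for failed sensors) is an assumption.

```python
# Minimal sketch of the KNN and MICE baseline imputers via scikit-learn.
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401 (required before IterativeImputer)
from sklearn.impute import KNNImputer, IterativeImputer

def impute_baselines(cp_matrix):
    """cp_matrix: 2-D array of pressure coefficients with NaNs at failed sensors."""
    knn_filled = KNNImputer(n_neighbors=5).fit_transform(cp_matrix)
    mice_filled = IterativeImputer(max_iter=10, random_state=0).fit_transform(cp_matrix)
    return knn_filled, mice_filled
```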


Chronic Kidney Disease is a dangerous health problem that has been spreading and growing due to changes in lifestyle, such as food habits and changes in the environment. The biosciences have progressed to a great extent and have brought out huge amounts of data from Electronic Health Records. The primary aim of this paper is to classify chronic kidney disease using various classification techniques, including Logistic Regression (LR), the K-Nearest Neighbor (KNN) classifier, the Decision Tree classifier, the Random Forest classifier, the Support Vector Machine (SVM), and the SGD classifier. According to the health statistics of India, 63,538 cases of chronic renal disorder have been registered. The average age of men and women susceptible to renal disorders falls within the range of 48 to 70 years.
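
A hedged sketch of the classifier comparison named above, using the scikit-learn implementations of each model, is shown below; the preprocessing, hyperparameters, and CKD feature set are assumptions rather than the paper's exact setup.

```python
# Compare the listed classifiers by cross-validated accuracy on a CKD dataset.
from sklearn.linear_model import LogisticRegression, SGDClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

CLASSIFIERS = {
    "LR": LogisticRegression(max_iter=1000),
    "KNN": KNeighborsClassifier(n_neighbors=5),
    "DT": DecisionTreeClassifier(random_state=0),
    "RF": RandomForestClassifier(n_estimators=100, random_state=0),
    "SVM": SVC(kernel="rbf"),
    "SGD": SGDClassifier(random_state=0),
}

def compare_ckd_classifiers(X, y, cv=5):
    """Return mean cross-validated accuracy for each candidate model."""
    return {name: cross_val_score(clf, X, y, cv=cv, scoring="accuracy").mean()
            for name, clf in CLASSIFIERS.items()}
```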


Author(s):  
Nonvikan Karl-Augustt ALAHASSA ◽  
Alejandro Murua

We have built a Shallow Gibbs Network model as a Random Gibbs Network Forest to reach the performance of the multilayer feedforward neural network with fewer parameters and fewer backpropagation iterations. To make this happen, we propose a novel optimization framework for our Bayesian Shallow Network, called the Double Backpropagation Scheme (DBS), which can also fit the data perfectly with an appropriate learning rate, and which is convergent and universally applicable to any Bayesian neural network problem. The contribution of this model is broad. First, it integrates all the advantages of the Potts Model, a very rich random partition model, which we have also modified to propose its Complete Shrinkage version using agglomerative clustering techniques. The model also takes advantage of Gibbs Fields for its weight precision matrix structure, mainly through Markov Random Fields, and has five (5) variant structures: the Full-Gibbs, the Sparse-Gibbs, the Between-layer Sparse Gibbs (B-Sparse Gibbs for short), the Compound Symmetry Gibbs (CS-Gibbs for short), and the Sparse Compound Symmetry Gibbs (Sparse-CS-Gibbs) model. The Full-Gibbs is mainly intended to recall fully connected models, while the other structures show how the model can be reduced in complexity through sparsity and parsimony. All these models have been evaluated on the Mulan project multivariate regression dataset, and the results arouse interest in these structures, in the sense that different structures lead to different results in terms of Mean Squared Error (MSE) and Relative Root Mean Squared Error (RRMSE). For the Shallow Gibbs Network model, we have found the perfect learning framework: the $(l_1, \boldsymbol{\zeta}, \epsilon_{dbs})-\textbf{DBS}$ configuration, which is a combination of the \emph{Universal Approximation Theorem} and the DBS optimization, coupled with the (\emph{dist})-Nearest Neighbor-(h)-Taylor Series-Perfect Multivariate Interpolation (\emph{dist}-NN-(h)-TS-PMI) model [which in turn combines the search for the nearest neighborhood for a good train-test association, the Taylor Approximation Theorem, and the Multivariate Interpolation Method]. It indicates that, with an appropriate number $l_1$ of neurons on the hidden layer, an optimal number $\zeta$ of DBS updates, an optimal DBS learning rate $\epsilon_{dbs}$, an optimal distance \emph{dist}$_{opt}$ in the search for the nearest neighbor in the training dataset for each test datum $x_i^{\mbox{test}}$, and an optimal order $h_{opt}$ of the Taylor approximation for the Perfect Multivariate Interpolation (\emph{dist}-NN-(h)-TS-PMI) model once the {\bfseries DBS} has overfitted the training dataset, the training and test errors converge to zero (0).


2022 ◽  
Vol 12 (2) ◽  
pp. 749
Author(s):  
Yunfei Gao ◽  
Albert No

Finding a biomarker that indicates the subject's age is one of the most important topics in biology. Several recent studies have tried to extract such a biomarker from brain imaging data, including fMRI data. However, most of them focused on MRI data, which do not capture dynamics, and few attempted to apply recently proposed deep learning models. We propose a deep neural network model that estimates the age of a subject from fMRI images using a recurrent neural network (RNN), more precisely a gated recurrent unit (GRU). However, applying neural networks is not trivial due to the high-dimensional nature of fMRI data. In this work, we propose a novel preprocessing technique using the Automated Anatomical Labeling (AAL) atlas, which significantly reduces the input dimension. The proposed dimension reduction technique allows us to train our model with 640 training and validation samples from different projects under a mean squared error (MSE) loss. Finally, we obtain a correlation of 0.905 between the predicted and actual ages on 155 test samples. The proposed model estimates the age within ±12 years on most of the test samples. Our model is written in Python and is freely available for download.
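
A minimal version of the described pipeline, a GRU regressor over AAL-reduced ROI time series trained with an MSE loss, might look like the following Keras sketch. The layer sizes, the 116-region AAL parcellation, and other details are illustrative assumptions, not the authors' released code.

```python
# Illustrative GRU age regressor over per-ROI fMRI time series.
from tensorflow.keras import layers, models

def build_gru_age_regressor(n_timepoints, n_rois=116):
    inputs = layers.Input(shape=(n_timepoints, n_rois))  # ROI signals over time
    x = layers.GRU(64)(inputs)                            # summarize the dynamics
    x = layers.Dense(32, activation="relu")(x)
    age = layers.Dense(1)(x)                              # predicted age in years
    model = models.Model(inputs, age)
    model.compile(optimizer="adam", loss="mse", metrics=["mae"])
    return model
```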


