Developing a Scalable and Accurate Job Recommendation System with Distributed Cluster System using Machine Learning Algorithm

2021 · Vol 7 (2) · pp. 71-78
Author(s): Timothy Dicky, Alva Erwin, Heru Purnomo Ipung

The purpose of this research is to develop a job recommender system based on the Hadoop MapReduce framework, so that the system scales as it processes big data, with a machine learning algorithm inside the recommender to produce accurate job recommendations. The project begins by collecting sample data and building an accurate job recommender with a centralized program architecture. A job recommender with a distributed program architecture is then implemented using Hadoop MapReduce and deployed to a Hadoop cluster. After implementation, both systems are tested against a large set of applicant and job data, and the time each program needs to process the data is recorded and analyzed. Based on the experiments, we conclude that the recommender produces the most accurate results when the cosine similarity measure is used inside the algorithm. The centralized job recommender also processes the data faster than the distributed cluster recommender; however, as the data grows, the centralized system eventually lacks the capacity to process it, while the distributed cluster recommender scales with the size of the data.
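The paper does not include code, but the core ranking step it describes can be sketched briefly. Below is a minimal, illustrative cosine-similarity matcher in Python; the skill vectors and job names are hypothetical, and the authors' Hadoop MapReduce implementation is not reproduced.

```python
# Minimal sketch of cosine-similarity job matching (illustrative only;
# not the paper's distributed MapReduce implementation).
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine of the angle between two feature vectors."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

# Hypothetical skill-frequency vectors for one applicant and three jobs.
applicant = np.array([1.0, 0.0, 2.0, 1.0])
jobs = {
    "data_engineer": np.array([1.0, 1.0, 2.0, 0.0]),
    "web_developer": np.array([0.0, 2.0, 0.0, 1.0]),
    "ml_engineer":   np.array([2.0, 0.0, 1.0, 1.0]),
}

# Rank jobs by similarity to the applicant's profile.
ranked = sorted(jobs.items(),
                key=lambda kv: cosine_similarity(applicant, kv[1]),
                reverse=True)
for title, vec in ranked:
    print(title, round(cosine_similarity(applicant, vec), 3))
```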

2019 · Vol 8 (4) · pp. 2299-2302

Implementing a machine learning algorithm gives you a deep and practical appreciation for how the algorithm works. This knowledge can also help you internalize the mathematical description of the algorithm by thinking of vectors and matrices as arrays and of the transformations on those structures in computational terms. Implementing a machine learning algorithm involves numerous micro-decisions, such as selecting the problem, selecting and researching the algorithm, selecting the programming language, and unit testing, and these decisions are often missing from formal algorithm descriptions. We introduce the notion of implementing a job recommendation system (a classic machine learning problem) using two algorithms, KNN [3] and logistic regression [3], in more than one programming language (C++ and Python), and we present an analysis and comparison of the performance of each. We focus specifically on building a model that predicts jobs in the field of computer science, but the approach can be applied to a wide range of other areas as well. Implementers can use this paper to deduce which language will best suit their needs for accuracy along with efficiency. We use more than one algorithm to establish that our findings are not singularly applicable.
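As a rough illustration of the comparison the paper describes, the following sketch trains both algorithms on synthetic data with scikit-learn and reports accuracy and fit time. The dataset, hyperparameters, and timing method are assumptions for the example, not the authors' setup.

```python
# Sketch of a KNN vs. logistic regression comparison on synthetic data.
import time
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for a job/applicant feature matrix.
X, y = make_classification(n_samples=2000, n_features=20,
                           n_classes=2, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25,
                                          random_state=0)

for name, model in [("knn", KNeighborsClassifier(n_neighbors=5)),
                    ("logreg", LogisticRegression(max_iter=1000))]:
    start = time.perf_counter()
    model.fit(X_tr, y_tr)
    acc = model.score(X_te, y_te)
    print(f"{name}: accuracy={acc:.3f}, "
          f"time={time.perf_counter() - start:.3f}s")
```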


2020 · Vol 10 (4) · pp. 5-16
Author(s): V.A. Sudakov, I.A. Trofimov

The article proposes an unsupervised machine learning algorithm for assessing the most probable relationship between two elements of a set of customers and goods/services in order to build a recommendation system. Methods based on collaborative filtering and content-based filtering are considered. A combined algorithm for identifying relationships on sets has been developed that combines the advantages of the analyzed approaches. The complexity of the algorithm is estimated, and recommendations are given for implementing it efficiently so as to reduce the amount of memory used. The application of the combined algorithm is shown using a book recommendation problem as an example. The algorithm can be used for the "cold start" of a recommender system, when there are no labeled, high-quality samples for training more complex models.
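A minimal sketch of the hybrid idea, assuming a simple convex combination of a collaborative score and a content-based score; the component functions, the weight alpha, and the example vectors are invented for illustration and are not the authors' exact combined algorithm.

```python
# Illustrative hybrid scoring for one user-item pair.
import numpy as np

def collaborative_score(similar_users_ratings: np.ndarray,
                        max_rating: float = 5.0) -> float:
    """Mean rating that similar users gave this item, scaled to [0, 1]."""
    return float(similar_users_ratings.mean() / max_rating)

def content_score(item_features: np.ndarray,
                  user_profile: np.ndarray) -> float:
    """Cosine similarity between item features and the user's profile."""
    denom = np.linalg.norm(item_features) * np.linalg.norm(user_profile)
    return float(item_features @ user_profile / denom) if denom else 0.0

def combined_score(cf: float, cb: float, alpha: float = 0.5) -> float:
    """Convex combination; lowering alpha leans on content for cold starts."""
    return alpha * cf + (1 - alpha) * cb

ratings = np.array([4.0, 5.0, 3.0])   # ratings from similar users
item = np.array([1.0, 0.0, 1.0])      # item (book) feature vector
profile = np.array([0.8, 0.2, 0.6])   # user profile in the same space
print(combined_score(collaborative_score(ratings),
                     content_score(item, profile)))
```

For a cold start, where collaborative ratings are scarce, alpha can be pushed toward zero so the content-based term dominates until enough interaction data accumulates.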


2021 · Vol 2069 (1) · pp. 012153
Author(s): Rania Labib

Architects often investigate the daylighting performance of hundreds of design solutions and configurations to ensure an energy-efficient design. To shorten daylighting simulation time, they usually reduce the number of variables or parameters of the building and facade design, a practice that tends to eliminate design variables that could contribute to an energy-optimized configuration. Recent research has therefore focused on machine learning algorithms that require executing only a relatively small subset of the simulations to predict the daylighting and energy performance of buildings. Although machine learning has been shown to be accurate, it remains time-consuming because of the simulations that must be executed to produce training and validation data. To save time, designers often use a small simulation subset, which leads to a poorly trained model that produces inaccurate results. This study therefore introduces an automated framework that uses high performance computing (HPC) to execute the simulations the machine learning algorithm needs while saving time and effort. High performance computing runs thousands of tasks simultaneously for a time-efficient simulation process, allowing designers to increase the size of the simulation subset. Pairing high performance computing with machine learning allows for accurate and nearly instantaneous building performance predictions.
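A toy stand-in for this workflow, assuming Python: the "simulations" run in parallel via a process pool (a real pipeline would submit jobs to an HPC scheduler), and a surrogate regressor is then trained on the results. The daylight function is a made-up analytic placeholder, not an actual daylighting engine such as Radiance.

```python
# Parallel "simulations" followed by surrogate-model training.
from concurrent.futures import ProcessPoolExecutor

import numpy as np
from sklearn.ensemble import RandomForestRegressor

def simulate_daylight(params):
    """Placeholder for an expensive daylighting simulation."""
    window_ratio, overhang_depth = params
    return 100 * window_ratio - 20 * overhang_depth * window_ratio

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Sample 500 (window ratio, overhang depth) design configurations.
    samples = rng.uniform([0.1, 0.0], [0.9, 1.5], size=(500, 2))

    # Run all simulations concurrently (stand-in for HPC job submission).
    with ProcessPoolExecutor() as pool:
        results = list(pool.map(simulate_daylight, samples))

    # Train a surrogate on the simulated results; predictions are
    # then nearly instantaneous for unseen configurations.
    surrogate = RandomForestRegressor(random_state=0).fit(samples, results)
    print(surrogate.predict([[0.5, 0.75]]))
```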


2019 · Vol 18 (01) · pp. 1950011
Author(s): Jasem M. Alostad

With recent advances in e-commerce platforms, information overload has grown due to the increasing number of users and the rapid generation of data and items in recommender systems, which creates serious problems for such systems. The growing feature space of recommender systems poses new challenges, as these systems have poor resilience against vulnerability attacks. In particular, recommender systems are prone to shilling attacks, and a system with poor attack detection suffers a reduced detection rate and degraded overall performance. Hence, in this paper we improve resilience against shilling attacks using a modified Support Vector Machine (SVM) combined with a machine learning algorithm: a Gaussian Mixture Model is used to increase the detection rate, and it further reduces the dimensionality of the data in the recommender system. The proposed method is evaluated on several metrics, such as recall, precision, and false positive rate across different attacks. The results of the proposed system are compared against probabilistic recommender approaches to demonstrate the efficacy of machine learning in recommender systems.
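A hedged sketch of the pipeline as described, assuming scikit-learn: a Gaussian Mixture Model compresses rating-profile features into per-component responsibilities, and an SVM then separates genuine from shilling profiles. The features and labels below are synthetic placeholders, not the paper's data or exact feature construction.

```python
# GMM-based dimensionality reduction feeding an SVM attack detector.
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.svm import SVC

rng = np.random.default_rng(0)
genuine = rng.normal(0.0, 1.0, size=(300, 10))   # stand-in user profiles
shilling = rng.normal(2.0, 0.5, size=(60, 10))   # attack profiles cluster
X = np.vstack([genuine, shilling])
y = np.array([0] * 300 + [1] * 60)               # 1 = shilling profile

# GMM responsibilities reduce the 10-D features to n_components dims.
gmm = GaussianMixture(n_components=4, random_state=0).fit(X)
X_reduced = gmm.predict_proba(X)

clf = SVC(kernel="rbf").fit(X_reduced, y)
print("training accuracy:", clf.score(X_reduced, y))
```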


Author(s): Man Tianxing, Ildar Raisovich Baimuratov, Natalia Alexandrovna Zhukova

With the development of Big Data, data analysis technology has advanced rapidly and is now used across many subject fields, and more and more researchers without a computer science background use machine learning algorithms in their work. Unfortunately, datasets can be messy, and knowledge cannot be extracted from them directly, which is why they need preprocessing. Because of the diversity of available algorithms, it is difficult for such researchers to find the most suitable one; most choose algorithms by intuition, and the result is often unsatisfactory. This article therefore proposes a recommendation system for data processing. The system consists of an ontology subsystem and an estimation subsystem: ontology technology is used to represent the taxonomy of machine learning algorithms, and information-theoretic criteria are used to form the recommendations. The system helps users apply data processing algorithms without specific knowledge of the data science field.
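One way to picture an information-theoretic recommendation criterion is a rule that scores each dataset column by its Shannon entropy and missing-value ratio and suggests a preprocessing step. The thresholds and rules below are invented for illustration and do not reproduce the authors' ontology-driven system.

```python
# Toy entropy-based preprocessing recommendation rule.
import math
from collections import Counter

def shannon_entropy(values):
    """Shannon entropy (bits) of a list of discrete values."""
    counts = Counter(values)
    n = len(values)
    return -sum(c / n * math.log2(c / n) for c in counts.values())

def recommend(column):
    """Suggest a preprocessing step from simple, invented thresholds."""
    present = [v for v in column if v is not None]
    missing_ratio = 1 - len(present) / len(column)
    if missing_ratio > 0.2:
        return "imputation"
    if shannon_entropy(present) < 1.0:   # near-constant column
        return "drop or re-encode"
    return "keep as-is"

print(recommend([1, 1, 1, 2, None]))          # low entropy
print(recommend(["a", "b", "c", "d", "a"]))   # diverse categorical
```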


2018
Author(s): C.H.B. van Niftrik, F. van der Wouden, V. Staartjes, J. Fierstra, M. Stienen, ...

Author(s): Kunal Parikh, Tanvi Makadia, Harshil Patel

Dengue is unquestionably one of the biggest health concerns in India and many other developing countries, and many people have lost their lives to it. Every year, approximately 390 million dengue infections occur around the world, of which about 500,000 become serious and about 25,000 result in death. Many factors can drive dengue, such as temperature, humidity, precipitation, inadequate public health infrastructure, and others. In this paper, we propose a method to perform predictive analytics on a dengue dataset using KNN, a machine learning algorithm. This analysis would help predict future cases and could save many lives.
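A minimal sketch in the spirit of the paper, assuming scikit-learn's KNN classifier and fabricated weather features; the data and risk labels are illustrative only.

```python
# KNN on made-up weather features for dengue risk prediction.
from sklearn.neighbors import KNeighborsClassifier

# Columns: temperature (deg C), humidity (%), precipitation (mm).
X_train = [
    [32, 85, 120], [30, 80, 100], [28, 70, 40],
    [25, 60, 10],  [33, 90, 150], [26, 55, 5],
]
y_train = [1, 1, 0, 0, 1, 0]  # 1 = high dengue risk, 0 = low risk

model = KNeighborsClassifier(n_neighbors=3).fit(X_train, y_train)
print(model.predict([[31, 82, 110]]))  # expected: high risk (1)
```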

