CAN MACHINE LEARNING ALGORITHMS ASSOCIATED WITH TEXT MINING FROM INTERNET DATA IMPROVE HOUSING PRICE PREDICTION PERFORMANCE?

Housing frenzies in China have attracted widespread global attention over the past few years, but the key is how to more accurately forecast housing prices in order to establish an effective real estate policy. Based on the ubiquitousness and immediacy of Internet data, this research adopts a broader version of text mining to search for keywords in relation to housing prices and then evaluates the predictive abilities using machine learning algorithms. Our findings indicate that this new method, especially random forest, not only detects turning points, but also offers prediction ability that clearly outperforms traditional regression analysis. Overall, the prediction based on online search data through a machine learning mechanism helps us better understand the trends of house prices in China.

Download Full-text

Corrigendum to “Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data” [Expert Systems with Applications 42 (2015) 2928–2934]

Expert Systems with Applications ◽

10.1016/j.eswa.2015.03.005 ◽

2015 ◽

Vol 42 (19) ◽

pp. 6806 ◽

Cited By ~ 2

Author(s):

Byeonghwa Park ◽

Jae Kwon Bae

Keyword(s):

Machine Learning ◽

Expert Systems ◽

Learning Algorithms ◽

Housing Price ◽

Machine Learning Algorithms ◽

Fairfax County ◽

Price Prediction

Download Full-text

Machine Learning Housing Price Prediction in Petaling Jaya, Selangor, Malaysia

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1084.0982s1119 ◽

2019 ◽

Vol 8 (2S11) ◽

pp. 542-546

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Prediction Models ◽

Learning Algorithms ◽

Housing Price ◽

Machine Learning Algorithms ◽

Selling Price ◽

Prediction Problem ◽

Real Dataset ◽

Price Prediction

This paper demonstrates the utilization of machine learning algorithms in the prediction of housing selling prices on real dataset collected from the Petaling Jaya area, Selangor, Malaysia. To date, literature about research on machine learning prediction of housing selling price in Malaysia is scarce. This paper provides a brief review of the existing machine learning algorithms for the prediction problem and presents the characteristics of the collected datasets with different groups of feature selection. The findings indicate that using irrelevant features from the dataset can decrease the accuracy of the prediction models.

Download Full-text

Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data

Expert Systems with Applications ◽

10.1016/j.eswa.2014.11.040 ◽

2015 ◽

Vol 42 (6) ◽

pp. 2928-2934 ◽

Cited By ~ 78

Author(s):

Byeonghwa Park ◽

Jae Kwon Bae

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Housing Price ◽

Machine Learning Algorithms ◽

Fairfax County ◽

Price Prediction

Download Full-text

Housing-Price Prediction in Colombia using Machine Learning

10.31219/osf.io/w85z2 ◽

2021 ◽

Author(s):

MIGUEL ANGEL CORREA MANRIQUE ◽

Omar Becerra Sierra ◽

Daniel Otero Gomez ◽

Henry Laniado ◽

Rafael Mateus C ◽

...

Keyword(s):

Machine Learning ◽

Evaluation Studies ◽

Housing Prices ◽

House Price ◽

Housing Price ◽

Machine Learning Algorithms ◽

Learning Tools ◽

Price Prediction ◽

Statistical Indicator ◽

Real Estate Company

It is a common practice to price a house without proper evaluation studies being performed for assurance. That is why the purpose of this study provide an explanatory model by establishing parameters for accuracy in interpretation and projection of housing prices. In addition, it is intentioned to establish proper data preprocessing practices in order to increase the accuracy of machine learning algorithms. Indeed, according to our literature review, there are few articles and reports on the use of Machine Learning tools for the prediction of property prices in Colombia. The dataset in which the research is built upon was provided by an existing real estate company. It contains near 940,000 items (housing advertisements) posted on the platform from the year 2018 to 2020. The database was enriched using statistical imputation techniques. Housing prices prediction was performed using Decision Tree Regressors and LightGBM methods, thus deriving in better alternatives for house price prediction in Colombia. Moreover, to measure the accuracy of the proposed models, the Root Mean Squared Logarithmic Error (RMSLE) statistical indicator was used. The best cross validation results obtained were 0.25354±0.00699 for the LightGBM, 0.25296 ±0.00511 for the Bagging Regressor, and 0.25312±0.00559 for the ExtraTree Regressor with Bagging Regressor, and it was not found a statistical difference between their performances.

Download Full-text

Housing Price Prediction Using Machine Learning Algorithms: The Case of Melbourne City, Australia

2018 International Conference on Machine Learning and Data Engineering (iCMLDE) ◽

10.1109/icmlde.2018.00017 ◽

2018 ◽

Cited By ~ 5

Author(s):

The Danh Phan

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Housing Price ◽

Machine Learning Algorithms ◽

Price Prediction

Download Full-text

Crop price prediction using supervised machine learning algorithms

Journal of Physics Conference Series ◽

10.1088/1742-6596/1916/1/012042 ◽

2021 ◽

Vol 1916 (1) ◽

pp. 012042

Author(s):

Ranjani Dhanapal ◽

A AjanRaj ◽

S Balavinayagapragathish ◽

J Balaji

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Price Prediction

Download Full-text

Spatial Prediction of Housing Prices in Beijing Using Machine Learning Algorithms

Proceedings of the 2020 4th High Performance Computing and Cluster Technologies Conference & 2020 3rd International Conference on Big Data and Artificial Intelligence ◽

10.1145/3409501.3409543 ◽

2020 ◽

Author(s):

Ziyue Yan ◽

Lu Zong

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Housing Prices ◽

Spatial Prediction ◽

Machine Learning Algorithms

Download Full-text

Machine Learning Algorithms for Diamond Price Prediction

Proceedings of the 2020 2nd International Conference on Image, Video and Signal Processing ◽

10.1145/3388818.3393715 ◽

2020 ◽

Cited By ~ 1

Author(s):

Waad Alsuraihi ◽

Ekram Al-hazmi ◽

Kholoud Bawazeer ◽

Hanan Alghamdi

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Price Prediction

Download Full-text

House Price Prediction Using Machine Learning Algorithms

Soft Computing Systems - Communications in Computer and Information Science ◽

10.1007/978-981-13-1936-5_45 ◽

2018 ◽

pp. 425-433 ◽

Cited By ~ 1

Author(s):

Naalla Vineeth ◽

Maturi Ayyappa ◽

B. Bharathi

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

House Price ◽

Machine Learning Algorithms ◽

Price Prediction

Download Full-text

Machine learning algorithms for predicting undernutrition among under-five children in Ethiopia

Public Health Nutrition ◽

10.1017/s1368980021004262 ◽

2021 ◽

pp. 1-29

Author(s):

Fikrewold H. Bitew ◽

Corey S. Sparks ◽

Samuel H. Nyarko

Keyword(s):

Machine Learning ◽

Linear Models ◽

Learning Algorithms ◽

Public Health Problem ◽

Water Source ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Global Public Health ◽

Prediction Ability ◽

Extreme Gradient Boosting

Abstract Objective: Child undernutrition is a global public health problem with serious implications. In this study, estimate predictive algorithms for the determinants of childhood stunting by using various machine learning (ML) algorithms. Design: This study draws on data from the Ethiopian Demographic and Health Survey of 2016. Five machine learning algorithms including eXtreme gradient boosting (xgbTree), k-nearest neighbors (K-NN), random forest (RF), neural network (NNet), and the generalized linear models (GLM) were considered to predict the socio-demographic risk factors for undernutrition in Ethiopia. Setting: Households in Ethiopia. Participants: A total of 9,471 children below five years of age. Results: The descriptive results show substantial regional variations in child stunting, wasting, and underweight in Ethiopia. Also, among the five ML algorithms, xgbTree algorithm shows a better prediction ability than the generalized linear mixed algorithm. The best predicting algorithm (xgbTree) shows diverse important predictors of undernutrition across the three outcomes which include time to water source, anemia history, child age greater than 30 months, small birth size, and maternal underweight, among others. Conclusions: The xgbTree algorithm was a reasonably superior ML algorithm for predicting childhood undernutrition in Ethiopia compared to other ML algorithms considered in this study. The findings support improvement in access to water supply, food security, and fertility regulation among others in the quest to considerably improve childhood nutrition in Ethiopia.

Download Full-text