Building a Predictive Model for Predicting Real Estate Prices Based on the Generated Database

Abstract

The work is devoted to solving the current problem of forecasting real estate prices by building a predictive model based on the generated database of real estate in Moscow, posted on the Move Real Estate website. Existing machine learning methods for solving the forecasting problem are considered and one of them is applied - multiple linear regression. A regression analysis of the obtained results of solving the forecasting problem was carried out. Eleven independent variables are considered as control parameters. The influence of the variables taken into account when constructing the model on the results of solving the problem of forecasting real estate prices was studied. It was determined which of the independent variables have the greatest impact on the results of the model. To improve the quality of the model, preprocessing and standardization of features were carried out. Identification of outliers and omissions of values was carried out during the formation of the database. The coefficients of the multiple linear regression model were determined using the least squares method. To assess the quality of the model, the following model parameters are analyzed: R-squared, adjusted R-squared, p-value. The result of constructing a predictive model is the resulting regression equation. The application of the resulting equation can be used to subsequently take into account specific characteristics when solving the problem of forecasting real estate prices. The work shows the advantages of using this method and the prospects for applying the obtained result.

About the authors

Polina A. Konyaeva

RUDN University

Email: 1032212116@pfur.ru
Master’s student, Academy of Engineering Moscow, Russia

Olga A. Saltykova

RUDN University

Author for correspondence.
Email: saltykova-oa@rudn.ru
ORCID iD: 0000-0002-3880-6662
SPIN-code: 3969-6701

Doctor of Sciences (Techn.), Associate Professor of the Department of Mechanics and Control Processes, Academy of Engineering

Moscow, Russia

Sergei A. Kupreev

RUDN University

Email: kupreev-sa@rudn.ru
ORCID iD: 0000-0002-8657-2282
SPIN-code: 2287-2902

Doctor of Sciences (Techn.), Professor of the Department of Mechanics and Control Processes, Academy of Engineering

Moscow, Russia

References

  1. Alekseev G. Introduction to machine learning. Habr. 2019. (In Russ.) Available from: https://habr.com/ru/articles/448892 / (accessed: 03.27.2023).
  2. Leifer LA, Chernaya EV. Machine learning techniques for real estate mass valuation. Analysis of accuracy for various methods on the example of the appraisal of apartments. Property relations in the Russian Federation. 2020;3:32–42. (In Russ.) EDN: BQRFXJ
  3. Kok N, Koponen E-L, Martinez-Barbosa CA. Big Data in Real Estate From Manual Appraisal to Automated Valuation». The Journal of Portfolio Management. 2017; 43(6):202–211. https://doi.org/10.3905/jpm.2017.43.6.202
  4. Yasnitskiy VL. Using a neural network to solve the problem of mass real estate appraisal of city Perm. Fundamental Research. 2015;10–3:650–653. (In Russ.) EDN: UNXWSX
  5. Surkov FA, Petkova NV, Sukhovskiy SF. Neural network data analysis methods in real estate valuation. News of universities. North Caucasus region. Technical science. 2016;3:38–45. (In Russ.) https://doi.org/10.17213/0321-2653-2016-3-38-45
  6. Arefieva E.A, Kostyaev D S. Using neural NETWORKS for evaluation of market cost of real estate. News of the Tula State University. Technical science. 2017; 10:177–184. (In Russ.) EDN: ZVLGJH
  7. Vykhodtsev NA. Artificial intelligence in price estimation of real estate. Proceedings of the TUSUR University. 2021;24(1):68–72. (In Russ.) https://doi.org/10.21293/1818-0442-2021-24-1-68-72
  8. Arzamastsev SA, Bgatov MV, Kartysheva EN, Derkunsky VA, Semenchikov DN. Predicting subscriber churn: comparison of machine learning methods. Computer tools in education. 2018;5:5–23. (In Russ.) https://doi.org/10.32603/2071-2340-2018-5-5-23
  9. Radchuk MA, Kopytina EA. Development of a software tool for predicting customer churn using machine learning methods. Collection of student scientific works of the Faculty of Computer Science of VSU. 2019. p. 190–196. (In Russ.) EDN: PSWAXM
  10. Lalwani P, Mishra MK, Chadha JS, Sethi P. Customer churn prediction system: a machine learning approach. Computing. 2022;104(2):271–294. https://doi.org/10.1007/s00607-021-00908-y
  11. Khodabandehlou S, Zivari Rahman M. Comparison of supervised machine learning techniques for customer churn prediction based on analysis of customer behavior. Journal of Systems and Information Technology. 2017;19(1/2):65–93. https://doi.org/10.1108/JSIT-102016-0061
  12. Andrianova EG, Novikova OA. The role of text mining methods in automating stock market forecasting. Cloud of science. 2018;5(1):196–211. (In Russ.) EDN: YUTIIN
  13. Kovalenko IA. Use of artificial intelligence in the exchange and over-the-counter securities markets. Bulletin of Science. 2023;3(6):75–80. (In Russ.) Available from: https://www.xn----8sbempclcwd3bmt.xn--p1ai/article/ 8956 (accessed: 30.03.2023).
  14. Henrique BM, Sobreiro VA, Kimura H. Literature review: Machine learning techniques applied to financial market prediction. Expert Systems with Applications. 2019; 124:226–251. https://doi.org/10.1016/j.eswa.2019.01.012
  15. Kumbure MM, Lohrmann C, Luukka P, Porras J. Machine learning techniques and data for stock market forecasting: A literature review. Expert Systems with Applications. 2022;197:116659.
  16. Mahesh B. Machine learning algorithms-a review. International Journal of Science and Research. 2020; 9(1):381–386. https://doi.org/10.21275/ART20203995
  17. Salnikov VA, Mikheeva OM. Models for forecasting prices on the Moscow residential real estate market. Problems of forecasting. 2018;1(166):129–139. (In Russ.) EDN: YLXJZZ
  18. Sternik GM, Pechenkina AV. Forecast of supply prices for apartments on the Moscow housing market (macroeconomic approach). Property relations in the Russian Federation. 2007;10:11–18. (In Russ.) EDN: JXADIB
  19. Nazarov A. Regression analysis in DataScience. Simple linear regression. statsmodels library. Habr. 2022. (In Russ.) Available from: https://habr.com/ru/articles690414/ (accessed: 30.03.2023).
  20. Dronov V. Linear regression using Scikit-Learn in Python. Learning Python. 2021. (In Russ.) Available from: https://tonais.ru/library/lineynaya-regressiya-s-pomo schyu-scikit-learn-v-python (accessed: 05.04.2023).
  21. Aylin А. Normalization vs. standardization in linear regression. Machine learning. 2023. Available from: https://www.baeldung.com/cs/normalization-vs-standardization (accessed: 15.04.2023).

Copyright (c) 2024 Konyaeva P.A., Saltykova O.A., Kupreev S.A.

License URL: https://creativecommons.org/licenses/by-nc/4.0/legalcode

This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies