MACHINE LEARNING ALGORITHMS IN FORECASTING: A COMPARATIVE ANALYSIS OF LINEAR REGRESSION, RANDOM FOREST, AND XGBOOST
Keywords:
Machine Learning, Forecasting, Linear Regression, Random Forest, XGBoost, Predictive Modeling, Regression AnalysisAbstract
Forecasting plays a crucial role in various domains such as finance, healthcare, energy, and economics. With the advancement of machine learning techniques, predictive accuracy has significantly improved compared to traditional statistical methods. This study presents a comparative analysis of three widely used machine learning algorithms—Linear Regression, Random Forest, and XGBoost—in forecasting tasks. Using benchmark datasets and standardized evaluation metrics such as Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R² score, the performance of each algorithm is analyzed. The results demonstrate that ensemble-based models outperform linear models in capturing complex nonlinear relationships, while linear regression remains effective for interpretable and low-variance datasets.
References
James, G., Witten, D., Hastie, T., Tibshirani, R. An Introduction to Statistical Learning, Springer, 2013, pp. 15–45.
Bishop, C. M. Pattern Recognition and Machine Learning, Springer, 2006, pp. 23–67.
Zhou, Z.-H. Ensemble Methods: Foundations and Algorithms, CRC Press, 2012, pp. 89–120.
Harrison, D., Rubinfeld, D. Hedonic Housing Prices and the Demand for Clean Air, Journal of Environmental Economics, 1978, pp. 81–102.
Han, J., Kamber, M., Pei, J. Data Mining: Concepts and Techniques, Elsevier, 2011, pp. 55–78.
Draper, N., Smith, H. Applied Regression Analysis, Wiley, 1998, pp. 101–150.
Breiman, L. Random Forests, Machine Learning Journal, 2001, pp. 5–32.
Chen, T., Guestrin, C. XGBoost: A Scalable Tree Boosting System, KDD Conference, 2016, pp. 785–794.
Willmott, C., Matsuura, K. Advantages of MAE over RMSE, Climate Research, 2005, pp. 79–82.
Tsanas, A., Xifara, A. Accurate Quantitative Estimation of Energy Performance, Energy and Buildings, 2012, pp. 560–567.
Molnar, C. Interpretable Machine Learning, 2020, pp. 45–90.
Nielsen, D. Tree Boosting With XGBoost, Master’s Thesis, NTNU, 2016, pp. 34–60.






Azerbaijan
Türkiye
Uzbekistan
Kazakhstan
Turkmenistan
Kyrgyzstan
Republic of Korea
Japan
India
United States of America
Kosovo