Prediction of Daily Streamflow Data Using Ensemble Learning Models
Main Article Content
Abstract
Estimating river streamflow is a key task for both flood protection and optimal water resource management. The high degree of uncertainty regarding watershed characteristics, hydrological processes, and climatic factors affecting river flows makes streamflow estimation a challenging problem. These reasons, combined with the increasing prevalence of data on streamflow and precipitation, often lead to data-driven models being preferred over physically-based or conceptual forecasting models. The goal of this study is to predict daily river streamflow data with high accuracy using bagging and boosting approaches, which are ensemble learning methods. In addition, the effect of tributary streamflow on the forecast performance was analyzed in the estimation of the streamflow data. According to the results obtained, it has been shown that ensemble learning models are successful in estimating daily streamflow data, and if the tributary streamflow data is also used as input in the estimation of the streamflow, the determination and correlation performance parameters are improved, and the streamflow data can be estimated using tributary streamflow data.
Downloads
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
References
Khazaee Poul, A., Shourian, M., & Ebrahimi, H., (2019). A comparative study of MLR, KNN, ANN and ANFIS models with wavelet transform in monthly streamflow prediction. Water Resources Management, 33(8), 2907–2923. DOI: https://doi.org/10.1007/s11269-019-02273-0
de Santana Moreira, R. M., & Celeste, A. B., (2017). Performance evaluation of implicit stochastic reservoir operation optimization supported by long-term mean inflow forecast. Stochastic Environmental Research and Risk Assessment, 31(9), 2357–2364. DOI: https://doi.org/10.1007/s00477-016-1341-4
Lehner, F., Wood, A. W., Llewellyn, D., Blatchford, D. B., et al.,(2017). Mitigating the impacts of climate nonstationarity on seasonal streamflow predictability in the US Southwest. Geophysical Research Letters, 44(24), 12–208. DOI: https://doi.org/10.1002/2017GL076043
Hamlet, A. F., Huppert, D., & Lettenmaier, D. P., (2002). Economic value of long-lead streamflow forecasts for Columbia River hydropower. Journal of Water Resources Planning and Management, 128(2), 91–101. DOI: https://doi.org/10.1061/(ASCE)0733-9496(2002)128:2(91)
Yaseen, Z. M., Sulaiman, S. O., Deo, R. C., & Chau, K.-W., (2019). An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. Journal of Hydrology, 569, 387–408. DOI: https://doi.org/10.1016/j.jhydrol.2018.11.069
Hadi, S. J., & Tombul, M., (2018). Monthly streamflow forecasting using continuous wavelet and multi-gene genetic programming combination. Journal of Hydrology, 561, 674–687. DOI: https://doi.org/10.1016/j.jhydrol.2018.04.036
Hadi, S. J., & Tombul, M., (2018). Streamflow forecasting using four wavelet transformation combinations approaches with data-driven models: a comparative study. Water Resources Management, 32(14): 4661–4679. DOI: https://doi.org/10.1007/s11269-018-2077-3
Toth, E., (2009). Classification of hydro-meteorological conditions and multiple artificial neural networks for streamflow forecasting. Hydrology and Earth System Sciences, 13(9), 1555–1566. DOI: https://doi.org/10.5194/hess-13-1555-2009
Boucher, M., Quilty, J., & Adamowski, J., (2020). Data assimilation for streamflow forecasting using extreme learning machines and multilayer perceptrons. Water Resources Research, 56(6), e2019WR026226. DOI: https://doi.org/10.1029/2019WR026226
Wu, C. L., Chau, K. W., & Li, Y. S., (2009). Predicting monthly streamflow using data‐driven models coupled with data‐preprocessing techniques. Water Resources Research, 45(8). DOI: https://doi.org/10.1029/2007WR006737
Quilty, J., & Adamowski, J., (2020). A stochastic wavelet-based data-driven framework for forecasting uncertain multiscale hydrological and water resources processes. Environmental Modelling & Software, 130, 104718. DOI: https://doi.org/10.1016/j.envsoft.2020.104718
Wu, C. L., & Chau, K.-W., (2010). Data-driven models for monthly streamflow time series prediction. Engineering Applications of Artificial Intelligence, 23(8), 1350–1367. DOI: https://doi.org/10.1016/j.engappai.2010.04.003
Wang, W., Van Gelder, P. H., Vrijling, J. K., & Ma, J., (2006). Forecasting daily streamflow using hybrid ANN models. Journal of Hydrology, 324(1–4), 383–399. DOI: https://doi.org/10.1016/j.jhydrol.2005.09.032
Makridakis, S., & Hibon, M., (1997). ARMA models and the Box–Jenkins methodology. Journal of forecasting, 16(3), 147–163. DOI: https://doi.org/10.1002/(SICI)1099-131X(199705)16:3<147::AID-FOR652>3.0.CO;2-X
Kurunç, A., Yürekli, K., & Cevik, O., (2005). Performance of two stochastic approaches for forecasting water quality and streamflow data from Yeşilιrmak River, Turkey. Environmental Modelling & Software, 20(9), 1195–1200. DOI: https://doi.org/10.1016/j.envsoft.2004.11.001
Musa, J. J., (2013). Stochastic Modeling of Shiroro River Streamflow Process. .
Darlington, R. B., & Hayes, A. F., (2016). Regression Analysis and Linear Models: Concepts, Applications, and Implementation. Guilford Publications.
Tian, P., Lu, H., Feng, W., Guan, Y., et al., (2020). Large decrease in streamflow and sediment load of Qinghai–Tibetan Plateau driven by future climate change: A case study in Lhasa River Basin. Catena, 187, 104340. DOI: https://doi.org/10.1016/j.catena.2019.104340
Chu, H., Wei, J., Wu, W., Jiang, Y., et al., (2021). A classification-based deep belief networks model framework for daily streamflow forecasting. Journal of Hydrology, 595 (January), 125967. (https://doi.org/10.1016/j.jhydrol.2021.125967) DOI: https://doi.org/10.1016/j.jhydrol.2021.125967
Jothiprakash, V., & Magar, R. B., (2012). Multi-time-step ahead daily and hourly intermittent reservoir inflow prediction by artificial intelligent techniques using lumped and distributed data. Journal of hydrology, 450, 293–307. DOI: https://doi.org/10.1016/j.jhydrol.2012.04.045
Dietterich, T. G., (2002). Ensemble learning. The handbook of brain theory and neural networks, 2(1): 110–125.
Zhou, Z.-H., (2009). When semi-supervised learning meets ensemble learning. International Workshop on Multiple Classifier Systems, 529–538, Springer. DOI: https://doi.org/10.1007/978-3-642-02326-2_53
DSİ Akım Gözlem Yıllıkları. https://www.dsi.gov.tr/Sayfa/Detay/744.
Zounemat-Kermani, M., Batelaan, O., Fadaee, M., & Hinkelmann, R., (2021). Ensemble machine learning paradigms in hydrology: A review. Journal of Hydrology, 598, 126266. (https://doi.org/10.1016/j.jhydrol.2021.126266) DOI: https://doi.org/10.1016/j.jhydrol.2021.126266
Opitz, D., & Maclin, R., (1999). Popular ensemble methods: An empirical study. Journal of artificial intelligence research, 11, 169–198. DOI: https://doi.org/10.1613/jair.614
Collins, R., (2018). Machine Learning with Bagging and Boosting. Amazon Digital Services LLC - Kdp Print Us. Retrieved from https://books.google.com.tr/books?id=Ch23vAEACAAJ
Bühlmann, P., (2012). Bagging, boosting and ensemble methods. In Handbook of computational statistics, 985–1022, Springer. DOI: https://doi.org/10.1007/978-3-642-21551-3_33
Willmott, C. J., (1982). Some comments on the evaluation of model performance. Bulletin of the American Meteorological Society, 63(11), 1309–1313. DOI: https://doi.org/10.1175/1520-0477(1982)063<1309:SCOTEO>2.0.CO;2
Chicco, D., Warrens, M. J., & Jurman, G., (2021). The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Computer Science, 7: e623. DOI: https://doi.org/10.7717/peerj-cs.623
Kim, S., Alizamir, M., Kim, N. W., & Kisi, O., (2020). Bayesian model averaging: A unique model enhancing forecasting accuracy for daily streamflow based on different antecedent time series. Sustainability (Switzerland), 12(22), 1–22. (https://doi.org/10.3390/su12229720) DOI: https://doi.org/10.3390/su12229720
Adnan, R. M., Liang, Z., Trajkovic, S., Zounemat-Kermani, M., et al., (2019). Daily streamflow prediction using optimally pruned extreme learning machine. Journal of Hydrology, 577, 123981. DOI: https://doi.org/10.1016/j.jhydrol.2019.123981