Prediction of Daily Streamflow Data Using Ensemble Learning Models

Main Article Content

Levent Latifoğlu
Ümit Canpolat

Abstract

Estimating river streamflow is a key task for both flood protection and optimal water resource management. The high degree of uncertainty regarding watershed characteristics, hydrological processes, and climatic factors affecting river flows makes streamflow estimation a challenging problem. These reasons, combined with the increasing prevalence of data on streamflow and precipitation, often lead to data-driven models being preferred over physically-based or conceptual forecasting models. The goal of this study is to predict daily river streamflow data with high accuracy using bagging and boosting approaches, which are ensemble learning methods. In addition, the effect of tributary streamflow on the forecast performance was analyzed in the estimation of the streamflow data. According to the results obtained, it has been shown that ensemble learning models are successful in estimating daily streamflow data, and if the tributary streamflow data is also used as input in the estimation of the streamflow, the determination and correlation performance parameters are improved, and the streamflow data can be estimated using tributary streamflow data.

Downloads

Download data is not yet available.

Article Details

How to Cite
Latifoğlu, L., & Canpolat, Ümit. (2022). Prediction of Daily Streamflow Data Using Ensemble Learning Models. The European Journal of Research and Development, 2(4), 356–371. https://doi.org/10.56038/ejrnd.v2i4.218
Section
Articles

References

Khazaee Poul, A., Shourian, M., & Ebrahimi, H., (2019). A comparative study of MLR, KNN, ANN and ANFIS models with wavelet transform in monthly streamflow prediction. Water Resources Management, 33(8), 2907–2923. DOI: https://doi.org/10.1007/s11269-019-02273-0

de Santana Moreira, R. M., & Celeste, A. B., (2017). Performance evaluation of implicit stochastic reservoir operation optimization supported by long-term mean inflow forecast. Stochastic Environmental Research and Risk Assessment, 31(9), 2357–2364. DOI: https://doi.org/10.1007/s00477-016-1341-4

Lehner, F., Wood, A. W., Llewellyn, D., Blatchford, D. B., et al.,(2017). Mitigating the impacts of climate nonstationarity on seasonal streamflow predictability in the US Southwest. Geophysical Research Letters, 44(24), 12–208. DOI: https://doi.org/10.1002/2017GL076043

Hamlet, A. F., Huppert, D., & Lettenmaier, D. P., (2002). Economic value of long-lead streamflow forecasts for Columbia River hydropower. Journal of Water Resources Planning and Management, 128(2), 91–101. DOI: https://doi.org/10.1061/(ASCE)0733-9496(2002)128:2(91)

Yaseen, Z. M., Sulaiman, S. O., Deo, R. C., & Chau, K.-W., (2019). An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. Journal of Hydrology, 569, 387–408. DOI: https://doi.org/10.1016/j.jhydrol.2018.11.069

Hadi, S. J., & Tombul, M., (2018). Monthly streamflow forecasting using continuous wavelet and multi-gene genetic programming combination. Journal of Hydrology, 561, 674–687. DOI: https://doi.org/10.1016/j.jhydrol.2018.04.036

Hadi, S. J., & Tombul, M., (2018). Streamflow forecasting using four wavelet transformation combinations approaches with data-driven models: a comparative study. Water Resources Management, 32(14): 4661–4679. DOI: https://doi.org/10.1007/s11269-018-2077-3

Toth, E., (2009). Classification of hydro-meteorological conditions and multiple artificial neural networks for streamflow forecasting. Hydrology and Earth System Sciences, 13(9), 1555–1566. DOI: https://doi.org/10.5194/hess-13-1555-2009

Boucher, M., Quilty, J., & Adamowski, J., (2020). Data assimilation for streamflow forecasting using extreme learning machines and multilayer perceptrons. Water Resources Research, 56(6), e2019WR026226. DOI: https://doi.org/10.1029/2019WR026226

Wu, C. L., Chau, K. W., & Li, Y. S., (2009). Predicting monthly streamflow using data‐driven models coupled with data‐preprocessing techniques. Water Resources Research, 45(8). DOI: https://doi.org/10.1029/2007WR006737

Quilty, J., & Adamowski, J., (2020). A stochastic wavelet-based data-driven framework for forecasting uncertain multiscale hydrological and water resources processes. Environmental Modelling & Software, 130, 104718. DOI: https://doi.org/10.1016/j.envsoft.2020.104718

Wu, C. L., & Chau, K.-W., (2010). Data-driven models for monthly streamflow time series prediction. Engineering Applications of Artificial Intelligence, 23(8), 1350–1367. DOI: https://doi.org/10.1016/j.engappai.2010.04.003

Wang, W., Van Gelder, P. H., Vrijling, J. K., & Ma, J., (2006). Forecasting daily streamflow using hybrid ANN models. Journal of Hydrology, 324(1–4), 383–399. DOI: https://doi.org/10.1016/j.jhydrol.2005.09.032

Makridakis, S., & Hibon, M., (1997). ARMA models and the Box–Jenkins methodology. Journal of forecasting, 16(3), 147–163. DOI: https://doi.org/10.1002/(SICI)1099-131X(199705)16:3<147::AID-FOR652>3.0.CO;2-X

Kurunç, A., Yürekli, K., & Cevik, O., (2005). Performance of two stochastic approaches for forecasting water quality and streamflow data from Yeşilιrmak River, Turkey. Environmental Modelling & Software, 20(9), 1195–1200. DOI: https://doi.org/10.1016/j.envsoft.2004.11.001

Musa, J. J., (2013). Stochastic Modeling of Shiroro River Streamflow Process. .

Darlington, R. B., & Hayes, A. F., (2016). Regression Analysis and Linear Models: Concepts, Applications, and Implementation. Guilford Publications.

Tian, P., Lu, H., Feng, W., Guan, Y., et al., (2020). Large decrease in streamflow and sediment load of Qinghai–Tibetan Plateau driven by future climate change: A case study in Lhasa River Basin. Catena, 187, 104340. DOI: https://doi.org/10.1016/j.catena.2019.104340

Chu, H., Wei, J., Wu, W., Jiang, Y., et al., (2021). A classification-based deep belief networks model framework for daily streamflow forecasting. Journal of Hydrology, 595 (January), 125967. (https://doi.org/10.1016/j.jhydrol.2021.125967) DOI: https://doi.org/10.1016/j.jhydrol.2021.125967

Jothiprakash, V., & Magar, R. B., (2012). Multi-time-step ahead daily and hourly intermittent reservoir inflow prediction by artificial intelligent techniques using lumped and distributed data. Journal of hydrology, 450, 293–307. DOI: https://doi.org/10.1016/j.jhydrol.2012.04.045

Dietterich, T. G., (2002). Ensemble learning. The handbook of brain theory and neural networks, 2(1): 110–125.

Zhou, Z.-H., (2009). When semi-supervised learning meets ensemble learning. International Workshop on Multiple Classifier Systems, 529–538, Springer. DOI: https://doi.org/10.1007/978-3-642-02326-2_53

DSİ Akım Gözlem Yıllıkları. https://www.dsi.gov.tr/Sayfa/Detay/744.

Zounemat-Kermani, M., Batelaan, O., Fadaee, M., & Hinkelmann, R., (2021). Ensemble machine learning paradigms in hydrology: A review. Journal of Hydrology, 598, 126266. (https://doi.org/10.1016/j.jhydrol.2021.126266) DOI: https://doi.org/10.1016/j.jhydrol.2021.126266

Opitz, D., & Maclin, R., (1999). Popular ensemble methods: An empirical study. Journal of artificial intelligence research, 11, 169–198. DOI: https://doi.org/10.1613/jair.614

Collins, R., (2018). Machine Learning with Bagging and Boosting. Amazon Digital Services LLC - Kdp Print Us. Retrieved from https://books.google.com.tr/books?id=Ch23vAEACAAJ

Bühlmann, P., (2012). Bagging, boosting and ensemble methods. In Handbook of computational statistics, 985–1022, Springer. DOI: https://doi.org/10.1007/978-3-642-21551-3_33

Willmott, C. J., (1982). Some comments on the evaluation of model performance. Bulletin of the American Meteorological Society, 63(11), 1309–1313. DOI: https://doi.org/10.1175/1520-0477(1982)063<1309:SCOTEO>2.0.CO;2

Chicco, D., Warrens, M. J., & Jurman, G., (2021). The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Computer Science, 7: e623. DOI: https://doi.org/10.7717/peerj-cs.623

Kim, S., Alizamir, M., Kim, N. W., & Kisi, O., (2020). Bayesian model averaging: A unique model enhancing forecasting accuracy for daily streamflow based on different antecedent time series. Sustainability (Switzerland), 12(22), 1–22. (https://doi.org/10.3390/su12229720) DOI: https://doi.org/10.3390/su12229720

Adnan, R. M., Liang, Z., Trajkovic, S., Zounemat-Kermani, M., et al., (2019). Daily streamflow prediction using optimally pruned extreme learning machine. Journal of Hydrology, 577, 123981. DOI: https://doi.org/10.1016/j.jhydrol.2019.123981