Prediction of Dew Point Temperature Using Tree and Kernel Based Models

Document Type : Full Length Article

Authors

1 M.Sc. Student, Water Engineering and Management Department, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran

2 Assoc. Prof., Department of Water Engineering, Faculty of Agriculture, University of Tabriz, Tabriz, Iran

Abstract

Iran’s agricultural sector faces unique obstacles due to the diversity of its climatic conditions, underscoring the crucial importance of safeguarding crops against the impacts of climate change. One effective strategy to mitigate damage to the agricultural industry is the ability to accurately predict dew point temperature. The dew point refers to the temperature at which water vapor in the air condenses into liquid water or dew, given a constant air pressure. Notably, an excessively high dew point can adversely affect the performance of air conditioning systems and reduce the efficiency of coolant-based ventilation mechanisms. The formation of dew in ecosystems is influenced by a triad of key factors: radiative exchange between the Earth’s surface and the atmosphere, turbulent heat transfer, and vapor pressure. While the dew point is typically measured using a moisture meter, there also exist empirical equations that relate air temperature and humidity. However, reliable dew point forecasting often requires common meteorological parameters, such as relative humidity and precipitation, which are not consistently measured at many weather stations or may be subject to significant error. As such, regression-based estimation methods are frequently employed. Recognizing the importance of data-driven approaches in dew point estimation, this study explores the use of several predictive models, including support vector regression (SVR), the M5P tree model, the M5Rules rule-generation algorithm, Gaussian process regression (GPR), linear regression (LR), random forest (RF), and random tree (RT) models. These models were applied to data collected from two stations in Gorgan and Shahrekord, Iran, to estimate dew point temperature.
Methodology
The Gorgan basin is geographically situated between 25°54’ east longitude and 36°50’ north latitude, while the Shahrekord basin is located between 50°49’ east longitude and 32°20’ north latitude. Topographically, Shahrekord lies in the eastern segment of the Zagros Mountain range, along the Zagros fault margin. The input parameters for this study were obtained from the Iran Meteorological Organization, covering the period from 1990 to 2021. These parameters include daily maximum temperature (Tmax), daily minimum temperature (Tmin), daily average temperature (Tm), sunshine hours (sshn), average wind speed (ffm), average relative humidity (RHm), maximum relative humidity (RHmax), and minimum relative humidity (RHmin). To assess the accuracy of the input parameters and models, the dew point (DP) was extracted from the testing data and evaluated using regression and tree-based models. Additionally, eight potential scenarios were defined to estimate the daily DP.
Findings
The study findings revealed that scenarios 1, 2, and 3 exhibited the highest correlation with dew point temperature, while scenarios 4, 5, 6, 7, and 8 displayed the lowest correlation. However, upon evaluating the scenarios based on the established criteria, it was determined that scenarios 1, 2, and 3 performed less effectively than the other scenarios. Consequently, the placement of the majority of parameters (scenarios 6, 7, and 8) led to a decrease in the models’ errors. The results obtained for the Gorgan models showed that the R-value ranged from 1 to 0.952. In the Gorgan station, the highest RMSE was observed for RT-3 at 2.0307, and the lowest RMSE was for SVR-8 at 0.222. Furthermore, the best fit for SVR-8 was characterized by RMSE: 0.222, NSE: 0.999, MBE: 0.092, MAE: 0.147, WI: 1, and SI: 0.017, while the worst fit was for RT-3 with RMSE: 2.307, NSE: 0.882, MBE: 0.875, MAE: 1.745, WI: 0.971, and SI: 0.179. The results for the Shahrekord models indicated that the R-value ranged between 0.996 and 0.615. In the Shahrekord station, the highest RMSE was observed for RT-2 at 4.952, and the lowest RMSE was for SVR-7 at 0.550. Additionally, the best fit for SVR-7 was characterized by RMSE: 0.550, NSE: 0.989, MBE: 0.374, MAE: 0.346, WI: 0.997, and SI: -0.15, while the worst fit was for RT-2 with RMSE: 4.957, NSE: 0.131, MBE: 1.43, MAE: 3.914, WI: 0.754, and SI: -1.352.
Conclusion
In this study, meteorological data were fitted using various modeling techniques, including Gaussian Process Regression (GPR), Linear Regression (LR), M5P, M5Rules, Random Forest (RF), Regression Tree (RT), and Support Vector Regression (SVR), to obtain the dew point temperature at the Gorgan and Shahrekord weather stations. The performance of these models was then evaluated under different scenarios, and the best-performing models were selected for estimating the dew point temperature. The results indicate that the estimation accuracy of the models using a single input parameter, such as minimum temperature (Tmin), was lower compared to the other models in both the Shahrekord and Gorgan stations. The SVR model with scenario 8 and the M5P model with scenario 7 demonstrated the best performance in estimating the dew point temperature for the Gorgan and Shahrekord stations, respectively. Furthermore, the comparison of the selected models revealed that the SVR model had the highest accuracy among the models evaluated. For the Gorgan station, the models were ranked from high to low accuracy as follows: SVR, M5P, M5Rules, GPR, RF, and LR. For the Shahrekord station, the ranking was M5P, M5Rules, GPR, RF, RT, and LR. Additionally, the comparison of the tree-based and regression-based models showed that the regression models, such as SVR and LR, had higher accuracy in estimating the dew point temperature compared to the tree-based models, such as M5P, M5Rules, and RT.

Keywords


Alizamir M, Kim S, Zounemat-Kermani M, Heddam S, Kim NW and Singh VP, 2020. Kernel extreme learning machine: An efficient model for estimating daily dew point temperature using weather data. Water 12(9): 2600.
Mojahedi MA, Mohammadi K, Shamshirband S, Seyed Danesh A, Mostafaeipour A and Kamsin A, 2016. A hybrid computational intelligence method for predicting dew point temperature. Environmental Earth Sciences 75: 415.
Brieman L, 2001. Random Forests. Machine Learning, 45: 5-23.
Deka P, Patil AP, Yeswanth Kumar P and Naganna SR, 2017. Estimation of dew point temperature using SVM and ELM for humid and semi-arid regions of India. ISH Journal of Hydraulic Engineering 24(2): 190-197.
Dong J, Wu L, Liu X, Li Z, Gao Y, Zhang Y and Yang Q, 2020. Estimation of daily dew point temperature by using bat algorithm optimization based extreme learning machine. Applied Thermal Engineering 165: 114569.
Dong J, Zeng W, Lei G, Wu Lifeng, Chen H, Wu J, Huang J, Gaiser T, and Srivastava AK, 2022. Simulation of dew point temperature in different time scales based on grasshopper algorithm optimized extreme gradient boosting. Journal of Hydrology 606: 127452.
Fathollahzadeh N, Khalili K, Behmanesh J and Khanmohammadi N, 2018. On the reliability of soft computing methods in the estimation of dew point temperature: The case of arid regions of Iran. Computers and Electronics in Agriculture 153: 334-346.
Fischer MM, 2015. Spatial Analysis in Geography. International Encyclopedia of the Social & Behavioral Sciences. Elsevier 12: 14752-14758.
Holmes G, Hall M and Prank B, 2007. Generating rule sets from model trees. Advanced topics in artificial intelligence. Lecture Notes in Computer Science: 1774.
Kalmeg S, 2015. Analysis of weka data mining algorithm reptree, simple cart and random tree for classification of indian news. Engineering Technician 2(2): 444602.
Kermani MZ, 2011. Hourly predictive Levenberg–Marquardt ANN and multi linear regression models for predicting of dew point temperature. Meteorology and Atmospheric Physics 117: 181-192.
Kisi O, Kim S and Shiri J, 2013. Estimation of dew point temperature using neuro-fuzzy and neural network techniques. Theoretical and Applied Climatology 114: 365-373.
Lawrence MG, 2005. The relationship between relative humidity and the dewpoint temperature in moist air: a simple conversion and applications. Bulletin of the American Meteorological Society 86: 225-234.
Mahdizadeh S, Behmanesh J and Khalili K, 2017. Using MARS, SVM, GEP and empirical equations for estimation of monthly mean reference evapotranspiration. Computers and Electronics in Agriculture 139: 103-114.
Mohammadi K, Shamshirband S, Motamedi S, Petković D, Hashim R and Gocic M, 2015. Extreme learning machine based prediction of daily dew point temperature. Computers and Electronics in Agriculture 117: 214-225.
Mohammadi K, Shamshirband S, Petković D, Yee PL and Mansor Z, 2016. Using ANFIS for selection of more relevant parameters to predict dew point temperature. Applied Thermal Engineering 96: 311–319.
Naganna S, Deka P, Ghorbani M, Biazar S, Al-Ansari N and Yaseen Z, 2019. Dew point temperature estimation: application of artificial intelligence model integrated with natureinspired optimization algorithms. Water 11(4): 742.
Osouli M, Mikaeili F and Samadianfard S, 2022. Evaluating capabilities of gradient boosted tree and optimized random forest models in estimating daily dew point temperature. Environment and Water Engineering 8(3): 654–668 (in Persian with English abstract)
Qasem SN, Samadianfard S, Sadri Nahand H, Mosavi A, Shamshirband S and Chau KW, 2019. Estimating daily dew point temperature using machine learning algorithms. Water 11(3): 582.
Shank DB, McClendon RW, Paz J and Hoogenboom G, 2008. Ensemble artificial neural networks for prediction of dew point temperature. Applied Artificial Intelligence 22(6): 523-542.
Shiri J, 2019. Prediction vs. estimation of dew point temperature: assessing GEP, MARS and RF models. Hydrology: Current Research 50(2): 633–643. 
Shiri J, Kim S and Kisi O, 2014. Estimation of daily dew point temperature using genetic programming and neural networks approaches. Hydrology: Current Research 45(2): 165- 181.
Zhan C, Gan A and Hadi M, 2011. Prediction of lane clearance time of freeway incidents using the M5P tree algorithm. IEEE Transactions on Intelligent Transportation Systems 12(4): 1549–1557.