Site-specific Deterministic Temperature and Humidity Forecasts with Explainable and Reliable Machine Learning

2404.03310

Published 4/5/2024 by MengMeng Han, Tennessee Leeuwenburg, Brad Murphy

🌿

Abstract

Site-specific weather forecasts are essential to accurate prediction of power demand and are consequently of great interest to energy operators. However, weather forecasts from current numerical weather prediction (NWP) models lack the fine-scale detail to capture all important characteristics of localised real-world sites. Instead they provide weather information representing a rectangular gridbox (usually kilometres in size). Even after post-processing and bias correction, area-averaged information is usually not optimal for specific sites. Prior work on site optimised forecasts has focused on linear methods, weighted consensus averaging, time-series methods, and others. Recent developments in machine learning (ML) have prompted increasing interest in applying ML as a novel approach towards this problem. In this study, we investigate the feasibility of optimising forecasts at sites by adopting the popular machine learning model gradient boosting decision tree, supported by the Python version of the XGBoost package. Regression trees have been trained with historical NWP and site observations as training data, aimed at predicting temperature and dew point at multiple site locations across Australia. We developed a working ML framework, named 'Multi-SiteBoost' and initial testing results show a significant improvement compared with gridded values from bias-corrected NWP models. The improvement from XGBoost is found to be comparable with non-ML methods reported in literature. With the insights provided by SHapley Additive exPlanations (SHAP), this study also tests various approaches to understand the ML predictions and increase the reliability of the forecasts generated by ML.

Create account to get full access

Overview

The paper investigates using machine learning, specifically gradient boosting decision trees, to improve weather forecasts at specific locations, beyond what is possible with traditional numerical weather prediction models.
The proposed "Multi-SiteBoost" framework aims to better capture the local nuances of weather conditions at individual sites by training regression trees on historical weather data.
The study reports significant improvements in forecasting temperature and dew point compared to bias-corrected numerical weather prediction models, with the machine learning approach providing results comparable to other non-ML methods.
The paper also explores using SHAP to gain insights into the machine learning model's predictions and increase the reliability of the forecasts.

Plain English Explanation

Weather forecasts are essential for energy operators to accurately predict power demand. However, current weather prediction models often lack the detailed, site-specific information needed for this purpose. These models provide weather data for large grid areas, which may not accurately represent the conditions at a particular location.

The researchers in this study explored using a machine learning approach, specifically gradient boosting decision trees, to improve weather forecasts at specific sites. They developed a framework called "Multi-SiteBoost" that trains regression trees on historical weather data from both numerical weather prediction models and actual site observations.

The goal was to capture the unique local characteristics of each site and generate more accurate forecasts of temperature and dew point. The results showed that the machine learning-based approach significantly outperformed the gridded weather data from the traditional numerical models, even after the models were corrected for bias.

The improvements achieved by the machine learning method were comparable to those reported in previous studies using other non-ML techniques. Additionally, the researchers used SHAP to understand the factors driving the machine learning model's predictions, which can help increase the reliability of the forecasts.

Overall, this research demonstrates the potential of machine learning to enhance site-specific weather forecasting, which is crucial for energy operators to accurately plan and manage power supply and demand.

Technical Explanation

The paper investigates the use of gradient boosting decision trees, implemented using the Python XGBoost package, to optimize weather forecasts at specific site locations. The researchers developed a framework called "Multi-SiteBoost" that trains regression trees on historical numerical weather prediction (NWP) data and site observations to predict temperature and dew point.

The motivation for this work is that current NWP models, even after post-processing and bias correction, often do not provide the fine-scale detail needed to capture the unique characteristics of localized real-world sites. The area-averaged information from these models is usually not optimal for specific sites.

The researchers trained the XGBoost regression trees using NWP model outputs and site-level observational data as inputs, with the goal of predicting temperature and dew point at multiple locations across Australia. Initial testing results show a significant improvement in forecast accuracy compared to the gridded values from the bias-corrected NWP models.

The performance improvement achieved by the XGBoost approach was found to be comparable to that of other non-ML methods reported in the literature, such as linear methods, weighted consensus averaging, and time-series techniques.

Additionally, the study leverages SHAP to gain insights into the machine learning model's predictions and increase the reliability of the forecasts generated. This aligns with the growing interest in explainable AI to better understand and trust the outputs of complex models.

Critical Analysis

The paper presents a promising approach to improving site-specific weather forecasting using machine learning, which is an important problem for energy operators. The authors have developed a working framework and demonstrated encouraging initial results.

However, the study is limited to temperature and dew point forecasts, and it would be interesting to see the model's performance on other weather variables relevant to energy demand, such as wind speed, solar radiation, and precipitation. Additionally, the evaluation was conducted on a set of sites across Australia, and further testing on a broader range of locations would help validate the generalizability of the approach.

The paper also does not provide detailed information on the specific machine learning hyperparameters used or the feature engineering process. Sharing more insights into the model development and optimization process could help other researchers and practitioners better understand the factors contributing to the improved forecast accuracy.

Furthermore, the paper does not address the potential computational and deployment challenges of implementing the "Multi-SiteBoost" framework in an operational setting. Exploring the scalability and real-time performance of the approach would be valuable for assessing its practical feasibility.

Overall, the research presents an interesting application of machine learning to enhance site-specific weather forecasting, which could have significant implications for energy market forecasting and optimizing power grid operations. Further development and validation of the proposed framework could lead to valuable improvements in weather-related decision-making for energy systems.

Conclusion

This study investigates the use of gradient boosting decision trees, implemented with the XGBoost library, to optimize weather forecasts at specific site locations. The researchers developed a "Multi-SiteBoost" framework that trains regression trees on historical numerical weather prediction data and site observations to predict temperature and dew point.

The results show a significant improvement in forecast accuracy compared to the gridded values from bias-corrected numerical weather prediction models, with the machine learning approach providing results comparable to other non-ML methods reported in the literature. The study also demonstrates the use of SHAP to gain insights into the machine learning model's predictions, which can help increase the reliability of the forecasts.

This research highlights the potential of machine learning to enhance site-specific weather forecasting, a crucial capability for energy operators to accurately plan and manage power supply and demand. Further development and validation of the proposed framework could lead to valuable improvements in weather-related decision-making for energy systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

WeatherGNN: Exploiting Meteo- and Spatial-Dependencies for Local Numerical Weather Prediction Bias-Correction

Binqing Wu, Weiqi Chen, Wengwei Wang, Bingqing Peng, Liang Sun, Ling Chen

Due to insufficient local area information, numerical weather prediction (NWP) may yield biases for specific areas. Previous studies correct biases mainly by employing handcrafted features or applying data-driven methods intuitively, overlooking the complicated dependencies between weather factors and between areas. To address this issue, we propose WeatherGNN, a local NWP bias-correction method that utilizes Graph Neural Networks (GNNs) to exploit meteorological dependencies and spatial dependencies under the guidance of domain knowledge. Specifically, we introduce a factor GNN to capture area-specific meteorological dependencies adaptively based on spatial heterogeneity and a fast hierarchical GNN to capture dynamic spatial dependencies efficiently guided by Tobler's first and second laws of geography. Our experimental results on two real-world datasets demonstrate that WeatherGNN achieves the state-of-the-art performance, outperforming the best baseline with an average of 4.75 % on RMSE.

6/12/2024

cs.LG

Wind Power Prediction across Different Locations using Deep Domain Adaptive Learning

Md Saiful Islam Sajol, Md Shazid Islam, A S M Jahid Hasan, Md Saydur Rahman, Jubair Yusuf

Accurate prediction of wind power is essential for the grid integration of this intermittent renewable source and aiding grid planners in forecasting available wind capacity. Spatial differences lead to discrepancies in climatological data distributions between two geographically dispersed regions, consequently making the prediction task more difficult. Thus, a prediction model that learns from the data of a particular climatic region can suffer from being less robust. A deep neural network (DNN) based domain adaptive approach is proposed to counter this drawback. Effective weather features from a large set of weather parameters are selected using a random forest approach. A pre-trained model from the source domain is utilized to perform the prediction task, assuming no source data is available during target domain prediction. The weights of only the last few layers of the DNN model are updated throughout the task, keeping the rest of the network unchanged, making the model faster compared to the traditional approaches. The proposed approach demonstrates higher accuracy ranging from 6.14% to even 28.44% compared to the traditional non-adaptive method.

5/21/2024

cs.LG

📈

Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting

Elena Orlova, Haokun Liu, Raphael Rossellini, Benjamin A. Cash, Rebecca Willett

Producing high-quality forecasts of key climate variables, such as temperature and precipitation, on subseasonal time scales has long been a gap in operational forecasting. This study explores an application of machine learning (ML) models as post-processing tools for subseasonal forecasting. Lagged numerical ensemble forecasts (i.e., an ensemble where the members have different initialization dates) and observational data, including relative humidity, pressure at sea level, and geopotential height, are incorporated into various ML methods to predict monthly average precipitation and two-meter temperature two weeks in advance for the continental United States. For regression, quantile regression, and tercile classification tasks, we consider using linear models, random forests, convolutional neural networks, and stacked models (a multi-model approach based on the prediction of the individual ML models). Unlike previous ML approaches that often use ensemble mean alone, we leverage information embedded in the ensemble forecasts to enhance prediction accuracy. Additionally, we investigate extreme event predictions that are crucial for planning and mitigation efforts. Considering ensemble members as a collection of spatial forecasts, we explore different approaches to using spatial information. Trade-offs between different approaches may be mitigated with model stacking. Our proposed models outperform standard baselines such as climatological forecasts and ensemble means. In addition, we investigate feature importance, trade-offs between using the full ensemble or only the ensemble mean, and different modes of accounting for spatial variability.

6/5/2024

cs.LG

↗️

Dynamic Line Rating using Hyper-local Weather Predictions: A Machine Learning Approach

Henri Manninen, Markus Lippus, Georg Rute

Dynamic Line Rating (DLR) systems are crucial for renewable energy integration in transmission networks. However, traditional methods relying on sensor data face challenges due to the impracticality of installing sensors on every pole or span. Additionally, sensor-based approaches may struggle predicting DLR in rapidly changing weather conditions. This paper proposes a novel approach, leveraging machine learning (ML) techniques alongside hyper-local weather forecast data. Unlike conventional methods, which solely rely on sensor data, this approach utilizes ML models trained to predict hyper-local weather parameters on a full network scale. Integrating topographical data enhances prediction accuracy by accounting for landscape features and obstacles around overhead lines. The paper introduces confidence intervals for DLR assessments to mitigate risks associated with uncertainties. A case study from Estonia demonstrates the practical implementation of the proposed methodology, highlighting its effectiveness in real-world scenarios. By addressing limitations of sensor-based approaches, this research contributes to the discourse of renewable energy integration in transmission systems, advancing efficiency and reliability in the power grid.

5/22/2024

cs.LG cs.SY eess.SY