Advances in Land Surface Model-based Forecasting: A comparative study of LSTM, Gradient Boosting, and Feedforward Neural Network Models as prognostic state emulators

Read original: arXiv:2407.16463 - Published 7/24/2024 by Marieke Wesselkamp, Matthew Chantry, Ewan Pinnington, Margarita Choulga, Souhail Boussetta, Maria Kalweit, Joschka Boedecker, Carsten F. Dormann, Florian Pappenberger, Gianpaolo Balsamo

Advances in Land Surface Model-based Forecasting: A comparative study of LSTM, Gradient Boosting, and Feedforward Neural Network Models as prognostic state emulators

Overview

This research paper compares the performance of three different machine learning models - Long Short-Term Memory (LSTM), Gradient Boosting, and Feedforward Neural Network - in emulating the prognostic states of a Land Surface Model.
The models were trained and evaluated on a dataset of meteorological variables and land surface states.
The goal was to assess the ability of these models to serve as efficient surrogates for the computationally expensive Land Surface Model, enabling faster forecasting.

Plain English Explanation

The research in this paper focused on finding better ways to forecast the future state of the land surface, which is important for understanding things like drought, water routing, and energy usage.

The researchers tested three different machine learning models - LSTM, Gradient Boosting, and Feedforward Neural Network - to see how well they could predict the future state of the land surface based on current weather conditions and other data. This is important because the main model used for land surface forecasting, called a Land Surface Model, is very computationally expensive and slow.

The goal was to find a faster, more efficient machine learning model that could act as a "stand-in" for the full Land Surface Model, allowing for quicker forecasts without sacrificing too much accuracy. The researchers trained and evaluated the three models on a dataset of weather and land surface measurements to see which one performed the best at this task.

Technical Explanation

The researchers in this paper conducted a comparative study to evaluate the performance of three different machine learning models - LSTM, Gradient Boosting, and Feedforward Neural Network - in emulating the prognostic states of a Land Surface Model.

The experiment setup involved training each of the three models on a dataset containing historical meteorological variables (e.g., temperature, precipitation, radiation) and corresponding land surface states (e.g., soil moisture, evapotranspiration, surface temperature) simulated by a Land Surface Model. The models were then tasked with predicting future land surface states given the meteorological inputs.

The performance of the models was assessed using various error metrics, such as root mean squared error and mean absolute error, calculated between the model predictions and the "ground truth" Land Surface Model outputs. The researchers also analyzed the computational efficiency of the models in terms of training and inference times.

The results showed that the LSTM model generally outperformed the other two approaches, demonstrating superior accuracy in emulating the Land Surface Model's prognostic states. The Feedforward Neural Network also performed well, while the Gradient Boosting model exhibited the highest errors. Additionally, the LSTM model was found to be the most computationally efficient, with faster training and inference times compared to the other two models.

Critical Analysis

The paper provides a comprehensive and rigorous comparative analysis of the three machine learning models in the context of Land Surface Model emulation. The researchers have carefully designed the experiments, utilized appropriate evaluation metrics, and presented a detailed discussion of the results.

One potential limitation of the study is the use of a single Land Surface Model and dataset. It would be valuable to expand the evaluation to include multiple Land Surface Models and a wider range of environmental conditions to assess the generalizability of the findings.

Additionally, the paper does not delve into the interpretability of the models, which is an important consideration for real-world applications. Further research could explore explainable AI techniques to understand the inner workings of the models and their decision-making processes.

Finally, the paper could have discussed potential challenges and future research directions, such as the integration of the surrogate models into operational forecasting systems or the exploration of hybrid approaches that combine the strengths of multiple models.

Conclusion

This research paper presents a comparative study of LSTM, Gradient Boosting, and Feedforward Neural Network models in the context of emulating the prognostic states of a Land Surface Model. The results demonstrate the superior performance of the LSTM model in terms of accuracy and computational efficiency, suggesting its potential as an effective surrogate for the computationally expensive Land Surface Model.

The findings of this study have important implications for the field of land surface modeling, as the development of efficient surrogate models can significantly enhance the speed and accessibility of land surface forecasting, ultimately leading to improved decision-making in various environmental and climate-related applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Advances in Land Surface Model-based Forecasting: A comparative study of LSTM, Gradient Boosting, and Feedforward Neural Network Models as prognostic state emulators

Marieke Wesselkamp, Matthew Chantry, Ewan Pinnington, Margarita Choulga, Souhail Boussetta, Maria Kalweit, Joschka Boedecker, Carsten F. Dormann, Florian Pappenberger, Gianpaolo Balsamo

Most useful weather prediction for the public is near the surface. The processes that are most relevant for near-surface weather prediction are also those that are most interactive and exhibit positive feedback or have key role in energy partitioning. Land surface models (LSMs) consider these processes together with surface heterogeneity and forecast water, carbon and energy fluxes, and coupled with an atmospheric model provide boundary and initial conditions. This numerical parametrization of atmospheric boundaries being computationally expensive, statistical surrogate models are increasingly used to accelerated progress in experimental research. We evaluated the efficiency of three surrogate models in speeding up experimental research by simulating land surface processes, which are integral to forecasting water, carbon, and energy fluxes in coupled atmospheric models. Specifically, we compared the performance of a Long-Short Term Memory (LSTM) encoder-decoder network, extreme gradient boosting, and a feed-forward neural network within a physics-informed multi-objective framework. This framework emulates key states of the ECMWF's Integrated Forecasting System (IFS) land surface scheme, ECLand, across continental and global scales. Our findings indicate that while all models on average demonstrate high accuracy over the forecast period, the LSTM network excels in continental long-range predictions when carefully tuned, the XGB scores consistently high across tasks and the MLP provides an excellent implementation-time-accuracy trade-off. The runtime reduction achieved by the emulators in comparison to the full numerical models are significant, offering a faster, yet reliable alternative for conducting numerical experiments on land surfaces.

7/24/2024

Toward Routing River Water in Land Surface Models with Recurrent Neural Networks

Mauricio Lima, Katherine Deck, Oliver R. A. Dunbar, Tapio Schneider

Machine learning is playing an increasing role in hydrology, supplementing or replacing physics-based models. One notable example is the use of recurrent neural networks (RNNs) for forecasting streamflow given observed precipitation and geographic characteristics. Training of such a model over the continental United States (CONUS) demonstrated that a single set of model parameters can be used across independent catchments, and that RNNs can outperform physics-based models. In this work, we take a next step and study the performance of RNNs for river routing in land surface models (LSMs). Instead of observed precipitation, the LSM-RNN uses instantaneous runoff calculated from physics-based models as an input. We train the model with data from river basins spanning the globe and test it in streamflow hindcasts. The model demonstrates skill at generalization across basins (predicting streamflow in catchments not used in training) and across time (predicting streamflow during years not used in training). We compare the predictions from the LSM-RNN to an existing physics-based model calibrated with a similar dataset and find that the LSM-RNN outperforms the physics based model. Our results show that RNNs are effective for global streamflow prediction from runoff inputs and motivate the development of complete routing models that can capture nested sub-basis connections.

9/16/2024

🔮

Long-term drought prediction using deep neural networks based on geospatial weather data

Alexander Marusov, Vsevolod Grabar, Yury Maximov, Nazar Sotiriadi, Alexander Bulkin, Alexey Zaytsev

The problem of high-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance. Yet, it is still unsolved with reasonable accuracy due to data complexity and aridity stochasticity. We tackle drought data by introducing an end-to-end approach that adopts a spatio-temporal neural network model with accessible open monthly climate data as the input. Our systematic research employs diverse proposed models and five distinct environmental regions as a testbed to evaluate the efficacy of the Palmer Drought Severity Index (PDSI) prediction. Key aggregated findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts. At the same time, the Convolutional LSTM excels in longer-term forecasting. Both models achieved high ROC AUC scores: 0.948 for one month ahead and 0.617 for twelve months ahead forecasts, becoming closer to perfect ROC-AUC by $54%$ and $16%$, respectively, c.t. classic approaches.

7/2/2024

🌿

Site-specific Deterministic Temperature and Humidity Forecasts with Explainable and Reliable Machine Learning

MengMeng Han, Tennessee Leeuwenburg, Brad Murphy

Site-specific weather forecasts are essential to accurate prediction of power demand and are consequently of great interest to energy operators. However, weather forecasts from current numerical weather prediction (NWP) models lack the fine-scale detail to capture all important characteristics of localised real-world sites. Instead they provide weather information representing a rectangular gridbox (usually kilometres in size). Even after post-processing and bias correction, area-averaged information is usually not optimal for specific sites. Prior work on site optimised forecasts has focused on linear methods, weighted consensus averaging, time-series methods, and others. Recent developments in machine learning (ML) have prompted increasing interest in applying ML as a novel approach towards this problem. In this study, we investigate the feasibility of optimising forecasts at sites by adopting the popular machine learning model gradient boosting decision tree, supported by the Python version of the XGBoost package. Regression trees have been trained with historical NWP and site observations as training data, aimed at predicting temperature and dew point at multiple site locations across Australia. We developed a working ML framework, named 'Multi-SiteBoost' and initial testing results show a significant improvement compared with gridded values from bias-corrected NWP models. The improvement from XGBoost is found to be comparable with non-ML methods reported in literature. With the insights provided by SHapley Additive exPlanations (SHAP), this study also tests various approaches to understand the ML predictions and increase the reliability of the forecasts generated by ML.

4/5/2024