An Open and Large-Scale Dataset for Multi-Modal Climate Change-aware Crop Yield Predictions

Read original: arXiv:2406.06081 - Published 6/18/2024 by Fudong Lin, Kaleb Guillot, Summer Crawford, Yihe Zhang, Xu Yuan, Nian-Feng Tzeng

Overview

  • This paper presents CropNet, an open, terabyte-sized dataset for multi-modal, climate change-aware crop yield prediction.
  • The dataset combines three modalities (Sentinel-2 satellite imagery, WRF-HRRR weather data, and USDA crop data) to support deep learning models for predicting crop yields.
  • It covers more than 2,200 counties in the contiguous United States over six years (2017-2022) and aims to facilitate research on how growing-season weather and long-term climate change affect crop yields.

Plain English Explanation

This research paper introduces a comprehensive dataset that can be used to build AI models for predicting crop yields. The dataset includes a wide range of information, such as satellite images, weather data, and actual crop yield measurements, all of which are important for understanding how climate change affects agriculture.

The researchers gathered data for more than 2,200 counties across the contiguous United States from 2017 to 2022, making the dataset a useful resource for studying how climate change affects food production. With this large, multi-modal dataset, researchers and developers can build more accurate crop yield forecasting models, which can help farmers, policymakers, and others make better-informed decisions about food security and sustainability.

Technical Explanation

The paper presents an open and large-scale dataset for multi-modal climate change-aware crop yield predictions. The dataset includes various data sources, such as satellite imagery, weather data, and crop yield information, to enable the development of advanced AI models for predicting crop yields.

The dataset, named CropNet, covers more than 2,200 counties in the contiguous United States over six years (2017-2022). It combines three modalities: Sentinel-2 satellite imagery, the WRF-HRRR Computed Dataset of weather variables (temperature, precipitation, etc.), and the USDA Crop Dataset, which supplies ground-truth crop information at the county level. The researchers also provide the CropNet package, which offers three types of APIs for downloading the data for a chosen time and region and for building deep learning models on top of it.
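To make the idea of combining county-level modalities concrete, here is a minimal sketch of joining weather records to yield labels on a shared (county, year) key. It does not use the CropNet package or its APIs. The record layouts, the county identifiers, the units, and the `build_samples` helper are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical daily weather records (not real data):
# (county_id, year, temperature_c, precipitation_mm)
weather = [
    ("19001", 2021, 21.0, 3.0),
    ("19001", 2021, 25.0, 0.0),
    ("19001", 2022, 23.0, 1.0),
    ("19003", 2021, 20.0, 5.0),
]

# Hypothetical annual yield labels (not real data):
# (county_id, year) -> yield in bushels per acre
yields = {
    ("19001", 2021): 190.0,
    ("19001", 2022): 185.0,
    ("19003", 2021): 178.5,
}


def build_samples(weather_rows, yield_table):
    """Aggregate weather per (county, year) and attach the yield label."""
    grouped = defaultdict(list)
    for county_id, year, temp, precip in weather_rows:
        grouped[(county_id, year)].append((temp, precip))

    samples = []
    for key in sorted(grouped):
        if key not in yield_table:
            continue  # no label for this county-year, so skip it
        temps, precips = zip(*grouped[key])
        samples.append({
            "county_id": key[0],
            "year": key[1],
            "mean_temp_c": mean(temps),
            "total_precip_mm": sum(precips),
            "yield": yield_table[key],
        })
    return samples


samples = build_samples(weather, yields)
for sample in samples:
    print(sample)
```

A real pipeline would also attach imagery for each county and year and would likely keep the weather as a time series rather than collapsing it to two summary numbers. The sketch only shows the keying step that lets separate modalities describe the same prediction target.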

The availability of this diverse and comprehensive dataset can foster the development of more accurate and robust crop yield prediction models that can account for the complex interactions between climate, weather, and agricultural practices. This can have significant implications for improving food security, resource allocation, and climate change adaptation strategies.

Critical Analysis

The dataset presented in this paper is a valuable resource for the scientific community, as it addresses an important gap in the availability of large-scale, multi-modal data for crop yield prediction research. By including a wide range of data sources, the dataset can enable the development of more sophisticated and accurate AI models that can better capture the complex relationships between climate, weather, and crop yields.

However, the paper does not provide a detailed assessment of the dataset's quality, potential biases, or limitations. It would be helpful to understand the data collection and preprocessing methods, as well as the representativeness of the selected regions and crops. Additionally, the paper does not discuss the potential challenges in using this dataset, such as the availability of ground-truth crop yield data, the alignment of different data sources, or the handling of missing or noisy data.
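The alignment and missing-data concerns raised above can be checked before any model is trained. The following is a small sketch, under the assumption that each modality can be summarized as a set of (county, year) keys it covers. The modality names, the keys, and both helper functions are hypothetical and are not taken from the paper or its tooling.

```python
# Hypothetical (county_id, year) pairs we would like to train on.
expected = [("19001", 2021), ("19001", 2022), ("19003", 2021)]

# Hypothetical coverage per modality: the keys for which data exists.
modalities = {
    "imagery": {("19001", 2021), ("19001", 2022), ("19003", 2021)},
    "weather": {("19001", 2021), ("19003", 2021)},
    "yield": {("19001", 2021), ("19001", 2022)},
}


def coverage_report(expected_keys, coverage):
    """Return, for each modality, the expected keys it is missing."""
    wanted = set(expected_keys)
    return {name: sorted(wanted - set(keys)) for name, keys in coverage.items()}


def complete_keys(expected_keys, coverage):
    """Return the expected keys that every modality covers."""
    usable = set(expected_keys)
    for keys in coverage.values():
        usable &= set(keys)
    return sorted(usable)


report = coverage_report(expected, modalities)
usable = complete_keys(expected, modalities)

for name, missing in report.items():
    print(f"{name}: missing {missing}")
print("usable for training:", usable)
```

Dropping incomplete county-years, as `complete_keys` does, is only one possible policy. Imputing a missing modality or training a model that tolerates absent inputs are alternatives, and the right choice depends on how much data each option discards.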

Further research could also explore the generalizability of the models developed using this dataset to other regions or crops, as well as the practical applications and implications of these models for farmers, policymakers, and other stakeholders.

Conclusion

This research paper presents an open and large-scale dataset for multi-modal climate change-aware crop yield predictions, which can be a significant contribution to the field of agricultural AI and climate change research. By providing access to a diverse range of data sources, this dataset can enable the development of more advanced and accurate crop yield prediction models that can help address the challenges posed by climate change and food security. The availability of this dataset can also foster collaboration and innovation among researchers, developers, and stakeholders in the agricultural sector.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

An Open and Large-Scale Dataset for Multi-Modal Climate Change-aware Crop Yield Predictions

Fudong Lin, Kaleb Guillot, Summer Crawford, Yihe Zhang, Xu Yuan, Nian-Feng Tzeng

Precise crop yield predictions are of national importance for ensuring food security and sustainable agricultural practices. While AI-for-science approaches have exhibited promising achievements in solving many scientific problems such as drug discovery, precipitation nowcasting, etc., the development of deep learning models for predicting crop yields is constantly hindered by the lack of an open and large-scale deep learning-ready dataset with multiple modalities to accommodate sufficient information. To remedy this, we introduce the CropNet dataset, the first terabyte-sized, publicly available, and multi-modal dataset specifically targeting climate change-aware crop yield predictions for the contiguous United States (U.S.) continent at the county level. Our CropNet dataset is composed of three modalities of data, i.e., Sentinel-2 Imagery, WRF-HRRR Computed Dataset, and USDA Crop Dataset, for over 2200 U.S. counties spanning 6 years (2017-2022), expected to facilitate researchers in developing versatile deep learning models for timely and precisely predicting crop yields at the county-level, by accounting for the effects of both short-term growing season weather variations and long-term climate change on crop yields. Besides, we develop the CropNet package, offering three types of APIs, for facilitating researchers in downloading the CropNet data on the fly over the time and region of interest, and flexibly building their deep learning models for accurate crop yield predictions. Extensive experiments have been conducted on our CropNet dataset via employing various types of deep learning solutions, with the results validating the general applicability and the efficacy of the CropNet dataset in climate change-aware crop yield predictions.

6/18/2024

Explainability of Sub-Field Level Crop Yield Prediction using Remote Sensing

Hiba Najjar, Miro Miranda, Marlon Nuske, Ribana Roscher, Andreas Dengel

Crop yield forecasting plays a significant role in addressing growing concerns about food security and guiding decision-making for policymakers and farmers. When deep learning is employed, understanding the learning and decision-making processes of the models, as well as their interaction with the input data, is crucial for establishing trust in the models and gaining insight into their reliability. In this study, we focus on the task of crop yield prediction, specifically for soybean, wheat, and rapeseed crops in Argentina, Uruguay, and Germany. Our goal is to develop and explain predictive models for these crops, using a large dataset of satellite images, additional data modalities, and crop yield maps. We employ a long short-term memory network and investigate the impact of using different temporal samplings of the satellite data and the benefit of adding more relevant modalities. For model explainability, we utilize feature attribution methods to quantify input feature contributions, identify critical growth stages, analyze yield variability at the field level, and explain less accurate predictions. The modeling results show an improvement when adding more modalities or using all available instances of satellite data. The explainability results reveal distinct feature importance patterns for each crop and region. We further found that the most influential growth stages on the prediction are dependent on the temporal sampling of the input data. We demonstrated how these critical growth stages, which hold significant agronomic value, closely align with the existing literature in agronomy and crop development biology.

7/12/2024

EuroCropsML: A Time Series Benchmark Dataset For Few-Shot Crop Type Classification

Joana Reuss, Jan Macdonald, Simon Becker, Lorenz Richter, Marco Körner

We introduce EuroCropsML, an analysis-ready remote sensing machine learning dataset for time series crop type classification of agricultural parcels in Europe. It is the first dataset designed to benchmark transnational few-shot crop type classification algorithms that supports advancements in algorithmic development and research comparability. It comprises 706 683 multi-class labeled data points across 176 classes, featuring annual time series of per-parcel median pixel values from Sentinel-2 L1C data for 2021, along with crop type labels and spatial coordinates. Based on the open-source EuroCrops collection, EuroCropsML is publicly available on Zenodo.

7/25/2024

Corn Yield Prediction Model with Deep Neural Networks for Smallholder Farmer Decision Support System

Chollette Olisah, Lyndon Smith, Melvyn Smith, Lawrence Morolake, Osi Ojukwu

Crop yield prediction has been modeled on the assumption that there is no interaction between weather and soil variables. However, this paper argues that an interaction exists, and it can be finely modelled using the Kendall Correlation coefficient. Given the nonlinearity of the interaction between weather and soil variables, a deep neural network regressor (DNNR) is carefully designed with consideration to the depth, number of neurons of the hidden layers, and the hyperparameters with their optimizations. Additionally, a new metric, the average of absolute root squared error (ARSE), is proposed to combine the strengths of root mean square error (RMSE) and mean absolute error (MAE). With the ARSE metric, the proposed DNNR(s), optimised random forest regressor (RFR) and the extreme gradient boosting regressor (XGBR) achieved impressively small yield errors, 0.0172 t/ha, and 0.0243 t/ha, 0.0001 t/ha, and 0.001 t/ha, respectively. However, with changes to the explanatory variables to ensure generalizability to unforeseen data, the DNNR(s) performed best. Further analysis reveals that a strong interaction does exist between weather and soil variables. Precisely, yield is observed to increase when precipitation is reduced and silt increased, and vice-versa. However, the degree of decrease or increase is not quantified in this paper. Contrary to existing yield models targeted towards agricultural policies and global food security, the goal of the proposed corn yield model is to empower the smallholder farmer to farm smartly and intelligently, thus the prediction model is integrated into a mobile application that includes education, and a farmer-to-market access module.

4/16/2024