Unlocking the Use of Raw Multispectral Earth Observation Imagery for Onboard Artificial Intelligence

Read original: arXiv:2305.11891 - Published 9/11/2024 by Gabriele Meoni, Roberto Del Prete, Federico Serva, Alix De Beussche, Olivier Colin, Nicolas Long'ep'e
Total Score

0

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • The paper presents a novel methodology to automate the creation of datasets for the detection of target events, such as wildfires or vessels, from raw Sentinel-2 satellite data and other multispectral Earth observation data.
  • The approach involves pre-processing the raw data through spatial band registration and georeferencing, then applying event-specific algorithms to detect the target events, which are then re-projected back onto the corresponding raw images.
  • The authors apply this methodology to create the THRawS (Thermal Hotspots in Raw Sentinel-2 data) dataset, which includes 1090 samples of warm thermal hotspots and 33,335 event-free acquisitions.
  • The dataset and associated toolkits are intended to enable research on energy-efficient pre-processing algorithms and end-to-end AI-based processing systems for Earth observation satellites.

Plain English Explanation

The paper describes a new way to automatically build datasets from the raw, unprocessed data collected by Earth observation satellites, like Sentinel-2. These raw data files contain the original measurements from the satellite's sensors, before they undergo any processing or interpretation.

The researchers developed a process to take this raw data and identify specific events or objects, like wildfires or ships. First, they pre-process the raw data to align the different color bands and georeference the pixels. Then, they use specialized algorithms to detect the target events within the processed data. Finally, they map those detected events back onto the original raw data files.

By creating this dataset, called THRawS, the researchers want to enable other scientists to study new ways of analyzing satellite data more efficiently. Currently, access to the raw satellite data is limited, which makes it hard to develop and test new AI-based techniques for processing the data directly onboard the satellites. The THRawS dataset and the methodology used to create it aim to provide a template and resources for future research in this area.

Technical Explanation

The key elements of the paper's technical approach are:

  1. Pre-processing pipeline: The raw satellite data is first processed through a pipeline that includes spatial band registration (aligning the different color bands) and georeferencing (mapping the pixels to geographic coordinates).

  2. Event detection: The pre-processed data is then analyzed using event-specific algorithms to detect target events, such as wildfires or vessels. These detected events are identified within the Level-1C processed data products.

  3. Re-projection: The detected events are then re-projected back onto the corresponding raw satellite data granules (individual image files).

The authors applied this methodology to create the THRawS dataset, which includes 1090 samples of warm thermal hotspots (e.g., wildfires, volcanic eruptions) and 33,335 event-free acquisitions from Sentinel-2 raw data. This dataset is intended to enable research on energy-efficient pre-processing algorithms and end-to-end AI-based processing systems that could be deployed directly on Earth observation satellites.

Critical Analysis

The paper introduces a valuable methodology and dataset to support research on more efficient onboard processing of raw satellite data. By providing access to the raw data alongside the detected events, the THRawS dataset addresses a key limitation in the field, where raw data availability has hindered the development of lightweight pre-processing techniques and end-to-end AI pipelines.

However, the paper does not discuss the accuracy or reliability of the event detection algorithms used to create the dataset. The authors also do not address potential issues with the georeferencing or band registration steps, which could introduce errors or distortions in the final dataset.

Additionally, the paper focuses solely on thermal hotspot detection, while other important Earth observation tasks, such as land cover classification or object tracking, are not covered. Expanding the dataset to include a wider range of event types and applications could further increase its value and impact.

Conclusion

This paper presents a novel methodology and dataset that can help advance research on efficient onboard processing of raw satellite data using AI techniques. The THRawS dataset, which includes both detected events and the corresponding raw data, provides a valuable resource for the community to develop and test new approaches.

By addressing the current limitations in raw data availability, this work has the potential to enable more accurate and responsive Earth observation applications, such as natural disaster monitoring and response. The authors' focus on creating a replicable methodology also suggests that the approach could be applied to other satellite missions and Earth observation data sources, further expanding the impact of this research.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Total Score

0

Unlocking the Use of Raw Multispectral Earth Observation Imagery for Onboard Artificial Intelligence

Gabriele Meoni, Roberto Del Prete, Federico Serva, Alix De Beussche, Olivier Colin, Nicolas Long'ep'e

Nowadays, there is growing interest in applying Artificial Intelligence (AI) on board Earth Observation (EO) satellites for time-critical applications, such as natural disaster response. However, the unavailability of raw satellite data currently hinders research on lightweight pre-processing techniques and limits the exploration of end-to-end pipelines, which could offer more efficient and accurate extraction of insights directly from the source data. To fill this gap, this work presents a novel methodology to automate the creation of datasets for the detection of target events (e.g., warm thermal hotspots) or objects (e.g., vessels) from Sentinel-2 raw data and other multispectral EO pushbroom raw imagery. The presented approach first processes the raw data by applying a pipeline consisting of spatial band registration and georeferencing of the raw data pixels. Then, it detects the target events by leveraging event-specific state-of-the-art algorithms on the Level-1C products, which are mosaicked and cropped on the georeferenced correspondent raw granule area. The detected events are finally re-projected back onto the corresponding raw images. We apply the proposed methodology to realize THRawS (Thermal Hotspots in Raw Sentinel-2 data), the first dataset of Sentinel-2 raw data containing warm thermal hotspots. THRawS includes 1090 samples containing wildfires, volcanic eruptions, and 33,335 event-free acquisitions to enable thermal hotspot detection and general classification applications. This dataset and associated toolkits provide the community with both an immediately useful resource as well as a framework and methodology acting as a template for future additions. With this work, we hope to pave the way for research on energy-efficient pre-processing algorithms and AI-based end-to-end processing systems on board EO satellites.

Read more

9/11/2024

M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and RGB Data
Total Score

0

M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and RGB Data

Matthew J Allen, Francisco Dorr, Joseph Alejandro Gallego Mejia, Laura Mart'inez-Ferrer, Anna Jungbluth, Freddie Kalaitzis, Ra'ul Ramos-Poll'an

Satellite-based remote sensing has revolutionised the way we address global challenges in a rapidly evolving world. Huge quantities of Earth Observation (EO) data are generated by satellite sensors daily, but processing these large datasets for use in ML pipelines is technically and computationally challenging. Specifically, different types of EO data are often hosted on a variety of platforms, with differing availability for Python preprocessing tools. In addition, spatial alignment across data sources and data tiling can present significant technical hurdles for novice users. While some preprocessed EO datasets exist, their content is often limited to optical or near-optical wavelength data, which is ineffective at night or in adverse weather conditions. Synthetic Aperture Radar (SAR), an active sensing technique based on microwave length radiation, offers a viable alternative. However, the application of machine learning to SAR has been limited due to a lack of ML-ready data and pipelines, particularly for the full diversity of SAR data, including polarimetry, coherence and interferometry. We introduce M3LEO, a multi-modal, multi-label EO dataset that includes polarimetric, interferometric, and coherence SAR data derived from Sentinel-1, alongside Sentinel-2 RGB imagery and a suite of labelled tasks for model evaluation. M3LEO spans 17.5TB and contains approximately 10M data chips across six geographic regions. The dataset is complemented by a flexible PyTorch Lightning framework, with configuration management using Hydra. We provide tools to process any dataset available on popular platforms such as Google Earth Engine for integration with our framework. Initial experiments validate the utility of our data and framework, showing that SAR imagery contains information additional to that extractable from RGB data. Data at huggingface.co/M3LEO, and code at github.com/spaceml-org/M3LEO.

Read more

6/7/2024

Total Score

0

New!Artificial intelligence to advance Earth observation: : A review of models, recent trends, and pathways forward

Devis Tuia, Konrad Schindler, Begum Demir, Xiao Xiang Zhu, Mrinalini Kochupillai, Sav{s}o Dv{z}eroski, Jan N. van Rijn, Holger H. Hoos, Fabio Del Frate, Mihai Datcu, Volker Markl, Bertrand Le Saux, Rochelle Schneider, Gustau Camps-Valls

Earth observation (EO) is a prime instrument for monitoring land and ocean processes, studying the dynamics at work, and taking the pulse of our planet. This article gives a bird's eye view of the essential scientific tools and approaches informing and supporting the transition from raw EO data to usable EO-based information. The promises, as well as the current challenges of these developments, are highlighted under dedicated sections. Specifically, we cover the impact of (i) Computer vision; (ii) Machine learning; (iii) Advanced processing and computing; (iv) Knowledge-based AI; (v) Explainable AI and causal inference; (vi) Physics-aware models; (vii) User-centric approaches; and (viii) the much-needed discussion of ethical and societal issues related to the massive use of ML technologies in EO.

Read more

9/18/2024

🤖

Total Score

0

EarthNets: Empowering AI in Earth Observation

Zhitong Xiong, Fahong Zhang, Yi Wang, Yilei Shi, Xiao Xiang Zhu

Earth observation (EO), aiming at monitoring the state of planet Earth using remote sensing data, is critical for improving our daily lives and living environment. With a growing number of satellites in orbit, an increasing number of datasets with diverse sensors and research domains are being published to facilitate the research of the remote sensing community. This paper presents a comprehensive review of more than 500 publicly published datasets, including research domains like agriculture, land use and land cover, disaster monitoring, scene understanding, vision-language models, foundation models, climate change, and weather forecasting. We systematically analyze these EO datasets from four aspects: volume, resolution distributions, research domains, and the correlation between datasets. Based on the dataset attributes, we propose to measure, rank, and select datasets to build a new benchmark for model evaluation. Furthermore, a new platform for EO, termed EarthNets, is released to achieve a fair and consistent evaluation of deep learning methods on remote sensing data. EarthNets supports standard dataset libraries and cutting-edge deep learning models to bridge the gap between the remote sensing and machine learning communities. Based on this platform, extensive deep-learning methods are evaluated on the new benchmark. The insightful results are beneficial to future research. The platform and dataset collections are publicly available at https://earthnets.github.io.

Read more

4/4/2024