Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery

2406.07482

Published 6/12/2024 by Biplov Bhandari, Timothy Mayer

Abstract

The Bhutanese government is increasingly incorporating technological approaches, including Remote Sensing-based knowledge, into its decision-making process. This study focuses on crop type and crop extent in Paro, one of the top rice-yielding districts in Bhutan, and employs publicly available NICFI high-resolution satellite imagery from Planet. Two Deep Learning (DL) approaches, a point-based model (DNN) and a patch-based model (U-Net), were used in conjunction with cloud-computing platforms. Four models were trained per DL approach (DNN and U-Net): 1) RGBN channels from Planet; 2) RGBN and elevation data (RGBNE); 3) RGBN and Sentinel-1 (S1) data (RGBNS); and 4) RGBN with E and S1 data (RGBNES). In this analysis, the U-Net displayed higher performance metrics across both model training and model validation. Among the U-Net models, RGBN, RGBNE, RGBNS, and RGBNES had F1-scores of 0.8546, 0.8563, 0.8467, and 0.8500, respectively. An independent model evaluation found a high level of performance variation across all metrics. In this independent evaluation, the U-Net RGBN, RGBNE, RGBNS, and RGBNES models had F1-scores of 0.5935, 0.6154, 0.5882, and 0.6582, respectively, suggesting U-Net RGBNES as the best model. The study shows that DL approaches can predict rice and can be used alongside the survey-based approaches currently utilized by the Bhutan Department of Agriculture. Further, the study demonstrates the use of regional land cover products such as SERVIR's RLCMS as weak labels to capture different strata, addressing the class imbalance problem and improving the sampling design for DL applications. Finally, preliminary model testing and comparisons showed that additional features such as NDVI, EVI, and NDWI did not drastically improve model performance.
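The indices named at the end of the abstract can be computed from the same four Planet bands. The sketch below uses the standard textbook formulas; the abstract does not say which NDWI variant was tested, so the green/NIR form is an assumption (it is the one computable from RGBN alone), and the pixel values are made up for illustration.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index: (NIR - Red) / (NIR + Red)."""
    return (nir - red) / (nir + red)


def evi(nir, red, blue):
    """Enhanced Vegetation Index with the commonly used coefficients.

    Assumes reflectance scaled to the 0-1 range.
    """
    return 2.5 * (nir - red) / (nir + 6.0 * red - 7.5 * blue + 1.0)


def ndwi(green, nir):
    """NDWI in the green/NIR form: (Green - NIR) / (Green + NIR).

    Assumption: the source does not state which NDWI definition was used.
    """
    return (green - nir) / (green + nir)


# Hypothetical reflectance values for one vegetated pixel (not study data).
blue, green, red, nir = 0.04, 0.08, 0.06, 0.40
indices = {
    "NDVI": ndvi(nir, red),
    "EVI": evi(nir, red, blue),
    "NDWI": ndwi(green, nir),
}
```

The functions use only arithmetic operators, so they also work element-wise on array types. They do not guard against a zero denominator, which a real pipeline would need to handle for no-data pixels.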

Overview

This paper explores the use of deep learning models for mapping rice cultivation in Bhutan using high-resolution satellite imagery.
The researchers compare two deep learning approaches, a point-based deep neural network (DNN) and a patch-based U-Net, each trained on four combinations of input data, for identifying and delineating rice fields.
The study aims to provide insights into the suitability of different deep learning approaches for agricultural mapping in a mountainous, high-altitude region like Bhutan.

Plain English Explanation

The paper investigates the use of advanced machine learning techniques, specifically deep learning models, to map the location and extent of rice farming in the mountainous country of Bhutan. Rice is an important crop in Bhutan, but accurately tracking its cultivation can be challenging due to the rugged terrain and small field sizes.

The researchers tested two deep learning approaches, a point-based deep neural network (DNN) that classifies individual pixels and a patch-based U-Net that classifies image patches, to see which could best identify and delineate rice fields using high-resolution satellite imagery.

By comparing the performance of these different deep learning models, the study aims to provide guidance on the most suitable techniques for accurately mapping rice cultivation in Bhutan and similar mountainous regions. This information could be valuable for agricultural planning, monitoring, and decision-making in these challenging environments.

Technical Explanation

The researchers used publicly available NICFI high-resolution satellite imagery from Planet covering Paro, one of the top rice-yielding districts in Bhutan, to train and evaluate their models. They compared two approaches, a point-based DNN and a patch-based U-Net, each trained on four input sets: RGBN, RGBNE (with elevation), RGBNS (with Sentinel-1), and RGBNES (with both).
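The abstract names four input configurations (RGBN, RGBNE, RGBNS, and RGBNES). A minimal sketch of how such band stacks might be assembled is shown below. The array shapes, the random placeholder values, and the choice of two Sentinel-1 bands are assumptions for illustration, not details taken from the paper.

```python
import numpy as np

# Placeholder rasters on a shared grid; the values are random, not study data.
H, W = 64, 64
rng = np.random.default_rng(0)
rgbn = rng.random((H, W, 4))       # Planet NICFI: red, green, blue, near-infrared
elevation = rng.random((H, W, 1))  # E: elevation, assumed to be a single band
s1 = rng.random((H, W, 2))         # S1: Sentinel-1, two bands assumed (e.g. VV, VH)

# The four input configurations listed in the abstract.
FEATURE_SETS = {
    "RGBN": [rgbn],
    "RGBNE": [rgbn, elevation],
    "RGBNS": [rgbn, s1],
    "RGBNES": [rgbn, elevation, s1],
}


def build_stack(name):
    """Concatenate the band groups for one configuration along the channel axis."""
    return np.concatenate(FEATURE_SETS[name], axis=-1)


shapes = {name: build_stack(name).shape for name in FEATURE_SETS}
```

Under these assumptions the stacks have 4, 5, 6, and 7 channels respectively, and that channel count is the only thing that changes in a model's input layer between configurations.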

The DNN is point-based: it classifies each pixel from that pixel's own band values. The U-Net is patch-based: it takes an image patch as input, so it can also use the spatial context around each pixel. The U-Net displayed higher performance metrics in both training and validation, with F1-scores of 0.8546 (RGBN), 0.8563 (RGBNE), 0.8467 (RGBNS), and 0.8500 (RGBNES). Preliminary tests with additional features such as NDVI, EVI, and NDWI did not drastically improve model performance.
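The abstract distinguishes a point-based model (DNN) from a patch-based one (U-Net). The difference in what each model receives as input can be sketched as follows; the helper names, the image size, and the 128-pixel patch size are hypothetical and chosen only for illustration.

```python
import numpy as np


def point_sample(stack, row, col):
    """Point-based input: the feature vector of a single pixel, shape (bands,)."""
    return stack[row, col, :]


def patch_sample(stack, row, col, size):
    """Patch-based input: a size x size window with top-left corner (row, col)."""
    patch = stack[row:row + size, col:col + size, :]
    if patch.shape[:2] != (size, size):
        raise ValueError("patch extends past the image edge")
    return patch


# Placeholder 4-band image (all zeros); a real stack would hold band values.
stack = np.zeros((256, 256, 4), dtype=np.float32)

x_point = point_sample(stack, 10, 20)     # one pixel, no spatial context
x_patch = patch_sample(stack, 0, 0, 128)  # a window of neighboring pixels
```

A point-based model sees one feature vector per labeled location, while a patch-based model sees the neighborhood as well and is typically trained against a label mask of the same height and width.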

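The researchers evaluated the models using several metrics, including overall accuracy, precision, recall, and F1-score. They also analyzed the models' ability to delineate rice field boundaries and capture the spatial distribution of rice cultivation across the study area. An independent model evaluation showed a high level of performance variation across the metrics: the U-Net F1-scores fell to between 0.5882 and 0.6582, with the RGBNES model scoring highest.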
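For reference, the metrics named above can be computed from binary rice / non-rice labels as in the sketch below. This uses the standard definitions, not the authors' evaluation code, and the example labels are invented.

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F1 for binary labels (1 = rice, 0 = other)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # rice predicted as rice
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # other predicted as rice
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # rice that was missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # other predicted as other

    total = tp + fp + fn + tn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = (tp + tn) / total if total else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Invented example: 8 pixels, 4 of them rice.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
metrics = binary_metrics(y_true, y_pred)
```

When rice is a minority class, overall accuracy can stay high even if most rice pixels are missed. F1 combines precision and recall on the rice class only, so it is less affected by the large non-rice majority.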

Critical Analysis

The paper evaluates two deep learning approaches for rice mapping in Bhutan, a region with unique geographical and agricultural challenges. The researchers acknowledge the limited availability of ground truth data for training and validation, which is a common issue in many developing countries. To address the class imbalance problem and improve the sampling design, they used a regional land cover product, SERVIR's RLCMS, as weak labels to capture the different strata.
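The abstract describes using a regional land cover product (SERVIR's RLCMS) as weak labels to define strata for sampling. One way such stratified sampling could work is sketched below. The class names, the counts, and the equal-allocation rule are assumptions for illustration and are not taken from the paper.

```python
import random
from collections import defaultdict


def stratified_sample(weak_labels, per_class, seed=0):
    """Draw up to `per_class` pixel indices from each weak-label class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(weak_labels):
        by_class[label].append(idx)

    sample = {}
    for label, indices in by_class.items():
        k = min(per_class, len(indices))  # small strata contribute all they have
        sample[label] = sorted(rng.sample(indices, k))
    return sample


# Hypothetical flattened land cover map in which rice is rare (2% of pixels).
weak_labels = ["forest"] * 900 + ["cropland"] * 80 + ["rice"] * 20
sample = stratified_sample(weak_labels, per_class=20)
```

Sampling uniformly at random from this map would return mostly forest pixels. Sampling per stratum gives every class, including the rare one, a fixed number of training locations.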

One potential limitation of the study is the reliance on a single year of satellite imagery. Incorporating multi-temporal data, which could capture the temporal dynamics of rice cultivation, may further improve the models' ability to accurately identify and delineate rice fields. Additionally, the paper does not delve into the computational complexity and inference time of the different deep learning architectures, which could be an important consideration for real-world deployment in resource-constrained environments.

The researchers also note that the performance of the deep learning models may vary depending on the specific geographical and climatic conditions of different regions within Bhutan. Further validation and adaptation of the models in other parts of the country would be necessary to ensure their broader applicability.

Conclusion

This study demonstrates the potential of deep learning techniques, specifically a point-based DNN and a patch-based U-Net, for mapping rice cultivation in the challenging mountainous environment of Bhutan. The findings could inform the development of advanced agricultural monitoring and management systems in Bhutan and other similar regions, contributing to more efficient and sustainable food production.

Related Papers

BD-SAT: High-resolution Land Use Land Cover Dataset & Benchmark Results for Developing Division: Dhaka, BD

Ovi Paul, Abu Bakar Siddik Nayem, Anis Sarker, Amin Ahsan Ali, M Ashraful Amin, AKM Mahbubur Rahman

Land Use Land Cover (LULC) analysis on satellite images using deep learning-based methods is significantly helpful in understanding the geography, socio-economic conditions, poverty levels, and urban sprawl in developing countries. Recent works involve segmentation with LULC classes such as farmland, built-up areas, forests, meadows, water bodies, etc. Training deep learning methods on satellite images requires large sets of images annotated with LULC classes. However, annotated data for developing countries are scarce due to a lack of funding, absence of dedicated residential/industrial/economic zones, a large population, and diverse building materials. BD-SAT provides a high-resolution dataset that includes pixel-by-pixel LULC annotations for Dhaka metropolitan city and surrounding rural/urban areas. Using a strict and standardized procedure, the ground truth is created using Bing satellite imagery with a ground spatial distance of 2.22 meters per pixel. A three-stage, well-defined annotation process has been followed with support from GIS experts to ensure the reliability of the annotations. We performed several experiments to establish benchmark results. The results show that the annotated BD-SAT is sufficient to train large deep learning models with adequate accuracy for five major LULC classes: forest, farmland, built-up areas, water bodies, and meadows.

6/11/2024

cs.CV cs.AI

Comparative Analysis of Hyperspectral Image Reconstruction Using Deep Learning for Agricultural and Biological Applications

Md. Toukir Ahmed, Arthur Villordon, Mohammed Kamruzzaman

Hyperspectral imaging (HSI) has become a key technology for non-invasive quality evaluation in various fields, offering detailed insights through spatial and spectral data. Despite its efficacy, the complexity and high cost of HSI systems have hindered their widespread adoption. This study addressed these challenges by exploring deep learning-based hyperspectral image reconstruction from RGB (Red, Green, Blue) images, particularly for agricultural products. Specifically, different hyperspectral reconstruction algorithms, such as Hyperspectral Convolutional Neural Network - Dense (HSCNN-D), High-Resolution Network (HRNET), and Multi-Scale Transformer Plus Plus (MST++), were compared to assess the dry matter content of sweet potatoes. Among the tested reconstruction methods, HRNET demonstrated superior performance, achieving the lowest mean relative absolute error (MRAE) of 0.07, root mean square error (RMSE) of 0.03, and the highest peak signal-to-noise ratio (PSNR) of 32.28 decibels (dB). Some key features were selected using the genetic algorithm (GA), and their importance was interpreted using explainable artificial intelligence (XAI). Partial least squares regression (PLSR) models were developed using the RGB, reconstructed, and ground truth (GT) data. The visual and spectra quality of these reconstructed methods was compared with GT data, and predicted maps were generated. The results revealed the prospect of deep learning-based hyperspectral image reconstruction as a cost-effective and efficient quality assessment tool for agricultural and biological applications.

6/4/2024

eess.IV cs.CV

Deep Learning for Slum Mapping in Remote Sensing Images: A Meta-analysis and Review

Anjali Raj, Adway Mitra, Manjira Sinha

The major Sustainable Development Goals (SDG) 2030, set by the United Nations Development Program (UNDP), include sustainable cities and communities, no poverty, and reduced inequalities. However, millions of people live in slums or informal settlements with poor living conditions in many major cities around the world, especially in less developed countries. To emancipate these settlements and their inhabitants through government intervention, accurate data about slum location and extent is required. While ground survey data is the most reliable, such surveys are costly and time-consuming. An alternative is remotely sensed data obtained from very high-resolution (VHR) imagery. With the advancement of new technology, remote sensing based mapping of slums has emerged as a prominent research area. The parallel rise of Artificial Intelligence, especially Deep Learning, has added a new dimension to this field as it allows automated analysis of satellite imagery to identify complex spatial patterns associated with slums. This article offers a detailed review and meta-analysis of research on slum mapping using remote sensing imagery from 2014 to 2024, with a special focus on deep learning approaches. Our analysis reveals a trend towards increasingly complex neural network architectures, with advancements in data preprocessing and model training techniques significantly enhancing slum identification accuracy. We have attempted to identify key methodologies that are effective across diverse geographic contexts. While acknowledging the transformative impact of Convolutional Neural Networks (CNNs) in slum detection, our review underscores the absence of a universally optimal model, suggesting the need for context-specific adaptations. We also identify prevailing challenges in this field, such as data limitations and a lack of model explainability, and suggest potential strategies for overcoming these.

6/13/2024

cs.CV

Insight Into the Collocation of Multi-Source Satellite Imagery for Multi-Scale Vessel Detection

Tran-Vu La, Minh-Tan Pham, Marco Chini

Ship detection from satellite imagery using Deep Learning (DL) is an indispensable solution for maritime surveillance. However, applying DL models trained on one dataset to others with differences in spatial resolution and radiometric features requires many adjustments. To overcome this issue, this paper focuses on DL models trained on datasets that consist of different optical images and a combination of radar and optical data. When dealing with a limited number of training images, the performance of DL models via this approach was satisfactory. They could improve average precision by 5-20%, depending on the optical images tested. Likewise, DL models trained on the combined optical and radar dataset could be applied to both optical and radar images. Our experiments showed that the models trained on an optical dataset could be used for radar images, while those trained on a radar dataset offered very poor scores when applied to optical images.

5/24/2024

cs.CV eess.IV