Unveiling Deep Shadows: A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning

Read original: arXiv:2409.02108 - Published 9/4/2024 by Xiaowei Hu, Zhenghao Xing, Tianyu Wang, Chi-Wing Fu, Pheng-Ann Heng

Overview

This paper provides a comprehensive survey on image and video shadow detection, removal, and generation using deep learning techniques.
It covers various aspects of shadow-related computer vision tasks, including instance shadow detection, shadow removal, and shadow generation.
The survey examines the latest advancements and challenges in these areas, highlighting the significant impact of deep learning on shadow-related research.

Plain English Explanation

This paper reviews the latest research on using deep learning, a type of artificial intelligence, to address different aspects of shadows in images and videos. Shadows can be a major challenge in computer vision tasks, as they can obscure important details or create unwanted effects. The survey looks at three main areas:

Shadow detection: Identifying where shadows are located in an image or video.
Shadow removal: Removing shadows from an image or video to reveal the underlying scene.
Shadow generation: Creating realistic-looking shadows to enhance the realism of computer-generated images and videos.

The paper examines how deep learning has advanced research on each of these tasks. It covers recent methods as well as the challenges and limitations that researchers are still working to overcome.

Technical Explanation

The paper begins by providing an overview of the key shadow-related tasks in computer vision:

Shadow detection: Identifying the location and boundaries of shadows in an image or video. This is an important first step for tasks like shadow removal or scene understanding.
Shadow removal: Removing or restoring the underlying scene behind a shadow to create a shadow-free image or video. This can improve the quality and usability of visual data.
Shadow generation: Synthesizing realistic-looking shadows to enhance the realism of computer-generated imagery and animations. This is useful for applications like visual effects, augmented reality, and virtual environments.
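To make the three tasks concrete, here is a minimal sketch on a toy image. It is not one of the deep learning methods the survey reviews: it assumes a shadow is a constant darkening of the scene, and the function names, the `strength` parameter, and the intensity threshold are all hypothetical choices made for illustration.

```python
import numpy as np

def generate_shadow(image, mask, strength=0.6):
    """Toy shadow generation: darken pixels under `mask` by a constant factor."""
    attenuation = 1.0 - strength * mask[..., None]
    return image * attenuation

def detect_shadow(image, threshold=0.5):
    """Toy shadow detection: flag pixels whose mean intensity is below `threshold`."""
    return (image.mean(axis=-1) < threshold).astype(np.float64)

def remove_shadow(image, mask, strength=0.6):
    """Toy shadow removal: undo the assumed constant attenuation inside `mask`."""
    attenuation = 1.0 - strength * mask[..., None]
    return image / attenuation

# A uniform bright 4x4 RGB scene with a 2x2 shadow in the top-left corner.
scene = np.full((4, 4, 3), 0.8)
true_mask = np.zeros((4, 4))
true_mask[:2, :2] = 1.0

shadowed = generate_shadow(scene, true_mask)        # generation
predicted_mask = detect_shadow(shadowed)            # detection
restored = remove_shadow(shadowed, predicted_mask)  # removal
```

Real shadows vary in strength and have soft boundaries, and real scenes contain dark surfaces that are not shadows. A fixed threshold and a constant attenuation fail in both cases, which is why learned models are used in practice.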

The survey then turns to the impact of deep learning on these tasks. Deep learning, a form of machine learning built on artificial neural networks, has made shadow analysis more accurate and robust than earlier approaches. The paper examines the deep learning architectures, training techniques, and datasets that have been applied to shadow detection, removal, and generation.

For example, the paper discusses instance shadow detection, where deep learning models are trained to identify individual shadow instances within an image or video, rather than just the overall shadow regions, and to associate each shadow with the object that casts it. This is useful for applications such as photo editing and light direction estimation, where each shadow must be handled together with its object.
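One way to picture the output of instance shadow detection is as a pair of integer label maps plus a list of shadow-object associations. The sketch below is only an assumed representation for illustration; the label maps, the `pairs` list, and the helper names are hypothetical and are not taken from the survey or from any particular dataset format.

```python
import numpy as np

# Hypothetical label maps: 0 is background, k > 0 marks the k-th instance.
shadow_labels = np.array([
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 2],
    [0, 0, 2, 2],
])
object_labels = np.array([
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 2, 0],
    [0, 0, 0, 0],
])
# Hypothetical associations as (shadow id, object id) tuples.
pairs = [(1, 1), (2, 2)]

def instance_masks(labels):
    """Split a label map into one boolean mask per non-background instance."""
    return {int(k): labels == k for k in np.unique(labels) if k != 0}

def paired_regions(shadow_labels, object_labels, pairs):
    """Return the union mask of each associated shadow-object pair."""
    shadows = instance_masks(shadow_labels)
    objects = instance_masks(object_labels)
    return {(s, o): shadows[s] | objects[o] for s, o in pairs}

regions = paired_regions(shadow_labels, object_labels, pairs)
areas = {pair: int(mask.sum()) for pair, mask in regions.items()}
```

Keeping the association explicit is what lets an editing tool move or delete an object together with its shadow, instead of treating all shadow pixels as one region.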

The survey also covers shadow removal approaches that leverage deep learning to accurately restore the underlying scene behind shadows, with applications in image enhancement and editing. Additionally, the paper examines deep learning-based shadow generation techniques that can create realistic-looking shadows to improve the visual quality of synthetic scenes.

Critical Analysis

The survey provides a thorough and up-to-date overview of the state of the art in shadow-related computer vision tasks, highlighting the significant advancements enabled by deep learning. However, it also acknowledges several challenges and limitations that researchers are still working to address:

Generalization: While deep learning models have shown impressive performance on specific datasets and scenarios, there are concerns about their ability to generalize to a wider range of real-world conditions, such as varying lighting, camera angles, and scene complexity.
Interpretability: Deep learning models can often be treated as "black boxes," making it difficult to understand the underlying reasoning behind their decisions. Improving the interpretability of these models could lead to better insights and more trustworthy applications.
Computational Efficiency: Some deep learning-based shadow detection and removal approaches can be computationally intensive, which may hinder their deployment in real-time or resource-constrained applications.
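The generalization concern above is usually probed by scoring a detector on data it was not trained on. A minimal sketch of one such score follows: the balanced error rate (BER), a metric widely used in shadow detection work, although this summary does not say which metrics the survey adopts. The function name and the toy masks are assumptions, and the code assumes the ground truth contains both shadow and non-shadow pixels.

```python
import numpy as np

def balanced_error_rate(pred, truth):
    """BER in percent: 100 * (1 - mean of shadow and non-shadow recall).

    Assumes `truth` contains at least one shadow and one non-shadow pixel.
    """
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    tp = np.sum(pred & truth)    # shadow pixels predicted as shadow
    tn = np.sum(~pred & ~truth)  # non-shadow pixels predicted as non-shadow
    fn = np.sum(~pred & truth)   # shadow pixels that were missed
    fp = np.sum(pred & ~truth)   # non-shadow pixels flagged as shadow
    shadow_recall = tp / (tp + fn)
    non_shadow_recall = tn / (tn + fp)
    return 100.0 * (1.0 - 0.5 * (shadow_recall + non_shadow_recall))

# Toy masks: one shadow pixel is missed and one false shadow pixel is added.
truth = np.array([[1, 1, 0, 0],
                  [1, 1, 0, 0]])
pred = np.array([[1, 0, 0, 0],
                 [1, 1, 1, 0]])
ber = balanced_error_rate(pred, truth)
```

Because BER averages the error over the two classes, a detector cannot score well simply by labeling every pixel as non-shadow when shadows cover a small part of the image.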

The paper suggests that future research should focus on addressing these challenges, as well as exploring new frontiers, such as the integration of shadow-related tasks with higher-level scene understanding and reasoning capabilities.

Conclusion

This comprehensive survey provides a valuable overview of the latest advancements in shadow detection, removal, and generation using deep learning. The paper highlights the significant impact of deep learning on these shadow-related computer vision tasks, enabling more accurate and robust solutions with a wide range of practical applications.

However, the survey also identifies several challenges and limitations that researchers are still working to overcome, such as improving model generalization, interpretability, and computational efficiency. Addressing these issues could lead to even more impactful and versatile shadow-related technologies in the future.

Overall, this survey serves as a valuable resource for researchers and practitioners interested in understanding the current state of the art and future directions in the field of shadow-related computer vision and deep learning.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Unveiling Deep Shadows: A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning

Xiaowei Hu, Zhenghao Xing, Tianyu Wang, Chi-Wing Fu, Pheng-Ann Heng

Shadows are formed when light encounters obstacles, leading to areas of diminished illumination. In computer vision, shadow detection, removal, and generation are crucial for enhancing scene understanding, refining image quality, ensuring visual consistency in video editing, and improving virtual environments. This paper presents a comprehensive survey of shadow detection, removal, and generation in images and videos within the deep learning landscape over the past decade, covering tasks, deep models, datasets, and evaluation metrics. Our key contributions include a comprehensive survey of shadow analysis, standardization of experimental comparisons, exploration of the relationships among model size, speed, and performance, a cross-dataset generalization study, identification of open issues and future directions, and provision of publicly available resources to support further research.

9/4/2024

Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey

Lanqing Guo, Chong Wang, Yufei Wang, Siyu Huang, Wenhan Yang, Alex C. Kot, Bihan Wen

Shadow removal aims at restoring the image content within shadow regions, pursuing a uniform distribution of illumination that is consistent between shadow and non-shadow regions. Compared with other image restoration tasks, there are two unique challenges in shadow removal: 1) The patterns of shadows are arbitrary, varied, and often have highly complex trace structures, making "trace-less" image recovery difficult. 2) The degradation caused by shadows is spatially non-uniform, resulting in inconsistencies in illumination and color between shadow and non-shadow areas. Recent developments in this field are primarily driven by deep learning-based solutions, employing a variety of learning strategies, network architectures, loss functions, and training data. Nevertheless, a thorough and insightful review of deep learning-based shadow removal techniques is still lacking. In this paper, we are the first to provide a comprehensive survey to cover various aspects ranging from technical details to applications. We highlight the major advancements in deep learning-based single-image shadow removal methods, thoroughly review previous research across various categories, and provide insights into the historical progression of these developments. Additionally, we summarize performance comparisons both quantitatively and qualitatively. Beyond the technical aspects of shadow removal methods, we also explore potential future directions for this field.

7/15/2024

Video Instance Shadow Detection

Zhenghao Xing, Tianyu Wang, Xiaowei Hu, Haoran Wu, Chi-Wing Fu, Pheng-Ann Heng

Instance shadow detection, crucial for applications such as photo editing and light direction estimation, has undergone significant advancements in predicting shadow instances, object instances, and their associations. The extension of this task to videos presents challenges in annotating diverse video data and addressing complexities arising from occlusion and temporary disappearances within associations. In response to these challenges, we introduce ViShadow, a semi-supervised video instance shadow detection framework that leverages both labeled image data and unlabeled video data for training. ViShadow features a two-stage training pipeline: the first stage, utilizing labeled image data, identifies shadow and object instances through contrastive learning for cross-frame pairing. The second stage employs unlabeled videos, incorporating an associated cycle consistency loss to enhance tracking ability. A retrieval mechanism is introduced to manage temporary disappearances, ensuring tracking continuity. The SOBA-VID dataset, comprising unlabeled training videos and labeled testing videos, along with the SOAP-VID metric, is introduced for the quantitative evaluation of VISD solutions. The effectiveness of ViShadow is further demonstrated through various video-level applications such as video inpainting, instance cloning, shadow editing, and text-instructed shadow-object manipulation.

5/7/2024

SoftShadow: Leveraging Penumbra-Aware Soft Masks for Shadow Removal

Xinrui Wang, Lanqing Guo, Xiyu Wang, Siyu Huang, Bihan Wen

Recent advancements in deep learning have yielded promising results for the image shadow removal task. However, most existing methods rely on binary pre-generated shadow masks. The binary nature of such masks could potentially lead to artifacts near the boundary between shadow and non-shadow areas. In view of this, inspired by the physical model of shadow formation, we introduce novel soft shadow masks specifically designed for shadow removal. To achieve such soft masks, we propose a SoftShadow framework by leveraging the prior knowledge of pretrained SAM and integrating physical constraints. Specifically, we jointly tune the SAM and the subsequent shadow removal network using penumbra formation constraint loss and shadow removal loss. This framework enables accurate predictions of penumbra (partially shaded regions) and umbra (fully shaded regions) areas while simultaneously facilitating end-to-end shadow removal. Through extensive experiments on popular datasets, we found that our SoftShadow framework, which generates soft masks, can better restore boundary artifacts, achieve state-of-the-art performance, and demonstrate superior generalizability.

9/12/2024