Ensemble single image deraining network via progressive structural boosting constraints

https://doi.org/10.1016/j.image.2021.116460Get rights and content

Highlights

  • The ensemble network has capacity to solve rain removal of multiple types.

  • Struct-absolute loss benefits for embedding boosting strategy into network training.

  • The ensemble network achieves new SOTA performance on public deraining datasets.

Abstract

Image deraining is an extensively researched topic in low-level computer vision community. During the past, sufficient works have been proposed to address this problem. Though great improvements these methods have achieved, no derain network can confidently declare that it can solve rain removal problem perfectly. Single complex model may lead to overfitting, while simple model is too weak to achieve clear result. Therefore, in this paper, inspired from classic boosting idea, we have proposed an effective ensemble derain framework to aggregate multiple simple weak drain models to obtain a strong derain model. Cascade structural weighting-map are computed for adaptively emphasizing the quality of local derain regions. Struct-absolution losses are proposed to account for pixel-wise and local region-wise differences, and to facilitate embedding boosting idea into network training. The comprehensive experiments on public derain datasets and high-level vision tasks validate that our proposed model which just utilizes three generally weak derain subnets can achieve much better performance than compared state-of-the-art methods. Our ensemble framework has enough capacity that any state-of-the-art DL-based models can be taken as sub-modules to solve rain removal of multiple types within a single framework.

Introduction

Rainy day is one of the most common weather in our daily life. Images captured on rainy condition often suffer poor visual qualities with rain masks. Under different conditions, the rain-masks may appear different styles, such as linear-shape sparse rain-streaks, randomly scattered raindrops and accumulated hazy rain-mist. The image degradation caused by rain-masks brings great challenge to downstream high-level vision tasks, such as video surveillance, object recognition, auto-driving, etc. Therefore, it is important to develop novel and effective rain removal algorithm to restore image’s clear content. In recent years, with increasing demands of industrial applications, image derain becomes a very hot research topic in computer vision communities.

During the past, many state-of-the-art methods have been proposed to solve the image derain problems. Though great improvements have been achieved, there are several deficiencies that need paying much attention on. On one hand, many derain methods were criticized that they were specifically designed for only one rain-mask, especially for rain-streak. Assuming degradation too much on one rain case may oversimplify the derain problem. On another hand, since image derain is a challenge ill-posed image restoration problem, in most cases, there does not exist a single derain model that is perfect for rain removal. Single complex derain model may lead to overfitting, resulting content details smoothed, while simple general model may be too weak to remove rain clearly. Therefore, some ensemble strategies are needed to combine methods for different rain-masks or methods considering different rain information aspects, taking their complementary advantages.

In this paper, we follow the ensemble learning strategy, and propose an ensemble single image derain network inspired from classic adaboost [1] idea. As we know, adaboost algorithm adaptively gives much emphasis on wrongly classified samples. For the derain problem, we take pixel-wise value and region’s structural consistence as “sample labels”. Therefore, we design cascade weighting map computation modules, and construct progressive structural boosting constraints on pre-trained weak derain subnets. Auxiliary struct-absolute losses (SALoss) are calculated on each subnet for fine-tuning. Then coarsely derained results from each “weak” subnet are aggregated and refined to generate final optimal clear output. We have performed comprehensive experiments on two public derain datasets to evaluate the effectiveness our ensemble framework. Though our initial motivation is to validate the boosting thought on derain nets, the experiment results amazingly demonstrate that our proposed ensemble model just aggregating three weak derain subnets can achieve more superior performance than state-of-the-art methods compared in this paper.

The contributions of this paper are summarized into three folds.

  • We have successfully constructed an effective ensemble single deraining network via progressive structural boosting constraints. The ensemble framework has enough capacity that any state-of-the-art derain networks can be taken as its sub-modules to potentially solve rain removal of multiple types within a single framework.

  • We have proposed a struct-absolute loss to consider both pixel-wise and local region-wise differences for network training. As auxiliary loss for fine-tuning, it can benefit for embedding boosting into network training.

  • We have experimented our proposed ensemble network on public synthesized derain datasets and real-world rainy images. The experiment results demonstrate that our ensemble boosting derain network can achieve much better performance than many state-of-the-art methods, though we just only utilize three pre-trained “weak” derain models as subnets. On high-level vision tasks, our experimented model can also benefit high-level vision algorithms for reaching almost the same results on clear ground-truth.

In the following sections, we will briefly review some related works in Section 2. Then in Section 3, we will describe our ensemble boosting derain framework in details. In Section 4, we will comprehensively demonstrate our experiments. Finally, in Section 5, we give our conclusion and discus the future work.

Section snippets

Related work

Image deraining is an extensively researched topic in the low-level computer vision community. During the past, sufficient works have been proposed to address this problem. Since our proposed ensemble network belongs to deep neural network routine, in this section, we focus on reviewing deep learning based single image derain methods. Much more comprehensive analysis on image derain methods can be referred in recent survey works [2], [3], [4].

As we know, the era of deep-learning based deraining

Materials and methods

The architecture of our network is demonstrated in Fig. 1. The whole network is composed of two stages, which are ensemble derain boosting stage and refined derain stage.

At the ensemble derain stage, we follow the idea of boosting to specifically design a cascade structure to organize available rain-mask removing algorithms. At the refined derain stage, we comprehensively reutilize multiple coarsely derain results to finely learn final clear image. In the following subsections, we will describe

Experiment settings

For simplicity, Res-Net, U-Net and W-Net [26] are taken as sub-modules of our ensemble framework. Their architectures are illustrated in Fig. 6. However, we still would like to emphasize that our ensemble framework are with enough capacity that any SOTA derain networks can be taken as our sub-modules.

We train and test our derain networks on two publicly available synthesized datasets. The first one is Rain800 from ID-CGAN dataset [15]. It contains about 700 training sets and 100 test sets.

Conclusion

In this paper, we have proposed an effective ensemble single image derain framework inspired from classic boosting idea. Cascade structural weighting-map are computed for adaptively emphasizing the quality of local derain regions. Struct-absolution losses are proposed to account for pixel-wise and local region-wise differences, and to facilitate embedding boosting idea into network training. Finally, coarsely derained results from each subnet are aggregated and refined to generate final optimal

CRediT authorship contribution statement

Long Peng: Design, Formal analysis, Interpretation of data for the work. Aiwen Jiang: Conceptualization, Design, Funding acquisition, Formal analysis, Interpretation of data for the work, Writing – original draft, Writing – review & editing. Haoran Wei: Formal analysis. Bo Liu: Formal analysis, Interpretation of data for the work. Mingwen Wang: Conceptualization, Funding acquisition, Formal analysis.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Long Peng is an undergraduate student in School of Computer and Information Engineering, Jiangxi Normal University. His research interest is image enhancement.

References (35)

  • FreundYoav et al.

    A decision-theoretic generalization of on-line learning and an application to boosting

    J. Comput. System Sci.

    (1997)
  • LiS. et al.

    Single image deraining: A comprehensive benchmark analysis

  • WangHong et al.

    A survey on rain removal from video and single image

    (2019)
  • YangWenhan et al.

    Single image deraining: From model-based to data-driven and beyond

    (2019)
  • YouS. et al.

    Adherent raindrop modeling, detectionand removal in video

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2016)
  • ZhangZheng et al.

    Going clear from misty rain in dark channel guided network

  • FuXueyang et al.

    Clearing the skies: A deep network architecture for single-image rain removal

    IEEE Trans. Image Process.

    (2017)
  • Xueyang Fu, Jiabin Huang, Delu Zeng, Huang Yue, Xinghao Ding, John Paisley, Removing rain from single images via a deep...
  • FuXueyang et al.

    Lightweight pyramid networks for image deraining

    IEEE Trans. Neural Netw. Learn. Syst.

    (2020)
  • Wenhan Yang, Tan Robby T., Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan, Deep joint rain detection and removal...
  • YangWenhan et al.

    Joint rain detection and removal from a single image with contextualized deep networks

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2019)
  • LiXia et al.

    Recurrent squeeze-and-excitation context aggregation net for single image deraining

  • YangWenhan et al.

    Scale-free single image deraining via visibility-enhanced recurrent wavelet learning

    IEEE Trans. Image Process.

    (2019)
  • Dongwei Ren, Wangmeng Zuo, Qinghua Hu, Pengfei Zhu, Deyu Meng, Progressive image deraining networks: A better and...
  • ZhangHe et al.

    Image de-raining using a conditional generative adversarial network

    IEEE Trans. Circuits Syst. Video Technol.

    (2019)
  • He Zhang, Vishal M. Patel, Density-aware single image de-raining using a multi-stream dense network, in: Proc. of...
  • Ruoteng Li, Loong-Fah Cheong, Tan Robby T., Heavy rain image restoration: Integrating physics model and conditional...
  • Cited by (14)

    View all citing articles on Scopus

    Long Peng is an undergraduate student in School of Computer and Information Engineering, Jiangxi Normal University. His research interest is image enhancement.

    Aiwen Jiang is full professor in School of Computer and Information Engineering, Jiangxi Normal University. He received his PhD. degree from Institute of Automation, Chinese Academy of Sciences, China, in 2010. He received his bachelor degree from Nanjing University of Post and Telecommunication, China, in 2005. His research interests are computer vision, machine learning.

    Haoran Wei is a PhD candidate in the Department of Electrical and Computer Engineering at the University of Texas at Dallas, Richardson, TX. He received his BE degree in communication engineering from Shanghai Normal University, Shanghai, China, in 2014, and the MS degree in communication and information system from Shanghai Normal University, Shanghai, China, in 2017. His research interests include real-time image processing, video processing, speech processing, and machine learning.

    Bo Liu is current a tenure-track assistant professor in College of Computer Science and Software Engineering at Auburn University, Auburn. He received his Ph.D. degree in computer science from University of Massachusetts Amherst, Massachusetts, in 2015. He is the recipient of the UAI-2015 Facebook Best Student Paper Award. His primary research area covers machine learning, deep learning, stochastic optimization.

    Mingwen Wang is full professor and director in School of Computer and Information Engineering, Jiangxi Normal University. He received his PhD degree from Shanghai Jiaotong university, China, in 2001. His research interests are machine learning.

    This work was supported by the National Natural Science Foundation of China under Grant No. 61966018 and 61876074.

    View full text