3DCD: Scene Independent End-to-End Spatiotemporal Feature Learning Framework for Change Detection in Unseen Videos,IEEE Transactions on Image Processing

当前位置： X-MOL 学术 › IEEE Trans. Image Process. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

3DCD: Scene Independent End-to-End Spatiotemporal Feature Learning Framework for Change Detection in Unseen Videos
IEEE Transactions on Image Processing ( IF 10.8 ) Pub Date : 2020-11-18 , DOI: 10.1109/tip.2020.3037472
Murari Mandal , Vansh Dhar , Abhishek Mishra , Santosh Kumar Vipparthi , Mohamed Abdel-Mottaleb

Change detection is an elementary task in computer vision and video processing applications. Recently, a number of supervised methods based on convolutional neural networks have reported high performance over the benchmark dataset. However, their success depends upon the availability of certain proportions of annotated frames from test video during training. Thus, their performance on completely unseen videos or scene independent setup is undocumented in the literature. In this work, we present a scene independent evaluation (SIE) framework to test the supervised methods in completely unseen videos to obtain generalized models for change detection. In addition, a scene dependent evaluation (SDE) is also performed to document the comparative analysis with the existing approaches. We propose a fast (speed-25 fps) and lightweight (0.13 million parameters, model size-1.16 MB) end-to-end 3D-CNN based change detection network (3DCD) with multiple spatiotemporal learning blocks. The proposed 3DCD consists of a gradual reductionist block for background estimation from past temporal history. It also enables motion saliency estimation, multi-schematic feature encoding-decoding, and finally foreground segmentation through several modular blocks. The proposed 3DCD outperforms the existing state-of-the-art approaches evaluated in both SIE and SDE setup over the benchmark CDnet 2014, LASIESTA and SBMI2015 datasets. To the best of our knowledge, this is a first attempt to present results in clearly defined SDE and SIE setups in three change detection datasets.

中文翻译：

3DCD：场景无关的端到端时空特征学习框架，用于在看不见的视频中进行变化检测

更改检测是计算机视觉和视频处理应用程序中的一项基本任务。最近，许多基于卷积神经网络的监督方法都报告了基准数据集的高性能。但是，它们的成功取决于训练期间测试视频中某些比例的带注释帧的可用性。因此，它们在完全看不见的视频或与场景无关的设置上的表现在文献中没有记载。在这项工作中，我们提出了一个场景独立评估（SIE）框架，以测试完全看不见的视频中的监督方法，以获得用于变化检测的通用模型。此外，还进行了场景依赖评估（SDE），以记录与现有方法进行的比较分析。我们建议使用快速（速度为25 fps）和轻量级（0。1300万个参数，模型大小为1.16 MB）基于端到端3D-CNN的变更检测网络（3DCD），具有多个时空学习块。拟议的3DCD包含一个渐进式归约块，用于根据过去的时间历史进行背景估计。它还通过几个模块块实现运动显着性估计，多模式特征编码-解码以及最终前景分割。拟议的3DCD优于通过基准CDnet 2014，LASIESTA和SBMI2015数据集在SIE和SDE设置中评估的现有最新方法。据我们所知，这是首次尝试在三个变更检测数据集中呈现清晰定义的SDE和SIE设置中的结果。拟议的3DCD包含一个渐进式归约块，用于根据过去的时间历史进行背景估计。它还通过几个模块块实现运动显着性估计，多模式特征编码-解码以及最终前景分割。拟议的3DCD优于通过基准CDnet 2014，LASIESTA和SBMI2015数据集在SIE和SDE设置中评估的现有最新方法。据我们所知，这是首次尝试在三个变更检测数据集中呈现清晰定义的SDE和SIE设置中的结果。拟议的3DCD包含一个渐进式归约块，用于根据过去的时间历史进行背景估计。它还通过几个模块块实现运动显着性估计，多模式特征编码-解码以及最终前景分割。拟议的3DCD优于通过基准CDnet 2014，LASIESTA和SBMI2015数据集在SIE和SDE设置中评估的现有最新方法。据我们所知，这是首次尝试在三个变更检测数据集中呈现清晰定义的SDE和SIE设置中的结果。拟议的3DCD优于通过基准CDnet 2014，LASIESTA和SBMI2015数据集在SIE和SDE设置中评估的现有最新方法。据我们所知，这是首次尝试在三个变更检测数据集中呈现清晰定义的SDE和SIE设置中的结果。拟议的3DCD优于通过基准CDnet 2014，LASIESTA和SBMI2015数据集在SIE和SDE设置中评估的现有最新方法。据我们所知，这是首次尝试在三个变更检测数据集中呈现清晰定义的SDE和SIE设置中的结果。

更新日期：2020-11-27

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11