Visibility-Aware Point-Based Multi-View Stereo Network
IEEE Transactions on Pattern Analysis and Machine Intelligence (IF 23.6), Pub Date: 2020-04-22, DOI: 10.1109/tpami.2020.2988729
Rui Chen, Songfang Han, Jing Xu, Hao Su

We introduce VA-Point-MVSNet, a novel visibility-aware, point-based deep framework for multi-view stereo (MVS). Unlike existing cost-volume approaches, our method directly processes the target scene as a point cloud. More specifically, our method predicts depth in a coarse-to-fine manner: we first generate a coarse depth map, convert it into a point cloud, and refine the point cloud iteratively by estimating the residual between the depth of the current iteration and that of the ground truth. Our network leverages 3D geometry priors and 2D texture information jointly and effectively by fusing them into a feature-augmented point cloud, which it then processes to estimate the 3D flow for each point. This point-based architecture achieves higher accuracy, greater computational efficiency, and more flexibility than cost-volume-based counterparts. Furthermore, our visibility-aware multi-view feature aggregation allows the network to aggregate multi-view appearance cues while taking visibility into account. Experimental results show that our approach achieves a significant improvement in reconstruction quality over state-of-the-art methods on the DTU and Tanks and Temples datasets. The code of VA-Point-MVSNet proposed in this work will be released at https://github.com/callmeray/PointMVSNet.
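
To make the pipeline described above concrete, the following is a minimal sketch of the coarse-to-fine refinement loop and the visibility-aware feature aggregation. It is not the authors' implementation (see the GitHub link above for that): every name here (FlowNet, aggregate_multiview, refine_depth) and all tensor shapes are hypothetical stand-ins, the feature fusion is reduced to a plain visibility-weighted softmax average, and image features are held fixed rather than re-fetched by reprojecting the updated points after each iteration, as a full implementation would do.

import torch
import torch.nn as nn

class FlowNet(nn.Module):
    """Stand-in for the point network that predicts a per-point depth
    residual (the '3D flow' restricted to the viewing ray)."""
    def __init__(self, feat_dim):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim + 3, 64), nn.ReLU(),
            nn.Linear(64, 1),
        )

    def forward(self, points, feats):
        # points: (N, 3) 3D positions; feats: (N, C) aggregated 2D features.
        # Concatenating them plays the role of the feature-augmented point cloud.
        return self.mlp(torch.cat([points, feats], dim=-1)).squeeze(-1)

def aggregate_multiview(per_view_feats, visibility):
    # Visibility-aware aggregation as a soft weighted average over views.
    # per_view_feats: (V, N, C); visibility: (V, N) unnormalized scores.
    w = torch.softmax(visibility, dim=0).unsqueeze(-1)  # (V, N, 1)
    return (w * per_view_feats).sum(dim=0)              # (N, C)

def refine_depth(depth, rays, per_view_feats, visibility, flow_net, n_iters=3):
    # depth: (N,) coarse per-pixel depth; rays: (N, 3) unit viewing directions.
    for _ in range(n_iters):
        points = depth.unsqueeze(-1) * rays             # unproject to 3D
        feats = aggregate_multiview(per_view_feats, visibility)
        residual = flow_net(points, feats)              # predicted depth offset
        depth = depth + residual                        # move points along rays
        # A real implementation would now reproject the updated points into
        # each view and re-fetch features; we keep them fixed for brevity.
    return depth

# Toy usage with random tensors: V views, N points, C feature channels.
V, N, C = 3, 1024, 32
flow_net = FlowNet(feat_dim=C)
depth = torch.rand(N) + 1.0
rays = nn.functional.normalize(torch.randn(N, 3), dim=-1)
refined = refine_depth(depth, rays, torch.randn(V, N, C),
                       torch.randn(V, N), flow_net)
print(refined.shape)  # torch.Size([1024])

Even in this toy form, the design choice the abstract highlights is visible: because the scene is represented as points rather than a dense cost volume, each iteration touches only the N points being refined, and the predicted residual lets the network place depth samples adaptively near the surface.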
