Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation,arXiv - CS - Computer Vision and Pattern Recognition

当前位置： X-MOL 学术 › arXiv.cs.CV › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation
arXiv - CS - Computer Vision and Pattern Recognition Pub Date : 2020-04-07 , DOI: arxiv-2004.03143
Zhe Wang, Daeyun Shin, Charless C. Fowlkes

Monocular estimation of 3d human pose has attracted increased attention with the availability of large ground-truth motion capture datasets. However, the diversity of training data available is limited and it is not clear to what extent methods generalize outside the specific datasets they are trained on. In this work we carry out a systematic study of the diversity and biases present in specific datasets and its effect on cross-dataset generalization across a compendium of 5 pose datasets. We specifically focus on systematic differences in the distribution of camera viewpoints relative to a body-centered coordinate frame. Based on this observation, we propose an auxiliary task of predicting the camera viewpoint in addition to pose. We find that models trained to jointly predict viewpoint and pose systematically show significantly improved cross-dataset generalization.

中文翻译：

预测相机视点改进了 3D 人体姿势估计的跨数据集泛化

随着大型地面实况运动捕捉数据集的可用性，3d 人体姿势的单目估计引起了越来越多的关注。然而，可用的训练数据的多样性是有限的，并且不清楚方法在它们所训练的特定数据集之外推广到什么程度。在这项工作中，我们对特定数据集中存在的多样性和偏差及其对 5 个姿势数据集的跨数据集泛化的影响进行了系统研究。我们特别关注相机视点相对于以身体为中心的坐标系分布的系统差异。基于这一观察，我们提出了除了姿势之外预测相机视点的辅助任务。

更新日期：2020-04-08

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>