Modeling human–human interaction with attention-based high-order GCN for trajectory prediction,The Visual Computer

当前位置： X-MOL 学术 › Vis. Comput. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Modeling human–human interaction with attention-based high-order GCN for trajectory prediction
The Visual Computer ( IF 3.0 ) Pub Date : 2021-07-01 , DOI: 10.1007/s00371-021-02109-2
Yanyan Fang , Zhiyu Jin , Zhenhua Cui , Qiaowen Yang , Tianyi Xie , Bo Hu

This paper presents a novel high-order graph convolutional network (GCN) for pedestrian trajectory prediction. Specifically, the walking state of a target pedestrian depends on both its historical trajectory, which encodes its speed, walking direction and acceleration information, as well as the movement of its neighbors. Thus we propose to leverage GCNs to aggregate the trajectory features of the target pedestrian and its neighbors to predict the movement of the target pedestrian. Considering that the movement of the neighbors’ neighbors affects the movement of the target pedestrian’s neighbors, thus indirectly affecting the movement of the target pedestrian, we propose to use a high-order GCN for human–human interaction modelling. Such a high-order GCN considers the target pedestrian’s neighbors as well as its neighbors’ neighbors. Further, a pedestrian avoids collision with others by estimating its locations and its neighbors’ upcoming locations, and it slows down or changes direction if it believes a collision may occur, especially in very crowded scenes. In light of this, we propose to model such anticipation-based decision making behavior as attention and combine it with our high-order GCN. Thus we first roughly estimate the future trajectories of all pedestrians with a simple method. By using the coarse predicted future trajectory and GCN outputs, we calculate the attention in our attention-based high-order GCN and predict future trajectory. Extensive experiments validate the effectiveness of our approach. In addition, our model shows a higher data efficiency. On the ETH&UCY dataset, using only 5\(\%\) of the training data for each training epoch, our model outperforms the state of the art.

中文翻译：

使用基于注意力的高阶 GCN 对人与人的交互进行建模以进行轨迹预测

本文提出了一种用于行人轨迹预测的新型高阶图卷积网络（GCN）。具体来说，目标行人的步行状态取决于其历史轨迹（编码其速度、步行方向和加速度信息）及其邻居的运动。因此，我们建议利用 GCN 来聚合目标行人及其邻居的轨迹特征来预测目标行人的运动。考虑到邻居邻居的运动会影响目标行人邻居的运动，从而间接影响目标行人的运动，我们建议使用高阶GCN进行人与人交互建模。这种高阶 GCN 既考虑目标行人的邻居，也考虑其邻居的邻居。更多，行人通过估计自己的位置和邻居即将到来的位置来避免与他人发生碰撞，如果认为可能发生碰撞，它会放慢速度或改变方向，尤其是在非常拥挤的场景中。鉴于此，我们建议将基于预期的决策行为建模为注意力，并将其与我们的高阶 GCN 相结合。因此，我们首先用一种简单的方法粗略估计所有行人的未来轨迹。通过使用粗略预测的未来轨迹和 GCN 输出，我们计算基于注意力的高阶 GCN 中的注意力并预测未来轨迹。大量实验验证了我们方法的有效性。此外，我们的模型显示出更高的数据效率。在 ETH&UCY 数据集上，仅使用 5 如果它认为可能发生碰撞，它会减慢速度或改变方向，尤其是在非常拥挤的场景中。鉴于此，我们建议将基于预期的决策行为建模为注意力，并将其与我们的高阶 GCN 相结合。因此，我们首先用一种简单的方法粗略估计所有行人的未来轨迹。通过使用粗略预测的未来轨迹和 GCN 输出，我们计算基于注意力的高阶 GCN 中的注意力并预测未来轨迹。大量实验验证了我们方法的有效性。此外，我们的模型显示出更高的数据效率。在 ETH&UCY 数据集上，仅使用 5 如果它认为可能发生碰撞，它会减慢速度或改变方向，尤其是在非常拥挤的场景中。鉴于此，我们建议将基于预期的决策行为建模为注意力，并将其与我们的高阶 GCN 相结合。因此，我们首先用一种简单的方法粗略估计所有行人的未来轨迹。通过使用粗略预测的未来轨迹和 GCN 输出，我们计算基于注意力的高阶 GCN 中的注意力并预测未来轨迹。大量实验验证了我们方法的有效性。此外，我们的模型显示出更高的数据效率。在 ETH&UCY 数据集上，仅使用 5 我们建议将这种基于预期的决策行为建模为注意力，并将其与我们的高阶 GCN 相结合。因此，我们首先用一种简单的方法粗略估计所有行人的未来轨迹。通过使用粗略预测的未来轨迹和 GCN 输出，我们计算基于注意力的高阶 GCN 中的注意力并预测未来轨迹。大量实验验证了我们方法的有效性。此外，我们的模型显示出更高的数据效率。在 ETH&UCY 数据集上，仅使用 5 我们建议将这种基于预期的决策行为建模为注意力，并将其与我们的高阶 GCN 相结合。因此，我们首先用一种简单的方法粗略估计所有行人的未来轨迹。通过使用粗略预测的未来轨迹和 GCN 输出，我们计算基于注意力的高阶 GCN 中的注意力并预测未来轨迹。大量实验验证了我们方法的有效性。此外，我们的模型显示出更高的数据效率。在 ETH&UCY 数据集上，仅使用 5 大量实验验证了我们方法的有效性。此外，我们的模型显示出更高的数据效率。在 ETH&UCY 数据集上，仅使用 5 大量实验验证了我们方法的有效性。此外，我们的模型显示出更高的数据效率。在 ETH&UCY 数据集上，仅使用 5\(\%\)每个训练时期的训练数据，我们的模型优于现有技术。

更新日期：2021-07-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文