当前位置: X-MOL 学术arXiv.cs.RO › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting
arXiv - CS - Robotics Pub Date : 2021-08-03 , DOI: arxiv-2108.01316
Jiachen Li, Fan Yang, Hengbo Ma, Srikanth Malla, Masayoshi Tomizuka, Chiho Choi

Motion forecasting plays a significant role in various domains (e.g., autonomous driving, human-robot interaction), which aims to predict future motion sequences given a set of historical observations. However, the observed elements may be of different levels of importance. Some information may be irrelevant or even distracting to the forecasting in certain situations. To address this issue, we propose a generic motion forecasting framework (named RAIN) with dynamic key information selection and ranking based on a hybrid attention mechanism. The general framework is instantiated to handle multi-agent trajectory prediction and human motion forecasting tasks, respectively. In the former task, the model learns to recognize the relations between agents with a graph representation and to determine their relative significance. In the latter task, the model learns to capture the temporal proximity and dependency in long-term human motions. We also propose an effective double-stage training pipeline with an alternating training strategy to optimize the parameters in different modules of the framework. We validate the framework on both synthetic simulations and motion forecasting benchmarks in different domains, demonstrating that our method not only achieves state-of-the-art forecasting performance, but also provides interpretable and reasonable hybrid attention weights.

中文翻译:

RAIN:用于运动预测的强化混合注意推理网络

运动预测在各个领域(例如,自动驾驶、人机交互)中发挥着重要作用,其目的是在给定一组历史观察的情况下预测未来的运动序列。然而,观察到的元素可能具有不同的重要性级别。在某些情况下,某些信息可能与预测无关甚至会分散预测的注意力。为了解决这个问题,我们提出了一个通用的运动预测框架(命名为 RAIN),具有基于混合注意力机制的动态关键信息选择和排名。通用框架被实例化以分别处理多智能体轨迹预测和人体运动预测任务。在前一个任务中,模型学习使用图形表示识别代理之间的关系并确定它们的相对重要性。在后面的任务中,该模型学习捕捉长期人类运动中的时间接近度和依赖性。我们还提出了一种有效的双阶段训练管道,采用交替训练策略来优化框架不同模块中的参数。我们在不同领域的综合模拟和运动预测基准上验证了该框架,证明我们的方法不仅实现了最先进的预测性能,而且提供了可解释且合理的混合注意力权重。
更新日期:2021-08-04
down
wechat
bug