Content4All Open Research Sign Language Translation Datasets, arXiv - CS - Computer Vision and Pattern Recognition


Content4All Open Research Sign Language Translation Datasets
arXiv - CS - Computer Vision and Pattern Recognition, Pub Date: 2021-05-05, DOI: arxiv-2105.02351
Necati Cihan Camgoz, Ben Saunders, Guillaume Rochette, Marco Giovanelli, Giacomo Inches, Robin Nachtrab-Ribback, Richard Bowden

Computational sign language research lacks the large-scale datasets that enable the creation of useful real-life applications. To date, most research has been limited to prototype systems on small domains of discourse, e.g. weather forecasts. To address this issue and to push the field forward, we release six datasets comprising 190 hours of footage on the larger domain of news. Of this, 20 hours of footage have been annotated by Deaf experts and interpreters and are made publicly available for research purposes. In this paper, we share the dataset collection process and the tools developed to enable the alignment of sign language video and subtitles, as well as baseline translation results to underpin future research.


Updated: 2021-05-07
