当前位置:
X-MOL 学术
›
arXiv.cs.MM
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
Visual Methods for Sign Language Recognition: A Modality-Based Review
arXiv - CS - Multimedia Pub Date : 2020-09-22 , DOI: arxiv-2009.10370 Bassem Seddik and Najoua Essoukri Ben Amara
arXiv - CS - Multimedia Pub Date : 2020-09-22 , DOI: arxiv-2009.10370 Bassem Seddik and Najoua Essoukri Ben Amara
Sign language visual recognition from continuous multi-modal streams is still
one of the most challenging fields. Recent advances in human actions recognition are exploiting the ascension of
GPU-based learning from massive data, and are getting closer to human-like
performances. They are then prone to creating interactive services for the deaf and
hearing-impaired communities. A population that is expected to grow considerably in the years to come. This paper aims at reviewing the human actions recognition literature with
the sign-language visual understanding as a scope. The methods analyzed will be mainly organized according to the different
types of unimodal inputs exploited, their relative multi-modal combinations and
pipeline steps. In each section, we will detail and compare the related datasets, approaches
then distinguish the still open contribution paths suitable for the creation of
sign language related services. Special attention will be paid to the approaches and commercial solutions
handling facial expressions and continuous signing.
中文翻译:
手语识别的视觉方法:基于模态的评论
来自连续多模态流的手语视觉识别仍然是最具挑战性的领域之一。人类动作识别的最新进展正在利用基于 GPU 的海量数据学习的提升,并且越来越接近于类人的表现。然后,他们倾向于为聋人和听障社区创建互动服务。预计未来几年人口将大幅增长。本文旨在回顾以手语视觉理解为范围的人类行为识别文献。分析的方法将主要根据所利用的不同类型的单模态输入、它们的相关多模态组合和管道步骤来组织。在每个部分,我们将详细和比较相关的数据集,方法然后区分适合创建手语相关服务的仍然开放的贡献路径。将特别关注处理面部表情和连续签名的方法和商业解决方案。
更新日期:2020-09-23
中文翻译:
手语识别的视觉方法:基于模态的评论
来自连续多模态流的手语视觉识别仍然是最具挑战性的领域之一。人类动作识别的最新进展正在利用基于 GPU 的海量数据学习的提升,并且越来越接近于类人的表现。然后,他们倾向于为聋人和听障社区创建互动服务。预计未来几年人口将大幅增长。本文旨在回顾以手语视觉理解为范围的人类行为识别文献。分析的方法将主要根据所利用的不同类型的单模态输入、它们的相关多模态组合和管道步骤来组织。在每个部分,我们将详细和比较相关的数据集,方法然后区分适合创建手语相关服务的仍然开放的贡献路径。将特别关注处理面部表情和连续签名的方法和商业解决方案。