Eye-gaze Estimation with HEOG and Neck EMG using Deep Neural Networks,arXiv - CS - Human-Computer Interaction

当前位置： X-MOL 学术 › arXiv.cs.HC › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Eye-gaze Estimation with HEOG and Neck EMG using Deep Neural Networks
arXiv - CS - Human-Computer Interaction Pub Date : 2021-03-03 , DOI: arxiv-2103.02186
Zhen Fu, Bo Wang, Fei Chen, Xihong Wu, Jing Chen

Hearing-impaired listeners usually have troubles attending target talker in multi-talker scenes, even with hearing aids (HAs). The problem can be solved with eye-gaze steering HAs, which requires listeners eye-gazing on the target. In a situation where head rotates, eye-gaze is subject to both behaviors of saccade and head rotation. However, existing methods of eye-gaze estimation did not work reliably, since the listener's strategy of eye-gaze varies and measurements of the two behaviors were not properly combined. Besides, existing methods were based on hand-craft features, which could overlook some important information. In this paper, a head-fixed and a head-free experiments were conducted. We used horizontal electrooculography (HEOG) and neck electromyography (NEMG), which separately measured saccade and head rotation to commonly estimate eye-gaze. Besides traditional classifier and hand-craft features, deep neural networks (DNN) were introduced to automatically extract features from intact waveforms. Evaluation results showed that when the input was HEOG with inertial measurement unit, the best performance of our proposed DNN classifiers achieved 93.3%; and when HEOG was with NEMG together, the accuracy reached 72.6%, higher than that with HEOG (about 71.0%) or NEMG (about 35.7%) alone. These results indicated the feasibility to estimate eye-gaze with HEOG and NEMG.

中文翻译：

使用深度神经网络的HEOG和颈部EMG的视线估计

听力受损的听众即使在使用助听器（HAs）的情况下，通常也很难在多说话者场景中陪伴目标说话者。该问题可以通过视线转向HA来解决，该方法要求听众目光对准目标。在头部旋转的情况下，视线会受到扫视和头部旋转的影响。但是，现有的眼神估计方法无法可靠地工作，因为听众的眼神策略会发生变化，并且两种行为的测量结果无法正确组合。此外，现有方法基于手工功能，可能会忽略一些重要信息。在本文中，进行了头固定和无头实验。我们使用了水平眼电图（HEOG）和颈部肌电图（NEMG），分别测量扫视和头部旋转以通常估计视线。除了传统的分类器和手工特征外，还引入了深度神经网络（DNN），可以从完整的波形中自动提取特征。评估结果表明，当输入为带有惯性测量单元的HEOG时，我们提出的DNN分类器的最佳性能达到了93.3％；当HEOG与NEMG一起使用时，准确度达到72.6％，高于单独使用HEOG（约71.0％）或NEMG（约35.7％）的准确性。这些结果表明用HEOG和NEMG估计眼睛注视的可行性。评估结果表明，当输入为带有惯性测量单元的HEOG时，我们提出的DNN分类器的最佳性能达到了93.3％；当HEOG与NEMG一起使用时，准确度达到72.6％，高于单独使用HEOG（约71.0％）或NEMG（约35.7％）的准确性。这些结果表明用HEOG和NEMG估计眼睛注视的可行性。评估结果表明，当输入为带有惯性测量单元的HEOG时，我们提出的DNN分类器的最佳性能达到了93.3％；当HEOG与NEMG一起使用时，准确度达到72.6％，高于单独使用HEOG（约71.0％）或NEMG（约35.7％）的准确性。这些结果表明用HEOG和NEMG估计眼睛注视的可行性。

更新日期：2021-03-04

点击分享查看原文

点击收藏

阅读更多本刊最新论文