Recovering depth from stereo-input without using any oculomotor information: A computation model,Journal of Mathematical Psychology

当前位置： X-MOL 学术 › J. Math. Psychol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Recovering depth from stereo-input without using any oculomotor information: A computation model
Journal of Mathematical Psychology ( IF 1.8 ) Pub Date : 2021-04-29 , DOI: 10.1016/j.jmp.2021.102538
Tadamasa Sawada

Human beings can perceive depth by using binocular disparity, namely the difference between a stereo-pair of retinal images. It is commonly believed that the visual system requires oculomotor information about the relative orientation between the two eyes to perceive depth on the basis of binocular disparity. Such oculomotor information can be obtained from the efference copy of the oculomotor signal, or it can be derived from the vertical component of the binocular disparity (vertical disparity). Psychophysical studies have shown that stereo depth perception is affected by both the efference copy and by the vertical disparity. But note that any role for the efference signal for spatial perception is limited and the process using vertical disparity is slow. These effects are too restricted to explain depth perception under natural viewing conditions in which natural eye movements are made. In the present study, I describe a computational model that can recover depth from a stereo-pair of retinal images based only on the geometrical optics. This model treats depth recovery as a Direct problem rather than as an Inverse problem because it does not need to make use of any oculomotor information or of any a priori constraints. In this model, oculomotor information is not even implicitly recovered. Simply put, this paper shows that, at least from a computational perspective, oculomotor information and a priori constraints are not actually required for the stereo perception of depth. Having such a model allows us to discuss psychophysical and neurophysiological phenomena, as well as their mechanisms, without assuming that the human visual system needs oculomotor information to recover depth in a 3D scene from stereo retinal images.

中文翻译：

在不使用任何动眼信息的情况下从立体声输入恢复深度的计算模型

人类可以通过双眼视差感知深度，即一对立体视网膜图像之间的差异。通常认为，视觉系统需要关于两只眼睛之间的相对取向的动眼信息，以基于双眼视差来感知深度。这样的动眼信息可以从动眼信号的有效副本中获得，或者可以从双眼视差（垂直视差）的垂直分量中获得。心理物理研究表明，立体深度感知受复制效果和垂直视差的影响。但是请注意，对于空间感知，信号的作用是有限的，并且使用垂直视差的过程很慢。这些效果太局限了，无法解释在进行自然眼动的自然观看条件下的深度感知。在本研究中，我描述了一种计算模型，该模型可以仅基于几何光学从一对立体视网膜图像中恢复深度。该模型将深度恢复视为直接问题而不是逆问题，因为它不需要利用任何动眼信息或任何先验约束。在该模型中，动眼信息甚至没有被隐式恢复。简而言之，本文表明，至少从计算的角度来看，深度立体感实际上并不需要动眼信息和先验约束。有了这样的模型，我们就可以讨论心理生理和神经生理现象及其机制，而无需假设人类视觉系统需要动眼信息来从立体视网膜图像中恢复3D场景的深度。

更新日期：2021-04-29

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>