当前位置: X-MOL 学术Cogn. Syst. Res. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Role of the Secondary Visual Cortex in HMAX Model for Object Recognition
Cognitive Systems Research ( IF 2.1 ) Pub Date : 2020-12-01 , DOI: 10.1016/j.cogsys.2020.07.001
Hiwa Sufikarimi , Karim Mohammadi

Abstract The models inspired by visual systems of life creatures (e.g., human, mammals, etc.) have been very successful in addressing object recognition tasks. For example, Hierarchical Model And X (HMAX) effectively recognizes different objects by modeling the V1, V4, and IT regions of the human visual system. Although HMAX is one of the superior models in the field of object recognition, its implementation has been limited due to some disadvantages such as the unrepeatability of the process under constant conditions, extreme redundancy, high computational load, and time-consuming. In this paper, we aim at revising the HMAX approach by adding the model of the secondary region (V2) in the human visual system which leads to removing the mentioned drawbacks of standard HMAX. The added layer selects repeatable and more informative features that increase the accuracy of the proposed method by avoiding the redundancy existing in the conventional approaches. Furthermore, this feature selection strategy considerably reduces the huge computational load. Another contribution of our model is highlighted when a small number of training images is available where our model can efficiently cope with this issue. We evaluate our proposed approach using Caltech5 and GRAZ-02 database as two famous benchmarks for object recognition tasks. Additionally, the results are compared with standard HMAX that validate and highlight the efficiency of the proposed method.

中文翻译:

辅助视觉皮层在 HMAX 对象识别模型中的作用

摘要 受生命生物(例如人类、哺乳动物等)视觉系统启发的模型在解决物体识别任务方面非常成功。例如,Hierarchical Model And X (HMAX) 通过对人类视觉系统的 V1、V4 和 IT 区域进行建模来有效识别不同的对象。HMAX虽然是物体识别领域的优势模型之一,但由于过程在恒定条件下的不可重复性、极度冗余、计算量大、耗时等缺点,其实现受到了限制。在本文中,我们旨在通过在人类视觉系统中添加次要区域 (V2) 的模型来修改 HMAX 方法,从而消除标准 HMAX 的上述缺点。添加的层选择可重复且信息量更大的特征,通过避免传统方法中存在的冗余来提高所提出方法的准确性。此外,这种特征选择策略大大减少了巨大的计算量。当我们的模型可以有效地处理这个问题的少量训练图像可用时,我们的模型的另一个贡献就突出了。我们使用 Caltech5 和 GRAZ-02 数据库作为对象识别任务的两个著名基准来评估我们提出的方法。此外,将结果与标准 HMAX 进行比较,验证并突出了所提出方法的效率。这种特征选择策略大大减少了巨大的计算量。当我们的模型可以有效地处理这个问题的少量训练图像可用时,我们的模型的另一个贡献就突出了。我们使用 Caltech5 和 GRAZ-02 数据库作为对象识别任务的两个著名基准来评估我们提出的方法。此外,将结果与标准 HMAX 进行比较,验证并突出了所提出方法的效率。这种特征选择策略大大减少了巨大的计算量。当我们的模型可以有效地处理这个问题的少量训练图像可用时,我们的模型的另一个贡献就突出了。我们使用 Caltech5 和 GRAZ-02 数据库作为对象识别任务的两个著名基准来评估我们提出的方法。此外,将结果与标准 HMAX 进行比较,验证并突出了所提出方法的效率。
更新日期:2020-12-01
down
wechat
bug