当前位置: X-MOL 学术Comput. Intell. Neurosci. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Intelligence Is beyond Learning: A Context-Aware Artificial Intelligent System for Video Understanding
Computational Intelligence and Neuroscience ( IF 3.120 ) Pub Date : 2020-12-24 , DOI: 10.1155/2020/8813089
Ahmed Ghozia 1 , Gamal Attiya 1 , Emad Adly 1 , Nawal El-Fishawy 1
Affiliation  

Understanding video files is a challenging task. While the current video understanding techniques rely on deep learning, the obtained results suffer from a lack of real trustful meaning. Deep learning recognizes patterns from big data, leading to deep feature abstraction, not deep understanding. Deep learning tries to understand multimedia production by analyzing its content. We cannot understand the semantics of a multimedia file by analyzing its content only. Events occurring in a scene earn their meanings from the context containing them. A screaming kid could be scared of a threat or surprised by a lovely gift or just playing in the backyard. Artificial intelligence is a heterogeneous process that goes beyond learning. In this article, we discuss the heterogeneity of AI as a process that includes innate knowledge, approximations, and context awareness. We present a context-aware video understanding technique that makes the machine intelligent enough to understand the message behind the video stream. The main purpose is to understand the video stream by extracting real meaningful concepts, emotions, temporal data, and spatial data from the video context. The diffusion of heterogeneous data patterns from the video context leads to accurate decision-making about the video message and outperforms systems that rely on deep learning. Objective and subjective comparisons prove the accuracy of the concepts extracted by the proposed context-aware technique in comparison with the current deep learning video understanding techniques. Both systems are compared in terms of retrieval time, computing time, data size consumption, and complexity analysis. Comparisons show a significant efficient resource usage of the proposed context-aware system, which makes it a suitable solution for real-time scenarios. Moreover, we discuss the pros and cons of deep learning architectures.

中文翻译:

智力超越学习:用于视频理解的上下文感知人工智能系统

了解视频文件是一项艰巨的任务。尽管当前的视频理解技术依赖于深度学习,但是获得的结果却缺乏真正可信的含义。深度学习识别大数据中的模式,从而导致深度特征抽象,而不是深度理解。深度学习试图通过分析其内容来理解多媒体产品。我们仅通过分析多媒体文件的内容就无法理解其语义。场景中发生的事件从包含它们的上下文中获得其含义。一个尖叫的孩子可能会害怕威胁,或者被可爱的礼物吓到,或者只是在后院玩耍。人工智能是一个超越学习的异构过程。在本文中,我们将讨论AI的异质性,该过程包括先天知识,近似值,和情境意识。我们提出了一种上下文感知的视频理解技术,该技术使机器足够智能,可以理解视频流背后的消息。主要目的是通过从视频上下文中提取真正有意义的概念,情感,时间数据和空间数据来理解视频流。来自视频上下文的异构数据模式的扩散导致了有关视频消息的准确决策,并且胜过了依赖于深度学习的系统。客观和主观的比较证明了与当前的深度学习视频理解技术相比,所提出的上下文感知技术提取的概念的准确性。比较两种系统的检索时间,计算时间,数据大小消耗和复杂性分析。比较表明,所提出的上下文感知系统有效地利用了资源,这使其成为实时方案的合适解决方案。此外,我们讨论了深度学习架构的利弊。
更新日期:2020-12-24
down
wechat
bug