当前位置: X-MOL 学术J. Intell. Inf. Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Representing unstructured text semantics for reasoning purpose
Journal of Intelligent Information Systems ( IF 3.4 ) Pub Date : 2020-10-03 , DOI: 10.1007/s10844-020-00621-w
Zohre Moteshakker Arani , Ahmad Abdollahzadeh Barforoush , Hossein Shirazi

To interpret a natural language text using a machine, we need to convert its semantics into structured information. In the field of Natural Language Processing, multiple tasks have been designed and developed to interpret the semantics of an unstructured text, and change words into meanings. However, there are some challenges in directly using the output of these tasks in subsequent applications such as logical inference. There has been a growing interest in building and enhancing state-of-the-art semantic representation systems in recent years. However, most of these systems involve supervised models that benefit from manually annotated data, which is not accessible for a wide range of languages. This paper presents a new framework for modeling text in order to extract its information, and through an inference system, obtain new information that is not explicitly stated in the text, but could be logically inferred. This framework is based on Open Information Extraction and Semantic Web techniques for machine reading. We translate the text into a machine-readable representation by using Semantic Types Identification and Question-based Semantic Role Labeling, which could be used in low-resource languages. We integrate the extracted information into the background knowledge by using existing Semantic Web standards. The proposed framework could increase generalization of labelling and reduce ambiguities, therefore, it is an appropriate solution for preparing text for reasoning systems.

中文翻译:

出于推理目的表示非结构化文本语义

要使用机器解释自然语言文本,我们需要将其语义转换为结构化信息。在自然语言处理领域,已经设计和开发了多个任务来解释非结构化文本的语义,并将单词转化为含义。但是,在逻辑推理等后续应用中直接使用这些任务的输出存在一些挑战。近年来,人们对构建和增强最先进的语义表示系统越来越感兴趣。然而,这些系统中的大多数都涉及受益于手动注释数据的监督模型,而这些数据对于多种语言是不可访问的。本文提出了一种新的文本建模框架,以提取其信息,并通过推理系统,获取文本中未明确说明但可以从逻辑上推断出来的新信息。该框架基于用于机器阅读的开放信息提取和语义 Web 技术。我们通过使用语义类型识别和基于问题的语义角色标签将文本翻译成机器可读的表示,这可以在低资源语言中使用。我们使用现有的语义 Web 标准将提取的信息集成到背景知识中。所提出的框架可以增加标签的泛化并减少歧义,因此,它是为推理系统准备文本的合适解决方案。我们通过使用语义类型识别和基于问题的语义角色标签将文本翻译成机器可读的表示,这可以在低资源语言中使用。我们使用现有的语义 Web 标准将提取的信息集成到背景知识中。所提出的框架可以增加标签的泛化并减少歧义,因此,它是为推理系统准备文本的合适解决方案。我们通过使用语义类型识别和基于问题的语义角色标签将文本翻译成机器可读的表示,这可以在低资源语言中使用。我们使用现有的语义 Web 标准将提取的信息集成到背景知识中。所提出的框架可以增加标签的泛化并减少歧义,因此,它是为推理系统准备文本的合适解决方案。
更新日期:2020-10-03
down
wechat
bug