当前位置: X-MOL 学术arXiv.cs.DB › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Towards a Natural Language Query Processing System
arXiv - CS - Databases Pub Date : 2020-09-25 , DOI: arxiv-2009.12414
Chantal Montgomery, Haruna Isah, Farhana Zulkernine

Tackling the information retrieval gap between non-technical database end-users and those with the knowledge of formal query languages has been an interesting area of data management and analytics research. The use of natural language interfaces to query information from databases offers the opportunity to bridge the communication challenges between end-users and systems that use formal query languages. Previous research efforts mainly focused on developing structured query interfaces to relational databases. However, the evolution of unstructured big data such as text, images, and video has exposed the limitations of traditional structured query interfaces. While the existing web search tools prove the popularity and usability of natural language query, they return complete documents and web pages instead of focused query responses and are not applicable to database systems. This paper reports our study on the design and development of a natural language query interface to a backend relational database. The novelty in the study lies in defining a graph database as a middle layer to store necessary metadata needed to transform a natural language query into structured query language that can be executed on backend databases. We implemented and evaluated our approach using a restaurant dataset. The translation results for some sample queries yielded a 90% accuracy rate.

中文翻译:

走向自然语言查询处理系统

解决非技术数据库最终用户与具有正式查询语言知识的用户之间的信息检索差距一直是数据管理和分析研究的一个有趣领域。使用自然语言接口从数据库查询信息提供了在最终用户和使用正式查询语言的系统之间架起沟通挑战的机会。以前的研究工作主要集中在开发关系数据库的结构化查询接口。然而,文本、图像、视频等非结构化大数据的演进暴露了传统结构化查询接口的局限性。虽然现有的网络搜索工具证明了自然语言查询的流行性和可用性,它们返回完整的文档和网页,而不是集中的查询响应,并且不适用于数据库系统。本文报告了我们对后端关系数据库的自然语言查询接口的设计和开发的研究。该研究的新颖之处在于将图形数据库定义为中间层,以存储将自然语言查询转换为可在后端数据库上执行的结构化查询语言所需的必要元数据。我们使用餐厅数据集实施并评估了我们的方法。一些示例查询的翻译结果产生了 90% 的准确率。该研究的新颖之处在于将图形数据库定义为中间层,以存储将自然语言查询转换为可在后端数据库上执行的结构化查询语言所需的必要元数据。我们使用餐厅数据集实施并评估了我们的方法。一些示例查询的翻译结果产生了 90% 的准确率。该研究的新颖之处在于将图形数据库定义为中间层,以存储将自然语言查询转换为可在后端数据库上执行的结构化查询语言所需的必要元数据。我们使用餐厅数据集实施并评估了我们的方法。一些示例查询的翻译结果产生了 90% 的准确率。
更新日期:2020-09-29
down
wechat
bug