当前位置: X-MOL 学术Comput. Linguist. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Abstract Syntax as Interlingua: Scaling Up the Grammatical Framework from Controlled Languages to Robust Pipelines
Computational Linguistics ( IF 9.3 ) Pub Date : 2020-06-01 , DOI: 10.1162/coli_a_00378
Aarne Ranta 1 , Krasimir Angelov 1 , Normunds Gruzitis 2 , Prasanth Kolachina 1
Affiliation  

Abstract syntax is an interlingual representation used in compilers. Grammatical Framework(GF) applies the abstract syntax idea to natural languages. The development of GF started in 1998, first as a tool for controlled language implementations, where it has gained an established position in both academic and commercial projects. GF provides grammar resources for over 40 languages, enabling accurate generation and translation, as well as grammar engineering tools and components for mobile and web applications. On the research side, the focus in the last ten years has been on scaling up GF to wide-coverage language processing. The concept of abstract syntax offers a unified view on many other approaches: Universal Dependencies, WordNets, FrameNets, Construction Grammars, and Abstract Meaning Representations. This makes it possible for GF to utilize data from the other approaches and to build robust pipelines. In return, GF can contribute to data-driven approaches by methods to transfer resources from one language to others, to augment data by rule-based generation, to check the conistency of handannotated corpora, and to pipe analyses into high-precision semantic back ends. This paper gives an overview of the use of abstract syntax as interlingua through both established and emerging NLP applications involving GF.

中文翻译:

作为 Interlingua 的抽象语法:将语法框架从受控语言扩展到强大的管道

摘要语法是编译器中使用的一种跨语言表示。语法框架(GF)将抽象语法思想应用于自然语言。GF 的开发始于 1998 年,最初是作为受控语言实现的工具,在那里它在学术和商业项目中都获得了既定的地位。GF 提供 40 多种语言的语法资源,可实现准确的生成和翻译,以及用于移动和 Web 应用程序的语法工程工具和组件。在研究方面,过去十年的重点是将 GF 扩展到广泛覆盖的语言处理。抽象语法的概念提供了许多其他方法的统一视图:通用依赖关系、WordNets、FrameNets、构造语法和抽象含义表示。这使得 GF 可以利用来自其他方法的数据并构建强大的管道。作为回报,GF 可以通过将资源从一种语言转移到另一种语言、通过基于规则的生成来扩充数据、检查标注语料库的一致性以及将分析输送到高精度语义后端的方法,为数据驱动方法做出贡献. 本文概述了通过涉及 GF 的既定和新兴 NLP 应用程序使用抽象语法作为中间语。
更新日期:2020-06-01
down
wechat
bug