当前位置: X-MOL 学术Lang. Resour. Eval. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
NorthEuraLex: a wide-coverage lexical database of Northern Eurasia
Language Resources and Evaluation ( IF 2.7 ) Pub Date : 2019-11-30 , DOI: 10.1007/s10579-019-09480-6
Johannes Dellert 1 , Thora Daneyko 1 , Alla Münch 1 , Alina Ladygina 1 , Armin Buch 1 , Natalie Clarius 1 , Ilja Grigorjew 1 , Mohamed Balabel 1 , Hizniye Isabella Boga 1 , Zalina Baysarova 1 , Roland Mühlenbernd 1 , Johannes Wahle 1 , Gerhard Jäger 1
Affiliation  

This article describes the first release version of a new lexicostatistical database of Northern Eurasia, which includes Europe as the most well-researched linguistic area. Unlike in other areas of the world, where databases are restricted to covering a small number of concepts as far as possible based on often sparse documentation, good lexical resources providing wide coverage of the lexicon are available even for many smaller languages in our target area. This makes it possible to attain near-completeness for a substantial number of concepts. The resulting database provides a basis for rich benchmarks that can be used to test automated methods which aim to derive new knowledge about language history in underresearched areas.

中文翻译:

NorthEuraLex:北欧亚大陆覆盖面广的词汇数据库

本文介绍了欧亚大陆北部新词汇统计数据库的第一个发布版本,该数据库将欧洲作为研究最充分的语言区域。与世界其他地区不同,在这些地区,数据库仅限于基于通常稀疏的文档尽可能涵盖少量概念,即使是我们目标地区的许多较小的语言,也可以使用提供广泛词汇覆盖的良好词汇资源。这使得大量概念的接近完整性成为可能。生成的数据库为丰富的基准测试提供了基础,这些基准测试可用于测试旨在在研究不足的领域中获取有关语言历史的新知识的自动化方法。
更新日期:2019-11-30
down
wechat
bug