当前位置: X-MOL 学术Journal of Slavic Linguistics › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Slavic Corpus and Computational Linguistics
Journal of Slavic Linguistics Pub Date : 2017-01-01 , DOI: 10.1353/jsl.2017.0008
Dagmar Divjak , Serge Sharoff , Tomaž Erjavec

Abstract:In this paper we focus on corpus-linguistic studies that address theoretical questions and on computational linguistic work on corpus annotation that makes corpora useful for linguistic analysis. First we discuss why the corpus linguistic approach was discredited by generative linguists in the second half of the 20th century, how it made a comeback through advances in computing and was finally adopted by usage-based linguistics at the beginning of the 21st century. Then we move on to an overview of necessary and common annotation layers and the issues that are encountered when performing automatic annotation, with special emphasis on Slavic languages. Finally we survey the types of research requiring corpora that Slavic linguists are involved in worldwide, and the resources they have at their disposal.

中文翻译:

斯拉夫语料库和计算语言学

摘要:在本文中,我们专注于解决理论问题的语料库语言学研究和关于语料库注释的计算语言学工作,这使得语料库可用于语言分析。首先我们讨论为什么语料库语言方法在 20 世纪下半叶被生成语言学家抹黑,它如何通过计算的进步卷土重来,并最终在 21 世纪初被基于用法的语言学采用。然后我们继续概述必要和常见的注释层以及执行自动注释时遇到的问题,特别强调斯拉夫语言。最后,我们调查了斯拉夫语言学家在世界范围内参与的需要语料库的研究类型,以及他们可以使用的资源。
更新日期:2017-01-01
down
wechat
bug