A Methodology of Using a Concordancer and Table Processor for Authorship Attribution
Automatic Documentation and Mathematical Linguistics (IF 0.5), Pub Date: 2020-12-11, DOI: 10.3103/s0005105520050088
V. A. Yatsko

The paper proposes an original methodology for authorship attribution based on deviations from the Zipf distribution, using statistical data obtained with a concordance program and computations performed in a table processor. The methodology finds distances between input texts and a reference text based on deviations in stop-word frequencies. The results show that the proposed methodology performs authorship attribution efficiently and that it can be used in teaching to develop students' skills and competencies in natural language processing.
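The abstract does not give the formulas, so the following Python sketch is only one plausible reading of the idea, not the paper's procedure. It assumes an idealized Zipf law in which the expected relative frequency of the word at rank r is the top word's frequency divided by r, takes the deviation of a stop word as observed minus expected relative frequency, and measures distance between two texts as the sum of absolute differences of those deviations. The stop-word list, the function names, and the choice of distance are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the paper's actual list is not given in the abstract.
STOP_WORDS = ["the", "of", "and", "a", "in", "to"]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def zipf_deviations(text, stop_words=STOP_WORDS):
    """Return {stop word: observed - expected relative frequency}.

    Assumption: expected frequency follows an idealized Zipf law,
    f(r) = f(1) / r, where r is the word's frequency rank in this text.
    Stop words absent from the text get a deviation of 0.0 (a simplification).
    """
    tokens = tokenize(text)
    total = len(tokens)
    if total == 0:
        return {w: 0.0 for w in stop_words}
    counts = Counter(tokens)
    # Rank all words by descending count; ties are broken alphabetically.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    top_freq = ranked[0][1] / total
    rank_of = {word: i + 1 for i, (word, _) in enumerate(ranked)}
    deviations = {}
    for w in stop_words:
        if w in rank_of:
            observed = counts[w] / total
            expected = top_freq / rank_of[w]
            deviations[w] = observed - expected
        else:
            deviations[w] = 0.0
    return deviations


def distance(text, reference, stop_words=STOP_WORDS):
    """Sum of absolute differences between the two texts' deviations."""
    d_text = zipf_deviations(text, stop_words)
    d_ref = zipf_deviations(reference, stop_words)
    return sum(abs(d_text[w] - d_ref[w]) for w in stop_words)
```

Under these assumptions, an input text would be attributed to the candidate author whose reference text yields the smallest distance. The same per-word columns (rank, observed frequency, expected frequency, deviation) could equally be laid out in a spreadsheet, which is closer to the table-processor workflow the abstract describes.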




Updated: 2020-12-12