Studying the Effect of Syntactic Simplification on Text Summarization
IETE Technical Review (IF 2.5) Pub Date: 2022-03-31, DOI: 10.1080/02564602.2022.2055670
Niladri Chatterjee, Raksha Agarwal

The need for automatic text summarization (ATS) has increased manifold in recent times owing to the overwhelming growth of textual data available in electronic form. However, existing ATS systems suffer from two major shortcomings. Extractive summarizers, that is, those that select important sentences of a document in their original form as output, tend to copy irrelevant or unimportant parts of the input text into the summary. Abstractive summarizers, that is, those that produce a gist of limited size from the original document, often fail to include important content in the generated summary. Simplifying the input text before submitting it to an ATS system may alleviate both difficulties. The present work examines the effectiveness of input simplification for five known ATS systems. The DEPSYM++ simplifier is used for this purpose; it carries out four kinds of simplification on sentences of the input text, triggered by the presence of an appositive clause, a relative clause, a conjoint clause, or passive voice. Experiments on three gold-standard data sets, under several evaluation metrics commonly used to assess summarizers, yield very encouraging results.
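The simplify-then-summarize idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: `drop_appositives` is a hypothetical regex-based stand-in that merely deletes an appositive-like span (it is not DEPSYM++ and does not reproduce its behavior), and `extractive_summary` is a toy word-frequency ranker, not one of the five ATS systems evaluated in the paper.

```python
import re
from collections import Counter


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def drop_appositives(sentence):
    """Hypothetical toy simplifier (NOT DEPSYM++): delete a comma-delimited
    span that starts with 'a', 'an' or 'the', a rough appositive cue."""
    return re.sub(r",\s+(?:a|an|the)\s[^,]*,", "", sentence)


def _tokens(sentence):
    """Lowercased alphabetic tokens of a sentence."""
    return re.findall(r"[a-z]+", sentence.lower())


def extractive_summary(sentences, k=1):
    """Toy extractive summarizer: score each sentence by the summed
    document frequency of its words and return the k best, in order."""
    freq = Counter(w for s in sentences for w in _tokens(s))
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: sum(freq[w] for w in _tokens(sentences[i])),
        reverse=True,
    )
    return [sentences[i] for i in sorted(ranked[:k])]


def summarize(text, simplify=True, k=1):
    """Optionally simplify every sentence, then summarize extractively."""
    sentences = split_sentences(text)
    if simplify:
        sentences = [drop_appositives(s) for s in sentences]
    return extractive_summary(sentences, k)


if __name__ == "__main__":
    doc = "Delhi, the capital of India, is large. It rains."
    print(summarize(doc, simplify=False))
    print(summarize(doc, simplify=True))
```

With simplification switched on, the selected sentence no longer carries the appositive span, which is the kind of effect on extractive output that the abstract points to. A real simplifier would work on a dependency parse rather than on surface punctuation.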




Updated: 2022-03-31