当前位置: X-MOL 学术Expert Syst. Appl. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
TransExplain: Using neural networks to find suitable explanations for Chinese phrases
Expert Systems with Applications ( IF 7.5 ) Pub Date : 2021-06-19 , DOI: 10.1016/j.eswa.2021.115440
Rongsheng Li , Zesong Li , Shaobin Huang , Ye Liu , Jiyu Qiu

When reading an article, especially a professional article, we often encounter words or phrases that we don't recognize. They may be specific domain terms or emerging entities. When we can't guess their meaning from the article, we generally refer to the terminology dictionary or search for related content on the Internet to understand them. Some researchers have used natural language generation (NLG) models to explain these unknown phrases in recent years automatically. Still, current NLG models have difficulties generating long sentences with good coherence, and they are difficult to generate multiple sentences that describe unknown phrases from different angles. Therefore, this paper proposes a model that can judge whether an existing sentence can explain a certain phrase, called TransExplain. TransExplain can use LSTM, convolutional neural network, and attention mechanism to extract multiple sentence features and map them to a fixed-dimensional semantic feature vector. By calculating the cosine similarity between the semantic feature vector and the unknown phrase vector, it is judged whether the sentence can explain the semantics of the unknown phrase. And a loss function called positive and negative means square error is introduced to improve the model's ability that distinguishes negative examples. For this task, we provided a Chinese dataset containing phrases and explanation pairs in 7 important domains. On this dataset, TransExplain can achieve better results than previous similar tasks.



中文翻译:

TransExplain:使用神经网络为中文短语寻找合适的解释

在阅读文章,尤其是专业文章时,我们经常会遇到不认识的单词或短语。它们可能是特定的领域术语或新兴实体。当我们无法从文章中猜出它们的含义时,我们一般会参考术语词典或在互联网上搜索相关内容来理解它们。近年来,一些研究人员使用自然语言生成 (NLG) 模型来自动解释这些未知短语。尽管如此,目前的 NLG 模型难以生成具有良好连贯性的长句,并且难以生成多个从不同角度描述未知短语的句子。因此,本文提出了一种可以判断现有句子是否可以解释某个短语的模型,称为TransExplain。TransExplain 可以使用 LSTM,卷积神经网络和注意力机制提取多个句子特征并将它们映射到固定维度的语义特征向量。通过计算语义特征向量与未知短语向量的余弦相似度,判断句子是否能解释未知短语的语义。并且引入了一个叫做正负均方误差的损失函数来提高模型区分负样本的能力。对于这项任务,我们提供了一个包含 7 个重要领域的短语和解释对的中文数据集。在这个数据集上,TransExplain 可以取得比之前类似任务更好的结果。通过计算语义特征向量与未知短语向量的余弦相似度,判断句子是否能解释未知短语的语义。并且引入了一个叫做正负均方误差的损失函数来提高模型区分负样本的能力。对于这项任务,我们提供了一个包含 7 个重要领域的短语和解释对的中文数据集。在这个数据集上,TransExplain 可以取得比之前类似任务更好的结果。通过计算语义特征向量与未知短语向量的余弦相似度,判断句子是否能解释未知短语的语义。并且引入了一个叫做正负均方误差的损失函数来提高模型区分负样本的能力。对于这项任务,我们提供了一个包含 7 个重要领域的短语和解释对的中文数据集。在这个数据集上,TransExplain 可以取得比之前类似任务更好的结果。对于这项任务,我们提供了一个包含 7 个重要领域的短语和解释对的中文数据集。在这个数据集上,TransExplain 可以取得比之前类似任务更好的结果。对于这项任务,我们提供了一个包含 7 个重要领域的短语和解释对的中文数据集。在这个数据集上,TransExplain 可以取得比之前类似任务更好的结果。

更新日期:2021-06-24
down
wechat
bug