当前位置: X-MOL 学术Knowl. Based Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Sentence representation with manifold learning for biomedical texts
Knowledge-Based Systems ( IF 8.8 ) Pub Date : 2021-02-17 , DOI: 10.1016/j.knosys.2021.106869
Di Zhao , Jian Wang , Hongfei Lin , Yonghe Chu , Yan Wang , Yijia Zhang , Zhihao Yang

Sentence representation approaches based on deep learning have become a major part of natural language processing, and pretrained sentences have wide applications in biomedical texts. However, the geometric basis of sentence representations has not yet been carefully studied in biomedical texts. In this paper, we focus on exploiting the geometric structure of sentences to improve the biomedical text presentation effect. To mine the geometric structure information from sentence representations, we introduce manifold learning, which brings the similarity of sentences in Euclidean space closer to the sentence semantics, into biomedical sentence representations. First, we use the pretrained sentence representation method to obtain a representation of a biomedical text sentence and then use manifold learning to construct the adjacency graph structure of the sentence representation to characterize the local geometric structure information of the sentence representations, thus revealing the essential laws among the sentences. Through the manifold method, we can describe the potential relations among sentences, thus improving the effect based on downstream biomedical text tasks. Our sentence representation method was evaluated on biomedical text tasks. The experimental results show that our model achieved better results than several normal sentence representation methods.



中文翻译:

生物医学文本的句子表达与多种学习

基于深度学习的句子表示方法已经成为自然语言处理的重要组成部分,而经过预训练的句子在生物医学文本中具有广泛的应用。但是,尚未在生物医学文本中仔细研究过句子表示的几何基础。在本文中,我们专注于利用句子的几何结构来提高生物医学文本呈现效果。为了从句子表示中挖掘几何结构信息,我们引入了流形学习,它将欧氏空间中句子的相似性更接近于句子语义,从而将其引入生物医学句子表示中。第一的,我们使用预训练的句子表示方法来获取生物医学文本句子的表示,然后使用流形学习构造句子表示的邻接图结构来表征句子表示的局部几何结构信息,从而揭示出句子之间的基本规律。句子。通过流形方法,我们可以描述句子之间的潜在关系,从而提高基于下游生物医学文本任务的效果。我们的句子表示方法是在生物医学文本任务上进行评估的。实验结果表明,与几种常规句子表示方法相比,我们的模型取得了更好的效果。从而揭示了句子中的基本定律。通过流形方法,我们可以描述句子之间的潜在关系,从而提高基于下游生物医学文本任务的效果。我们的句子表示方法是在生物医学文本任务上进行评估的。实验结果表明,与几种常规句子表示方法相比,我们的模型取得了更好的效果。从而揭示了句子中的基本定律。通过流形方法,我们可以描述句子之间的潜在关系,从而提高基于下游生物医学文本任务的效果。我们的句子表示方法是在生物医学文本任务上进行评估的。实验结果表明,与几种常规句子表示方法相比,我们的模型取得了更好的效果。

更新日期:2021-02-25
down
wechat
bug