当前位置: X-MOL 学术arXiv.cs.CL › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Spanish Biomedical and Clinical Language Embeddings
arXiv - CS - Computation and Language Pub Date : 2021-02-25 , DOI: arxiv-2102.12843
Asier Gutiérrez-Fandiño, Jordi Armengol-Estapé, Casimiro Pio Carrino, Ona De Gibert, Aitor Gonzalez-Agirre, Marta Villegas

We computed both Word and Sub-word Embeddings using FastText. For Sub-word embeddings we selected Byte Pair Encoding (BPE) algorithm to represent the sub-words. We evaluated the Biomedical Word Embeddings obtaining better results than previous versions showing the implication that with more data, we obtain better representations.

中文翻译:

西班牙生物医学和临床语言嵌入

我们使用FastText计算了单词和子单词嵌入。对于子词嵌入,我们选择了字节对编码(BPE)算法来表示子词。我们评估了生物医学单词嵌入比以前的版本获得更好的结果,这表明如果使用更多的数据,我们可以获得更好的表示。
更新日期:2021-02-26
down
wechat
bug