Multilingual enrichment of disease biomedical ontologies,arXiv - CS - Information Retrieval

当前位置： X-MOL 学术 › arXiv.cs.IR › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Multilingual enrichment of disease biomedical ontologies
arXiv - CS - Information Retrieval Pub Date : 2020-04-07 , DOI: arxiv-2004.03181
L\'eo Bouscarrat (QARMA, TALEP), Antoine Bonnefoy, C\'ecile Capponi (LIF, QARMA), Carlos Ramisch (TALEP)

Translating biomedical ontologies is an important challenge, but doing it manually requires much time and money. We study the possibility to use open-source knowledge bases to translate biomedical ontologies. We focus on two aspects: coverage and quality. We look at the coverage of two biomedical ontologies focusing on diseases with respect to Wikidata for 9 European languages (Czech, Dutch, English, French, German, Italian, Polish, Portuguese and Spanish) for both ontologies, plus Arabic, Chinese and Russian for the second one. We first use direct links between Wikidata and the studied ontologies and then use second-order links by going through other intermediate ontologies. We then compare the quality of the translations obtained thanks to Wikidata with a commercial machine translation tool, here Google Cloud Translation.

中文翻译：

疾病生物医学本体的多语言丰富

翻译生物医学本体是一项重要挑战，但手动完成需要大量时间和金钱。我们研究了使用开源知识库来翻译生物医学本体的可能性。我们专注于两个方面：覆盖率和质量。我们研究了两种生物医学本体论的覆盖范围，这些本体论针对 9 种欧洲语言（捷克语、荷兰语、英语、法语、德语、意大利语、波兰语、葡萄牙语和西班牙语）的维基数据，以及针对两种本体的维基数据，以及阿拉伯语、中文和俄语第二个。我们首先使用维基数据和研究的本体之间的直接链接，然后通过其他中间本体使用二阶链接。然后，我们将借助 Wikidata 获得的翻译质量与商业机器翻译工具（此处为 Google Cloud Translation）进行比较。

更新日期：2020-04-08

点击分享查看原文

点击收藏

阅读更多本刊最新论文