当前位置: X-MOL 学术arXiv.cs.DL › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation
arXiv - CS - Digital Libraries Pub Date : 2021-04-11 , DOI: arxiv-2104.05111
Philipp Scharpf, Moritz Schubotz, Bela Gipp

Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge-bases that link mathematical expressions to their natural language names. For database population, mathematical formulae need to be annotated and linked to semantic concepts, which is very time-consuming. In this paper, we present our approach to structure and speed up this process by supporting annotators with a system that suggests formula names and meanings of mathematical identifiers. We test our approach annotating 25 articles on https://en.wikipedia.org. We evaluate the quality and time-savings of the annotation recommendations. Moreover, we watch editor reverts and comments on Wikipedia formula entity links and Wikidata item creation and population to ground the formula semantics. Our evaluation shows that the AI guidance was able to significantly speed up the annotation process by a factor of 1.4 for formulae and 2.4 for identifiers. Our contributions were reverted in 12% of the edited Wikipedia articles and 33% of the Wikidata items within a test window of one month. The >>AnnoMathTeX<< annotation recommender system is hosted by Wikimedia at https://annomathtex.wmflabs.org. In the future, our data refinement pipeline is ready to be integrated seamlessly into the Wikipedia user interface.

中文翻译:

使用注释建议快速链接维基百科文章中的数学Wikidata实体

诸如数学公式检索和问题解答系统之类的数学信息检索(MathIR)应用程序依赖于将数学表达式与其自然语言名称相关联的知识库。对于数据库填充,需要对数学公式进行注释并将其链接到语义概念,这非常耗时。在本文中,我们通过使用提示公式名称和数学标识符含义的系统来支持注释器,介绍了构建和加快此过程的方法。我们测试了我们的方法,并在https://en.wikipedia.org上注释了25篇文章。我们评估注释建议的质量和节省的时间。此外,我们观看编辑者对Wikipedia公式实体链接以及Wikidata项目创建和填充的回复和评论,以使公式语义成为基础。我们的评估表明,AI指导能够大大加快注释过程的速度,公式的1.4倍和标识符的2.4倍。在一个月的测试窗口内,已编辑的Wikipedia文章中的12%和Wikidata项中的33%恢复了我们的贡献。>> AnnoMathTeX <<注释推荐系统由Wikimedia托管在https://annomathtex.wmflabs.org。将来,我们的数据优化管道已准备就绪,可以无缝集成到Wikipedia用户界面中。注释推荐系统由Wikimedia托管在https://annomathtex.wmflabs.org。将来,我们的数据优化管道已准备就绪,可以无缝集成到Wikipedia用户界面中。注释推荐系统由Wikimedia托管在https://annomathtex.wmflabs.org。将来,我们的数据优化管道已准备就绪,可以无缝集成到Wikipedia用户界面中。
更新日期:2021-04-13
down
wechat
bug