Latexify Math: Mathematical Formula Markup Revision to Assist Collaborative Editing in Math Q&A Sites
arXiv - CS - Software Engineering. Pub Date: 2021-09-20, DOI: arxiv-2109.09343
Suyu Ma, Chunyang Chen, Hourieh Khalajzadeh, John Grundy

Collaborative editing of questions and answers plays an important role in quality control on Mathematics Stack Exchange, a math Q&A site. Our study of post edits on Mathematics Stack Exchange shows that a large number of math-related edits involve latexifying formulas, revising LaTeX, and converting blurred screenshots of math formulas into LaTeX sequences. Despite its importance, manually editing a math-related post, especially one with complex mathematical formulas, is time-consuming and error-prone even for experienced users. To assist post owners and editors with this editing, we have developed an edit-assistance tool, MathLatexEdit, for formula latexification, LaTeX revision, and screenshot transcription. We formulate this formula editing task as a translation problem, in which an original post is translated into a revised post. MathLatexEdit implements a deep-learning-based approach comprising two encoder-decoder models for textual and visual LaTeX edit recommendation, with math-specific inference. The two models are trained on large-scale historical pairs of original and edited posts and on synthesized screenshot-formula pairs. Our evaluation of MathLatexEdit demonstrates not only the accuracy of our model but also the usefulness of MathLatexEdit in editing real-world posts that are accepted on Mathematics Stack Exchange.
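
The translation framing in the abstract can be illustrated with a minimal sketch of how one original/edited post pair might be turned into a source/target token sequence pair for an encoder-decoder model. The tokenizer, the function names, and the example post below are all assumptions made for illustration; the abstract does not specify how MathLatexEdit tokenizes or preprocesses posts.

```python
import re

# Hypothetical tokenizer (not taken from the paper): a token is a LaTeX
# command such as \frac, a run of digits, a run of letters, or any other
# single non-space character.
TOKEN_RE = re.compile(r"\\[A-Za-z]+|\d+|[A-Za-z]+|\S")


def tokenize(text):
    """Split post text into a flat list of tokens."""
    return TOKEN_RE.findall(text)


def make_pair(original, edited):
    """Build one (source, target) training example from a post edit."""
    return tokenize(original), tokenize(edited)


# Invented example: a plain-text formula and its latexified revision.
src, tgt = make_pair("x^2 + 1/2", r"$x^2 + \frac{1}{2}$")
print(src)
print(tgt)
```

In this framing, a sequence-to-sequence model would be trained to emit the target tokens given the source tokens, so formula latexification and LaTeX revision become instances of the same translation task.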

Updated: 2021-09-21