当前位置: X-MOL 学术arXiv.cs.IR › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Machine Reading of Hypotheses for Organizational Research Reviews and Pre-trained Models via R Shiny App for Non-Programmers
arXiv - CS - Information Retrieval Pub Date : 2021-06-30 , DOI: arxiv-2106.16102
Victor Zitian Chen, Felipe Montano-Campos, Wlodek Zadrozny, Evan Canfield

The volume of scientific publications in organizational research becomes exceedingly overwhelming for human researchers who seek to timely extract and review knowledge. This paper introduces natural language processing (NLP) models to accelerate the discovery, extraction, and organization of theoretical developments (i.e., hypotheses) from social science publications. We illustrate and evaluate NLP models in the context of a systematic review of stakeholder value constructs and hypotheses. Specifically, we develop NLP models to automatically 1) detect sentences in scholarly documents as hypotheses or not (Hypothesis Detection), 2) deconstruct the hypotheses into nodes (constructs) and links (causal/associative relationships) (Relationship Deconstruction ), and 3) classify the features of links in terms causality (versus association) and direction (positive, negative, versus nonlinear) (Feature Classification). Our models have reported high performance metrics for all three tasks. While our models are built in Python, we have made the pre-trained models fully accessible for non-programmers. We have provided instructions on installing and using our pre-trained models via an R Shiny app graphic user interface (GUI). Finally, we suggest the next paths to extend our methodology for computer-assisted knowledge synthesis.

中文翻译:

通过面向非程序员的 R Shiny 应用程序机器阅读组织研究评论和预训练模型的假设

对于寻求及时提取和审查知识的人类研究人员来说,组织研究中的科学出版物数量变得非常庞大。本文介绍了自然语言处理 (NLP) 模型,以加速从社会科学出版物中发现、提取和组织理论发展(即假设)。我们在对利益相关者价值构建和假设的系统审查的背景下说明和评估 NLP 模型。具体来说,我们开发了 NLP 模型来自动 1)检测学术文档中的句子是否为假设(假设检测),2)将假设解构为节点(构造)和链接(因果/关联关系)(关系解构),和 3)根据因果关系(与关联)和方向(正、负、与非线性)(特征分类)对链接的特征进行分类。我们的模型报告了所有三个任务的高性能指标。虽然我们的模型是用 Python 构建的,但我们已经让非程序员完全可以访问预训练的模型。我们通过 R Shiny 应用程序图形用户界面 (GUI) 提供了有关安装和使用我们的预训练模型的说明。最后,我们建议了下一条路径,以扩展我们的计算机辅助知识合成方法。我们通过 R Shiny 应用程序图形用户界面 (GUI) 提供了有关安装和使用我们的预训练模型的说明。最后,我们建议了下一条路径,以扩展我们的计算机辅助知识合成方法。我们通过 R Shiny 应用程序图形用户界面 (GUI) 提供了有关安装和使用我们的预训练模型的说明。最后,我们建议了下一条路径,以扩展我们的计算机辅助知识合成方法。
更新日期:2021-07-01
down
wechat
bug