Using parsed and annotated corpora to analyze parliamentarians' talk in Finland,Journal of the Association for Information Science and Technology

当前位置： X-MOL 学术 › J. Assoc. Inf. Sci. Technol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Using parsed and annotated corpora to analyze parliamentarians' talk in Finland
Journal of the Association for Information Science and Technology ( IF 2.8 ) Pub Date : 2021-06-04 , DOI: 10.1002/asi.24500
Mykola Andrushchenko ₁ , Kirsi Sandberg ₁ , Risto Turunen ₁ , Jani Marjanen ₂ , Mari Hatavara ₁ , Jussi Kurunmäki ₁ , Timo Nummenmaa ₁ , Matti Hyvärinen ₁ , Kari Teräs ₁ , Jaakko Peltonen ₁ , Jyrki Nummenmaa ₁

Affiliation

We present a search system for grammatically analyzed corpora of Finnish parliamentary records and interviews with former parliamentarians, annotated with metadata of talk structure and involved parliamentarians, and discuss their use through carefully chosen digital humanities case studies. We first introduce the construction, contents, and principles of use of the corpora. Then we discuss the application of the search system and the corpora to study how politicians talk about power, how ideological terms are used in political speech, and how to identify narratives in the data. All case studies stem from questions in the humanities and the social sciences, but rely on the grammatically parsed corpora in both identifying and quantifying passages of interest. Finally, the paper discusses the role of natural language processing methods for questions in the (digital) humanities. It makes the claim that a digital humanities inquiry of parliamentary speech and interviews with politicians cannot only rely on computational humanities modeling, but needs to accommodate a range of perspectives starting with simple searches, quantitative exploration, and ending with modeling. Furthermore, the digital humanities need a more thorough discussion about how the utilization of tools from information science and technologies alter the research questions posed in the humanities.

中文翻译：

使用解析和注释语料库分析芬兰议员的谈话

我们提供了一个搜索系统，用于对芬兰议会记录和前议员访谈的语法分析语料库进行搜索，用谈话结构的元数据和参与的议员进行注释，并通过精心挑选的数字人文案例研究讨论它们的使用。首先介绍语料库的构成、内容和使用原则。然后我们讨论了搜索系统和语料库的应用，以研究政治家如何谈论权力，如何在政治演讲中使用意识形态术语，以及如何识别数据中的叙述。所有案例研究都源于人文科学和社会科学中的问题，但在识别和量化感兴趣的段落时都依赖于语法分析的语料库。最后，该论文讨论了自然语言处理方法在（数字）人文学科中的问题的作用。它声称，对议会演讲和政客采访的数字人文调查不仅依赖于计算人文建模，还需要适应从简单搜索、定量探索到建模结束的一系列观点。此外，数字人文学科需要更深入地讨论信息科学和技术工具的使用如何改变人文学科提出的研究问题。但需要适应从简单搜索、定量探索到建模结束的一系列观点。此外，数字人文学科需要更深入地讨论信息科学和技术工具的使用如何改变人文学科提出的研究问题。但需要适应从简单搜索、定量探索到建模结束的一系列观点。此外，数字人文学科需要更深入地讨论信息科学和技术工具的使用如何改变人文学科提出的研究问题。

更新日期：2021-06-04

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11