当前位置: X-MOL 学术Digit. Scholarsh. Hum.it. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Patterns in language: Text analysis of government reports on the Irish industrial school system with word embedding
Digital Scholarship in the Humanities ( IF 1.299 ) Pub Date : 2019-04-10 , DOI: 10.1093/llc/fqz012
Susan Leavy 1 , Mark T Keane 1 , Emilie Pine 1
Affiliation  

Industrial Memories is a digital humanities initiative to supplement close readings of a government report with new distant readings, using text analytics techniques. The Ryan Report (2009), the official report of the Commission to Inquire into Child Abuse (CICA), details the systematic abuse of thousands of children 15 from 1936 to 1999 in residential institutions run by religious orders and funded and overseen by the Irish State. Arguably, the sheer size of the Ryan Report—over 1 million words— warrants a new approach that blends close readings to witness its findings, with distant readings that help surface system-wide findings embedded in the Report. Although CICA has been lauded internationally for 20 its work, many have critiqued the narrative form of the Ryan Report, for obfuscating key findings and providing poor systemic, statistical summaries that are crucial to evaluating the political and cultural context in which the abuse took place (Keenan, 2013, Child Sexual Abuse and the Catholic Church: Gender, Power, and Organizational Culture. Oxford University Press). In this article, we concentrate on describing the distant reading methodology we adopted, using machine learning and text-analytic methods and report on what they surfaced from the

中文翻译:

语言模式:政府对带有字词嵌入的爱尔兰工业学校系统的报告进行文字分析

工业回忆是一项数字人文科学计划,旨在使用文本分析技术用新的遥距读数来补充政府报告的近距读数。《赖安报告》(2009年)是调查虐待儿童委员会(CICA)的正式报告,详细介绍了1936年至1999年期间在爱尔兰政府资助和监督下的宗教机构经营的住宅机构中成千上万的儿童15受到系统性虐待的情况。 。可以说,《瑞安报告》的庞大规模(超过100万字)保证了一种新方法,该方法融合了近距离阅读以见证其发现,而远距离阅读则有助于在报告中嵌入整个系统的发现。尽管CICA的20项工作在国际上受到赞誉,但许多人批评Ryan报告的叙述形式,以掩盖关键发现并提供较差的系统性,统计摘要对于评估发生虐待的政治和文化背景至关重要(Keenan,2013年,《儿童性虐待和天主教:性别,权力和组织文化》,牛津大学出版社)。在本文中,我们将重点介绍采用机器学习和文本分析方法的远程阅读方法,并报告它们从书本中浮现出来的内容。
更新日期:2019-04-10
down
wechat
bug