当前位置: X-MOL 学术Critical Inquiry › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
The Computational Case against Computational Literary Studies
Critical Inquiry ( IF 2.0 ) Pub Date : 2019-03-01 , DOI: 10.1086/702594
Nan Z. Da

1 This essay works at the empirical level to isolate a series of technical problems, logical fallacies, and conceptual flaws in an increasingly popular subfield in literary studies variously known as cultural analytics, literary data mining, quantitative formalism, literary text mining, computational textual analysis, computational criticism, algorithmic literary studies, social computing for literary studies, and computational literary studies (the phrase I use here). In a nutshell the problem with computational literary analysis as it stands is that what is robust is obvious (in the empirical sense) and what is not obvious is not robust, a situation not easily overcome given the nature of literary data and the nature of statistical inquiry. There is a fundamental mismatch between the statistical tools that are used and the objects to which they are applied. Digital humanities (DH), a field of study which can encompass subjects as diverse as histories of media and early computational practices, the digitization of texts for open access, digital inscription and mediation, and computational linguistics and lexicology, and technical papers on data mining, is not the object of my critique. Rather, I am addressing specifically the project of running computer programs on large (or usually not so large) corpora of literary texts to yield quantitative results which are then mapped, graphed, and tested for statistical significance and used

中文翻译:

反对计算文学研究的计算案例

1 本文在实证层面上工作,以隔离文学研究中日益流行的子领域中的一系列技术问题、逻辑谬误和概念缺陷,这些子领域被称为文化分析、文学数据挖掘、定量形式主义、文学文本挖掘、计算文本分析、计算批评、算法文学研究、文学研究的社会计算和计算文学研究(我在这里使用的短语)。简而言之,计算文学分析的问题在于,稳健的东西是显而易见的(在经验意义上),而不明显的东西不稳健,鉴于文学数据的性质和统计的性质,这种情况不容易克服。询问。使用的统计工具与其应用的对象之间存在根本性的不匹配。数字人文 (DH),一个研究领域,可以涵盖多种学科,如媒体历史和早期计算实践、开放获取的文本数字化、数字铭文和中介、计算语言学和词汇学,以及关于数据挖掘的技术论文,不是我批评的对象。相反,我正在专门讨论在大型(或通常不是那么大)文学文本语料库上运行计算机程序以产生定量结果的项目,然后将这些结果映射、绘图和测试统计显着性并使用 开放获取的文本数字化、数字铭文和中介、计算语言学和词汇学,以及数据挖掘技术论文,不是我批评的对象。相反,我正在专门讨论在大型(或通常不是那么大)文学文本语料库上运行计算机程序以产生定量结果的项目,然后将这些结果映射、绘图和测试统计显着性并使用 开放获取的文本数字化、数字铭文和中介、计算语言学和词汇学,以及数据挖掘技术论文,不是我批评的对象。相反,我正在专门讨论在大型(或通常不是那么大)文学文本语料库上运行计算机程序以产生定量结果的项目,然后将这些结果映射、绘图和测试统计显着性并使用
更新日期:2019-03-01
down
wechat
bug