当前位置: X-MOL 学术Digit. Scholarsh. Hum.it. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A case study in text mining: Textual analysis of the Territorial Papers
Digital Scholarship in the Humanities ( IF 0.7 ) Pub Date : 2019-02-19 , DOI: 10.1093/llc/fqz007
Johannes Ledolter 1 , Lea VanderVelde 2
Affiliation  

The Territorial Papers of the United States are a valuable and underused resource containing almost 10,000 documents written between 1789 and 1848 about the formation of new sovereign states from US territory. These communications between the federal government and frontier settlers comprise the actual discourse of the nation’s expansion over six decades. Digitizing the Territorial Papers permits the possibility of analyzing the entire corpus globally. Text mining and topic modeling methods give us a lens on the language patterns through which new state governments and the expanding nation were formed. An initial statistical analysis of the textual information provides a visualization of content, helps discern how ideals about governance emerged, and lays the foundation for developing more sophisticated hypotheses and theoretical constructs.

中文翻译:

文本挖掘中的案例研究:区域论文的文本分析

美国的《国土文件》是有价值且未被充分利用的资源,其中包含将近10,000份在1789年至1848年之间就从美国领土形成新的主权国家而撰写的文件。联邦政府与边境定居者之间的这些交流构成了六十年来该国扩张的实际论述。对领土论文进行数字化处理,就可以在全球范围内分析整个语料库。文本挖掘和主题建模方法使我们对形成新州政府和扩展国家的语言模式有了一个了解。文本信息的初步统计分析提供了内容的可视化,有助于识别关于治理的理想是如何出现的,并为开发更复杂的假设和理论构造奠定了基础。
更新日期:2019-02-19
down
wechat
bug