Disentangling contextual diversity: Communicative need as a lexical organizer.
Psychological Review (IF 5.4), Pub Date: 2021-02-11, DOI: 10.1037/rev0000265
Brendan T Johns

Contextual diversity (CD; Adelman, Brown, & Quesada, 2006) modifies word frequency by ignoring word repetition in context. It has been repeatedly found that a CD count provides a better fit to lexical organization data than does word frequency (e.g., Adelman & Brown, 2008; Brysbaert & New, 2009). The importance of CD has been interpreted with the principle of likely need, adapted from the rational analysis of memory (Anderson & Schooler, 1991), which states that words that have been used in many past contexts are more likely to be needed in a future context. Central to the cognitive mechanisms of computing likely need is a definition of linguistic context itself. Typically, linguistic context is defined by relatively small units of language, such as a document within a corpus. However, recent research has demonstrated that larger definitions of context, some spanning tens or hundreds of thousands of words, provide a better accounting of lexical organization data (Johns, Dye, & Jones, 2020). This article attempts to redefine the notion of linguistic context by using socially based contextual measures, derived from the online communication patterns of hundreds of thousands of individuals from the discussion forum Reddit, consisting of over 55 billion words. Multiple count-based and semantic diversity models of contextual diversity were derived from this data. The results demonstrate that the communication patterns of individuals across discourses provide the best accounting of lexical organization data, indicating that classic notions of using local linguistic context to update a word's strength in the lexicon need to be reevaluated. (PsycInfo Database Record (c) 2021 APA, all rights reserved).
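
As a rough illustration of the distinction the abstract draws, the sketch below contrasts a raw word-frequency count with a count-based CD measure on a toy corpus. It is not the article's implementation: the function names, the `(author, text)` data layout, the whitespace tokenizer, and the `by_author` option (a hypothetical stand-in for a larger, socially defined context) are all assumptions made for this example.

```python
from collections import Counter


def word_frequency(docs):
    """Count every token occurrence, repetitions included."""
    counts = Counter()
    for _, text in docs:
        counts.update(text.lower().split())
    return counts


def contextual_diversity(docs, by_author=False):
    """Count the number of distinct contexts that contain each word.

    A context is one document by default. With by_author=True, all text
    from one author is pooled into a single context (an assumed, simplified
    version of a socially based context definition).
    """
    contexts = {}
    for i, (author, text) in enumerate(docs):
        key = author if by_author else i
        # A set drops repetitions of a word within the same context.
        contexts.setdefault(key, set()).update(text.lower().split())
    counts = Counter()
    for words in contexts.values():
        counts.update(words)
    return counts


# Toy corpus of (author, text) pairs, invented for illustration.
docs = [
    ("ann", "the goal the goal the goal"),
    ("ann", "the match"),
    ("bob", "the weather"),
]

wf = word_frequency(docs)
cd_doc = contextual_diversity(docs)
cd_author = contextual_diversity(docs, by_author=True)

for word in ("the", "goal"):
    print(word, wf[word], cd_doc[word], cd_author[word])
```

In this toy corpus "goal" occurs three times but in only one document, so its frequency overstates how widely it is used, while "the" appears in every document and for both authors. Widening the context from documents to authors changes the counts again, which is the kind of choice the abstract argues matters.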

Updated: 2021-02-11