The Croatian psycholinguistic database: Estimates for 6000 nouns, verbs, adjectives and adverbs
Behavior Research Methods (IF 5.953), Pub Date: 2021-04-26, DOI: 10.3758/s13428-020-01533-x
Anita Peti-Stantić 1, 2 , Maja Anđel 1, 3 , Vedrana Gnjidić 1 , Gordana Keresteš 1, 4 , Nikola Ljubešić 1, 5, 6 , Irina Masnikosa 1 , Mirjana Tonković 1, 4 , Jelena Tušek 1, 2 , Jana Willer-Gold 7 , Mateusz-Milan Stanojević 1, 8

Psycholinguistic databases containing ratings of concreteness, imageability, age of acquisition, and subjective frequency are used in psycholinguistic and neurolinguistic studies which require words as stimuli. Linguistic characteristics (e.g. word length, corpus frequency) are frequently coded, but word class is seldom systematically treated, although there are indications of its significance for imageability and concreteness. This paper presents the Croatian Psycholinguistic Database (CPD; available at: https://doi.org/10.17234/megahr.2019.hpb), containing 6000 Croatian nouns, verbs, adjectives and adverbs, rated for concreteness, imageability, age of acquisition, and subjective frequency. Moreover, we present computationally obtained extrapolations of concreteness and imageability to the remainder of the Croatian lexicon (available at: https://github.com/megahr/lexicon/blob/master/predictions/hr_c_i.predictions.txt). In the two studies presented here, we explore the significance of word class for concreteness and imageability in human and computationally obtained ratings. The observed correlations in the CPD indicate correspondences between psycholinguistic measures expected from the literature. Word classes exhibit differences in subjective frequency, age of acquisition, concreteness and imageability, with significant differences between nouns, verbs, adjectives and adverbs. In the computational study which focused on concreteness and imageability, concreteness obtained higher correlations with human ratings than imageability, and the system underpredicted the concreteness of nouns, and overpredicted the concreteness of adjectives and adverbs. Overall, this suggests that word class contains schematic conceptual and distributional information. Schematic conceptual content seems to be more significant in human ratings of concreteness and less significant in computationally obtained ratings, where distributional information seems to play a more significant role. 
This suggests that word class differences should be theoretically explored.
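As a rough illustration of how the over- and underprediction pattern described above could be checked, the Python sketch below computes the mean signed error (predicted minus human rating) per word class, plus a Pearson correlation between human and predicted ratings. The function names, the tuple layout, and the toy values are assumptions made for illustration only. They are not taken from the CPD, from the predictions file, or from the authors' pipeline.

```python
from collections import defaultdict
from statistics import mean


def mean_signed_error_by_class(rows):
    """Return {word_class: mean(predicted - human)}.

    rows is an iterable of (word_class, human_rating, predicted_rating)
    tuples; this layout is a hypothetical one chosen for the sketch.
    A negative value means the system underpredicts that class on average.
    """
    errors = defaultdict(list)
    for word_class, human, predicted in rows:
        errors[word_class].append(predicted - human)
    return {wc: mean(errs) for wc, errs in errors.items()}


def pearson(xs, ys):
    """Plain Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


if __name__ == "__main__":
    # Invented toy values, NOT real CPD ratings.
    toy_rows = [
        ("noun", 4.0, 3.5),
        ("noun", 5.0, 4.5),
        ("adjective", 2.0, 2.5),
    ]
    print(mean_signed_error_by_class(toy_rows))
    humans = [human for _, human, _ in toy_rows]
    predictions = [predicted for _, _, predicted in toy_rows]
    print(pearson(humans, predictions))
```

With real data, the rows would come from joining the human ratings with the computed predictions on the word form. How the two files are actually structured is not stated in the abstract, so that join is left out here.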




Updated: 2021-04-27