Abstract
In 1985, Zwicky argued that ‘particle’ is a pretheoretical notion that should be eliminated from linguistic analysis. We propose a reclassification of Russian particles that implements Zwicky’s directive. Russian particles lack a coherent conceptual basis as a category and many are ambiguous with respect to part of speech. Our corpus analysis of Russian particles addresses theoretical questions about the cognitive status of parts of speech and practical concerns about how particles should be represented in computational models. We focus on nine high-frequency words commonly classed as particles: ešče, tak, ved’, slovno, daže, že, li, da, net. We show that the current tagging of particles in the manually disambiguated Morphological Standard of the Russian National Corpus is not entirely consistent, and that this can create challenges for training a part-of-speech tagger. We offer an alternative tagging scheme that eliminates the category of ‘particle’ altogether. We show that our enriched scheme makes it possible for a part-of-speech tagger to achieve more useful results. Our analysis of particles provides a detailed account of various sub-uses that correspond to different parts of speech, their relationships, and relative distribution. In this sense, our study also contributes to the study of words that exhibit part-of-speech ambiguities.
Аннотация
В работе 1985 года Цвикки утверждал, что ‘частица’—это до-теоретическое понятие, которое нужно исключить из лингвистического анализа. Следуя установке Цвикки, мы предлагаем пересмотреть традиционный подход к русским частицам и перераспределить соответствующие слова по другим частеречным классам. Ясные содержательные основания для выделения русских частиц как отдельной категории отсутствуют, частеречная принадлежность многих частиц неоднозначна. В нашем корпусном исследовании рассмотрены теоретические вопросы о когнитивном статусе частей речи, а также практические сложности, связанные с представлением частиц в компьютерных моделях обработки данных. В центре внимания девять высокочастотных слов, традиционно определяемых как частицы: еще, так, ведь, словно, даже, же, ли, да, нет. В статье показано, что существующая система частеречной разметки, принятая в Морфологическом стандарте Национального корпуса русского языка (тексты со снятой омонимией), недостаточно последовательна и что это может создать проблемы при обучении частеречного анализатора. В статье предложена альтернативная система разметки, в которой категория ‘частиц’ как отдельной части речи полностью устранена. Благодаря этой улучшенной системе разметки частеречный анализатор может функционировать более успешно. В статье представлен подробный анализ девяти ‘частиц’ с разбором основных подтипов их употреблений, которые соответствуют различным частям речи, также обсуждаются взаимосвязи выделенных подтипов и их распределение в использованной выборке примеров. В этом отношении, данное исследование вносит вклад в изучение слов с неоднозначной частеречной принадлежностью.
Similar content being viewed by others
Notes
Since the meanings of these particles are heavily dependent on context, in this article they are not translated when cited out of context.
Abbreviations used in this paper: A—adjective, A-NUM—numeral adjective, A-PRO—pronominal adjective, ADV—adverb, ADV-PRO—pronominal adverb, CNJ—conjunction, CNJADV—adverbial conjunction, CNJCOO—coordinating conjunction, CNJSUB—subordinating conjunction, EMPH—emphasizer, INTJ—interjection, NEST—negative predicative of existence, NUM—numeral, PARENTH—parenthetical, PART—particle, PR—preposition, PRAEDIC—predicative, PRAEDIC-PRO—predicative pronoun, QST—question word, S—substantive, S-PRO—pronoun, V—verb, VMOD—modal verb.
Even in the case of verbs and pronouns, one can find marginal examples of lexemes that are uninflected, such as the verbal interjective forms xvat’ (< xvatat’ ‘grab’), pryg (< prygat’ ‘jump’), spaten’ki (< spat’ ‘sleep’) (Klobukov 2001); adverbial pronouns tak ‘so’, tam ‘there’, tut ‘here’, gde ‘where’, kogda ‘when’.
Most of the examples we cite are taken from the database used for our experiments (The Tromsø Repository of Language and Linguistics, http://hdl.handle.net/10037.1/10291). Because our experiments were run on examples containing only one token of the nine lexemes in our study, example (48) was excluded from the database because it contained more than one token. In addition, examples (5) and (49) illustrate low frequent or obsolete uses of particles that were not represented in our database.
Note that in this example, Zaliznjak is citing Fomenko’s words.
Kasatkina (2004, pp. 73–74) shows that as opposed to Standard Russian, in Russian dialects že often bears stress and escapes vowel reduction, and this takes place in certain specific uses not found in Standard Russian: e.g. dialectal že can be semantically equivalent to the conjunction i ‘and’, as in this example from the Vologodskaja oblast’: V magazin prjaniki privezli, konfety že ‘Spice cookies have been brought to the store, and candy (too)’.
Note that an alternative interpretation for examples (40) and (41) is that že appears in Wackernagel position and emphasizes the entire clause.
References
Andersen, G. & Fretheim, T. (Eds.) (2000). Pragmatic markers and propositional attitude (Pragmatics & Beyond. New Series, 79). Amsterdam, Philadelphia.
Bartoševič, A. (1978). Časticy i leksikografičeskaja praktika. Slavia orientalis, XXVII(3), 331–334.
Bogorodickij, V. A. (1939). Očerki po jazykovedeniju i russkomu jazyku. Moskva.
Brinton, L. J. (1996). Pragmatic markers in English. Grammaticalization and discourse functions (Topics in English Linguistics, 19). Berlin.
Chomsky, N. (1965). Aspects of the theory of syntax. Cambridge.
Croft, W. (2001). Radical Construction Grammar. Syntactic theory in typological perspective. Oxford.
Dedaić, M., & Mišković-Luković, M. (2010). South Slavic discourse particles: introduction. In M. Dedaić & M. Mišković-Luković (Eds.), South Slavic discourse particles (Pragmatics & Beyond New Series, 197, pp. 1–22). Amsterdam.
Drummen, A. (2015). Dramatic pragmatics. A discourse approach to particle use in ancient Greek tragedy and comedy (Doctoral dissertation). Ruprecht-Karls-Universität Heidelberg, Heidelberg.
Erelt, M., Erelt, T., & Ross, K. (2000). Eesti keele käsiraamat. Tallin.
Fries, Ch. C. (1952). The structure of English. New York.
Grišina, E. A., & Savčuk, S. O. (2009). Korpus ustnyx tekstov v Nacional’nom korpuse russkogo jazyka: sostav i struktura. In Nacional’nyj korpus russkogo jazyka. Novye rezul’taty i perspektivy. Available at http://ruscorpora.ru/corpora-biblio-2008.html.
Grišina, E. A., & Ljaševskaja, O. N. (2008). Grammatičeskij slovar’ novyx slov russkogo jazyka. Available at http://dict.ruslang.ru/gram.php.
Halácsy, P., Kornai, A., & Oravecz, C. (2007). HunPos: an open source trigram tagger. In ACL 2007. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Interactive Poster and Demonstration Sessions, June 25–27, 2007, Prague (pp. 209–212). Stroudsburg.
Harris, Z. S. (1951). Methods in structural linguistics. Chicago.
Heinrichs, W. (1981). Die Modalpartikeln im Deutschen und Schwedischen (Linguistische Arbeiten, 101). Tübingen.
Kasatkina, R. F. (2004). Častica že v roli tekstovogo konnektora (na materiale russkoj dialektnoj reči). In T. M. Nikolaeva (Ed.), Verbal’naja i neverbal’naja opory prostranstva mežfrazovyx svjazej (pp. 71–82). Moskva.
Klobukov, E. V. (2001). Analitičeskie glagoly v russkom jazyke. In S. M. Kuz’mina (Ed.), Žizn’ jazyka. Sbornik statej k 80-letiju Mixaila Viktoroviča Panova (pp. 77–87). Moskva.
Kuznecov, S. A. (Ed.) (1998). Bol’šoj tolkovyj slovar’ russkogo jazyka. Sankt-Peterburg.
Langacker, R. W. (2013). Essentials of cognitive grammar. Oxford.
Makarova, A. (2015). One type of verbal diminutives in Russian: verbs ending in -n’kat’. Russian Linguistics, 39(1), 15–31.
Malyj akademičeskij slovar’ (1999). Slovar’ russkogo jazyka v četyrex tomax. Moskva.
Manning, C. D. (2011). Part-of-speech tagging from 97 % to 100 %: is it time for some linguistics? In A. Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing. 12th International Conference, CICLing 2011, Tokyo, Japan, February 2011. Proceedings, Part I (Lecture Notes in Computer Science, 6608, pp. 171–189). Berlin.
McCoy, S. (2003a). Unifying the meaning of multifunctional particles: the case of Russian ŽE. University of Pennsylvania Working Papers in Linguistics, 9(1), 123–135.
McCoy, S. (2003b). Connecting information structure and discourse structure through “Kontrast”: the case of colloquial Russian particles – TO, ŽE, and VED’. Journal of Logic, Language and Information, 12, 319–335.
Minčenkov, A. G. (2001). Russkie časticy v perevode na anglijskij jazyk. Sankt-Peterburg.
Miot, G. (1987). Il mio primo dizionario. Firenze.
Nikolaeva, T. M. (1985). Funkcii častic v vyskazyvanii (na materiale slavjanskix jazykov). Moskva.
Percov, N. V. (2002). O vozmožnom semantičeskom invariante russkix frazovyx častic uže i ešče. In N. D. Arutjunova (Ed.), Logičeskij analiz jazyka. Semantika načala i konca (pp. 137–145). Moskva.
Plungjan, V. A. (1987). Ocenka verojatnosti v značenii časticy že (k formalizacii semantičeskogo opisanija služebnyx slov). Naučno-texničeskaja informacija. Serija 2: Informacionnye processy i sistemy, 8, 36–40.
Russian National Corpus (RNC). Available at www.ruscorpora.ru.
Šaxmatov, A. A. (1941). Sintaksis russkogo jazyka. Leningrad.
Serianni, L., & Castelvecchi, A. (1997). Italiano: grammatica, sintassi, dubbi. Milano.
Sičinava, D. V. (2005). Obrabotka tekstov s grammatičeskoj razmetkoj: instrukcija razmetčika. Retrieved from http://ruscorpora.ru/sbornik2005/09sitch.pdf (1 March 2016).
Starodumova, E. A. (1997). Russkie časticy (pis’mennaja monologičeskaja reč’) (avtoreferat dissertacii). Moskva.
Švedova, N. Ju. (Ed.) (1980). Russkaja grammatika. Tom I: Fonetika, fonologija, udarenie, intonacija, slovoobrazovanie, morfologija. Moskva.
Tauli, V. (1972). Eesti grammatika (Vol. I). Uppsala.
Timberlake, A. (2004). A reference grammar of Russian. Cambridge.
Vasilyeva, A. N. (1972). Particles in colloquial Russian (manual for English-speaking students of Russian). Moscow.
Vikul’ceva, N. (2004). Semantičeskie, sintaksičeskie i pragmatičeskie osobennosti leksemy VED’ (Master’s thesis). University of Tartu, Tartu.
Vinogradov, V. V. (1972). Russkij jazyk. Grammatičeskoe učenie o slove. Moskva.
Voejkova, M. D. (2009). Problemy ispol’zovanija podkorpusa ustnoj razgovornoj reči (na primere analiza russkix diminutivov). In Nacional’nyj korpus russkogo jazyka. Novye rezul’taty i perspektivy. Available at http://ruscorpora.ru/corpora-biblio-2008.html.
Wierzbicka, A. (1976). Particles and linguistic relativity. International Review of Slavic Linguistics, 1(2–3), 327–367.
Wierzbicka, A. (1988). The semantics of grammar (Studies in Language Companion Series, 18). Amsterdam, Philadelphia.
Wierzbicka, A. (1992). Semantics, culture, and cognition. Universal human concepts in culture-specific configurations. New York, Oxford.
Zaliznjak, A. A. (1980). Grammatičeskij slovar’ russkogo jazyka. Moskva.
Zwicky, A. M. (1985). Clitics and particles. Language, 61(2), 283–305.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported in part by grant 222506 from the Norwegian Research Council. The authors would also like to thank their employer, UiT The Arctic University of Norway.
Appendix
Please note that all of the numbers given in the tables in the Appendix are cited in percentages.
Appendix
Rights and permissions
About this article
Cite this article
Endresen, A., Janda, L.A., Reynolds, R. et al. Who needs particles? A challenge to the classification of particles as a part of speech in Russian. Russ Linguist 40, 103–132 (2016). https://doi.org/10.1007/s11185-016-9160-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11185-016-9160-2