Abstract
In structuralist linguistics, compounds are argued not to constitute morphological categories, due to the absence of systematic form-meaning correspondences. This study investigates subsets of compounds for which systematic form-meaning correspondences are present: adjective–noun compounds in Mandarin. We show that there are substantial differences in the productivity of these compounds. One set of productivity measures (the count of types, the count of hapax legomena, and the estimated count of unseen types) reflect compounds’ profitability. By contrast, the category-conditioned degree of productivity is found to correlate with the internal semantic transparency of the words belonging to a morphological category. Greater semantic transparency, gauged by distributional semantics, predicts greater category-conditioned productivity. This dovetails well with the hypothesis that semantic transparency is a prerequisite for a word formation process to be productive.
Funding source: China Scholarship Council
Award Identifier / Grant number: 201906230064
Acknowledgment
The authors are indebted to Chuang Yu Ying and Karlina Denistia for their comments of an earlier version of this paper. We also thank our reviewers for their constructive feedback which helped us strengthen the paper.
-
Research funding: This research was funded by the China Scholarship Council (Grant No. 201906230064).
References
Aronoff, Mark. 1976. Word formation in generative grammar. Cambridge, Mass: MIT Press.Search in Google Scholar
Aronoff, Mark. 1982. Potential words, actual words, productivity and frequency. Preprints of the plenary session papers of the XIIIth international congress of linguists, 141–148. Tokyo: International Congress of Linguists Proceedings Publishing Committee.Search in Google Scholar
Aronoff, Mark & Kirsten Fudeman. 2011. What is morphology. UK: John Wiley & Sons.10.1093/obo/9780199772810-0001Search in Google Scholar
Baayen, Harald. 1993. On frequency, transparency and productivity. Yearbook of morphology 1992, 181–208. Dordrecht: Springer.10.1007/978-94-017-3710-4_7Search in Google Scholar
Baayen, R. Harald. 1994. Productivity in language production. Language & Cognitive Processes 9. 447–469. https://doi.org/10.1080/01690969408402127.Search in Google Scholar
Baayen, R. Harald. 2001. Word frequency distributions. Dordrecht: Kluwer Academic Publishers.10.1007/978-94-010-0844-0Search in Google Scholar
Baayen, R. Harald. 2009. Corpus linguistics in morphology: Morphological productivity. In Merja Kytö & Anke Lüdeling (eds.), Corpus Linguistics. An international handbook, 900–919. Berlin: Mouton de Gruyter.Search in Google Scholar
Baayen, R. Harald & Rochelle Lieber. 1991. Productivity and English derivation: a corpus-based study. Linguistics 29. 801–843. https://doi.org/10.1515/ling.1991.29.5.801.Search in Google Scholar
Baayen, R. Harald, Richard Piepenbrock & Leon Gulikers. 1996. The CELEX lexical database (cd-rom). University of Pennsylvania.Search in Google Scholar
Baayen, R. Harald, Laura A. Janda, Tore Nesset, Anna Endresen & Anastasia Makarova. 2013. Making choices in Russian: Pros and cons of statistical methods for rival forms. Russian Linguistics 37. 253–291. https://doi.org/10.1007/s11185-013-9118-6.Search in Google Scholar
Baayen, R. Harald, Yu-Ying Chuang, Elnaz Shafaei-Bajestan & James P. Blevins. 2019. The discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de)composition but in linear discriminative learning. [Special issue]. Complexity 2019. 1–39. https://doi.org/10.1155/2019/4895891.Search in Google Scholar
Bauer, Laurie. 2001. Morphological productivity. Cambridge: Cambridge University Press.10.1017/CBO9780511486210Search in Google Scholar
Boleda, Gemma. 2020. Distributional semantics and linguistic theory. Annual Review of Linguistics 6. 213–234. https://doi.org/10.1146/annurev-linguistics-011619-030303.Search in Google Scholar
Bolinger, Dwight L. 1948. On defining the morpheme. Word 4(1). 18–23. https://doi.org/10.1080/00437956.1948.11659323.Search in Google Scholar
Booij, Geert Evert. 1977. Dutch morphology. A study of word formation in generative grammar. Dordrecht: Foris.10.1515/9783112327708Search in Google Scholar
Booij, Geert Evert. 2010. Construction morphology. Language and Linguistics Compass 4(7). 543–555. https://doi.org/10.1111/j.1749-818x.2010.00213.x.Search in Google Scholar
Boucher, Jerry & Charles E. Osgood. 1969. The pollyanna hypothesis. Journal of Verbal Learning and Verbal Behavior 8(1). 1–8. https://doi.org/10.1016/s0022-5371(69)80002-2.Search in Google Scholar
Breiman, Leo. 2001. Random forests. Machine Learning 45. 5–32. https://doi.org/10.1023/a:1010933404324.10.1023/A:1010933404324Search in Google Scholar
Ceccagno, Antonella & Bianca Basciano. 2007. Compound headedness in Chinese: An analysis of neologisms. Morphology 17(2). 207–231. https://doi.org/10.1007/s11525-008-9119-0.Search in Google Scholar
Chatterjee, Samprit & Ali Hadi. 2012. Regression analysis by example. New York: John Wiley & Sons.Search in Google Scholar
Corbin, Danielle. 1987. Morphologie derivationelle et structuration du lexique [Derivational morphology and lexical structure]. Tübingen: Niemeyer.10.1515/9783111358383Search in Google Scholar
Evert, Steven & Baroni, Marco. 2006. The zipfR library: Words and other rare events in R. Paper presentated at the useR! 2006: The second R user conference, Vienna, June 2006.Search in Google Scholar
Firth, John Rupert. 1957. Studies in linguistic analysis. Oxford: Wiley-Blackwell.Search in Google Scholar
Goldberg, Adele E. 2016. Partial productivity of linguistic constructions: Dynamic categorization and statistical preemption. Language and Cognition 8(3). 369–390. https://doi.org/10.1017/langcog.2016.17.Search in Google Scholar
Harris, Zellig S. 1954. Distributional structure. Word 10(2–3). 146–162. https://doi.org/10.1080/00437956.1954.11659520.Search in Google Scholar
Hay, Jennifer B. 2001. Lexical frequency in morphology: Is everything relative? Linguistics 39. 1041–1070. https://doi.org/10.1515/ling.2001.041.Search in Google Scholar
Hay, Jennifer B. & R. Harald Baayen. 2002. Parsing and productivity. In Geert Booij & Jaap Van Marle (eds.), Yearbook of morphology 2001, 203–235. Dordrecht: Kluwer Academic Publishers.10.1007/978-94-017-3726-5_8Search in Google Scholar
Huang, Chu-Ren, Shu-Kai Hsieh & Keh-Jiann Chen. 2017. Mandarin Chinese words and parts of speech: A corpus-based study. New York: Taylor & Francis.10.4324/9781315669014Search in Google Scholar
Huang, C. R., S. K. Hsieh, J. F. Hong, Y. Z. Chen, Y. L. Su, Y. X. Chen & S. W. Huang. 2010. 中文词汇网络: 跨语言知识处理基础架构的设计理念与实践 [Chinese Wordnet: Design, Implementation, and Application of an Infrastructure for Cross-lingual Knowledge Processing]. 中国语文[Study of Chinese Language] 24(2). 14–23.Search in Google Scholar
Kastovsky, Dieter. 1986. Productivity in word formation. Linguistics 24. 585–600. https://doi.org/10.1515/ling.1986.24.3.585.Search in Google Scholar
Kennedy, Christopher. 1998. On the monotonicity of polar adjectives. Amsterdam: John Benjamins Publishing.Search in Google Scholar
Kruisinga, Etsko. 1932. A handbook of present-day English, Part II: English accidence and syntax. Utrecht: Noordhoff.Search in Google Scholar
Landauer, Thomas & Susan Dumais. 1997. A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review 104(2). 211–240. https://doi.org/10.1037/0033-295x.104.2.211.Search in Google Scholar
Lieber, Rochelle. 2010. Introducing morphology. Cambridge, UK: Cambridge University Press.Search in Google Scholar
Matlin, Margaret W. 2016. Pollyanna principle. In Rüdiger Pohl (ed.), Cognitive illusions: Intriguing phenomena in thinking, judgment and memory, 315–335. London: Routledge.Search in Google Scholar
McDonald, Scott & Richard Shillcock. 2001. Rethinking the word frequency effect: The neglected role of distributional information in lexical processing. Language and Speech 44. 295–323. https://doi.org/10.1177/00238309010440030101.Search in Google Scholar
Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg S. Corrado & Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. arXiv preprint arXiv:1310.4546, 1–9. https://arxiv.org/abs/1310.4546v1.Search in Google Scholar
Perek, Florent. 2018. Recent change in the productivity and schematicity of the way-construction: A distributional semantic analysis. Corpus Linguistics and Linguistic Theory 14(1). 65–97. https://doi.org/10.1515/cllt-2016-0014.Search in Google Scholar
Perek, Florent & Martin Hilpert. 2017. A distributional semantic approach to the periodization of change in the productivity of constructions. International Journal of Corpus Linguistics 22(4). 490–520. https://doi.org/10.1075/ijcl.16128.per.Search in Google Scholar
Plag, Ingo. 2003. Word formation in English. Cambridge, UK: Cambridge University Press.10.1017/CBO9780511841323Search in Google Scholar
Plag, Ingo. 2010. Compound stress assignment by analogy: The constituent family bias. Zeitschrift für Sprachwissenschaft 29(2). 243–282. https://doi.org/10.1515/zfsw.2010.009.Search in Google Scholar
R Core Team. 2013. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. http://www.R-project.org/.Search in Google Scholar
Riddle, Elizabeth. 1985. A historical perspective on the productivity of the suffixes -ness and -ity. In Jacek Fisiak (ed.), Historical semantics, historical word-formation, 435–461. New York: Mouton.10.1515/9783110850178.435Search in Google Scholar
Sahlgren, Magnus. 2001. Vector-based semantic analysis: Representing word meanings based on random labels. In ESSLI Workshop on Semantic Knowledge Acquistion and Categorization. Helsinki: Kluwer Academic Publishers.Search in Google Scholar
Schultink, Henk. 1961. Produktiviteit als morfologisch fenomeen [Productivity as a morphological phenomenon]. Forum der Letteren 2. 110–125.Search in Google Scholar
Shaoul, Cyrus & Chris Westbury. 2010. Exploring lexical co-occurrence space using HiDEx. Behavior Research Methods 42(2). 393–413. https://doi.org/10.3758/brm.42.2.393.Search in Google Scholar
Sichel, Herbert S. 1986. Word frequency distributions and type-token characteristics. The Mathematical Scientist 11. 45–72.Search in Google Scholar
Song, Yan, Shuming Shi, Jing Li & Haisong Zhang. 2018. Directional skip-gram: Explicitly distinguishing left and right context for word embeddings. Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: Human language technologies, Volume 2 (Short Papers), 175–180. New Orleans: Association for Computational Linguistics.10.18653/v1/N18-2028Search in Google Scholar
Torsten Hothorn, Kurt Hornik & Achim Zeileis. 2006. Unbiased recursive partitioning: A conditional inference framework. Journal of Computational & Graphical Statistics 15. 651–674. https://doi.org/10.1198/106186006x133933.Search in Google Scholar
Tse, Chi-Shing, Melvin J. Yap, Yuen-Lai Chan, Wei Ping Sze, Cyrus Shaoul & Dan Lin. 2017. The Chinese lexicon project: A megastudy of lexical decision performance for 25,000+ traditional Chinese two-character compound words. Behavior Research Methods 49(4). 1503–1519. https://doi.org/10.3758/s13428-016-0810-5.Search in Google Scholar
Wood, Simon N. 2017. Generalized additive models: An introduction with R. Boca Raton: CRC Press.10.1201/9781315370279Search in Google Scholar
Xu, Zheng. 2018. The word status of Chinese adjective-noun combinations. Linguistics 56(1). 207–256. https://doi.org/10.1515/ling-2017-0035.Search in Google Scholar
Xu, Zheng. 2019. Chinese adjective-noun combinations. In Franz Rainer, Francesco Gardani, Wolfgang U. Dressler & Hans LuschützkyChristian (eds.), Competition in inflection and word-formation, 5, 307–334. Springer.10.1007/978-3-030-02550-2_12Search in Google Scholar
Supplementary Material
The online version of this article offers supplementary material (https://doi.org/10.1515/cllt-2020-0059).
© 2021 Walter de Gruyter GmbH, Berlin/Boston