Adjective–noun compounds in Mandarin: a study on productivity

Tian Shen; R. Harald Baayen

doi:10.1515/cllt-2020-0059

Published by De Gruyter Mouton March 10, 2021

Adjective–noun compounds in Mandarin: a study on productivity

Tian Shen and R. Harald Baayen

From the journal Corpus Linguistics and Linguistic Theory

https://doi.org/10.1515/cllt-2020-0059

Showing a limited preview of this publication:

Abstract

In structuralist linguistics, compounds are argued not to constitute morphological categories, due to the absence of systematic form-meaning correspondences. This study investigates subsets of compounds for which systematic form-meaning correspondences are present: adjective–noun compounds in Mandarin. We show that there are substantial differences in the productivity of these compounds. One set of productivity measures (the count of types, the count of hapax legomena, and the estimated count of unseen types) reflect compounds’ profitability. By contrast, the category-conditioned degree of productivity is found to correlate with the internal semantic transparency of the words belonging to a morphological category. Greater semantic transparency, gauged by distributional semantics, predicts greater category-conditioned productivity. This dovetails well with the hypothesis that semantic transparency is a prerequisite for a word formation process to be productive.

keywords: distributional semantics; Mandarin adjective–noun compounds; morphological category; morphological productivity; semantic transparency

Corresponding author: Tian Shen, School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China, E-mail: sweetilovefreedom@126.com

Funding source: China Scholarship Council

Award Identifier / Grant number: 201906230064

Acknowledgment

The authors are indebted to Chuang Yu Ying and Karlina Denistia for their comments of an earlier version of this paper. We also thank our reviewers for their constructive feedback which helped us strengthen the paper.

Research funding: This research was funded by the China Scholarship Council (Grant No. 201906230064).

References

Aronoff, Mark. 1976. Word formation in generative grammar. Cambridge, Mass: MIT Press.Search in Google Scholar

Aronoff, Mark. 1982. Potential words, actual words, productivity and frequency. Preprints of the plenary session papers of the XIIIth international congress of linguists, 141–148. Tokyo: International Congress of Linguists Proceedings Publishing Committee.Search in Google Scholar

Aronoff, Mark & Kirsten Fudeman. 2011. What is morphology. UK: John Wiley & Sons.10.1093/obo/9780199772810-0001Search in Google Scholar

Baayen, Harald. 1993. On frequency, transparency and productivity. Yearbook of morphology 1992, 181–208. Dordrecht: Springer.10.1007/978-94-017-3710-4_7Search in Google Scholar

Baayen, R. Harald. 1994. Productivity in language production. Language & Cognitive Processes 9. 447–469. https://doi.org/10.1080/01690969408402127.Search in Google Scholar

Baayen, R. Harald. 2001. Word frequency distributions. Dordrecht: Kluwer Academic Publishers.10.1007/978-94-010-0844-0Search in Google Scholar

Baayen, R. Harald. 2009. Corpus linguistics in morphology: Morphological productivity. In Merja Kytö & Anke Lüdeling (eds.), Corpus Linguistics. An international handbook, 900–919. Berlin: Mouton de Gruyter.Search in Google Scholar

Baayen, R. Harald & Rochelle Lieber. 1991. Productivity and English derivation: a corpus-based study. Linguistics 29. 801–843. https://doi.org/10.1515/ling.1991.29.5.801.Search in Google Scholar

Baayen, R. Harald, Richard Piepenbrock & Leon Gulikers. 1996. The CELEX lexical database (cd-rom). University of Pennsylvania.Search in Google Scholar

Baayen, R. Harald, Laura A. Janda, Tore Nesset, Anna Endresen & Anastasia Makarova. 2013. Making choices in Russian: Pros and cons of statistical methods for rival forms. Russian Linguistics 37. 253–291. https://doi.org/10.1007/s11185-013-9118-6.Search in Google Scholar

Baayen, R. Harald, Yu-Ying Chuang, Elnaz Shafaei-Bajestan & James P. Blevins. 2019. The discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de)composition but in linear discriminative learning. [Special issue]. Complexity 2019. 1–39. https://doi.org/10.1155/2019/4895891.Search in Google Scholar

Bauer, Laurie. 2001. Morphological productivity. Cambridge: Cambridge University Press.10.1017/CBO9780511486210Search in Google Scholar

Boleda, Gemma. 2020. Distributional semantics and linguistic theory. Annual Review of Linguistics 6. 213–234. https://doi.org/10.1146/annurev-linguistics-011619-030303.Search in Google Scholar

Bolinger, Dwight L. 1948. On defining the morpheme. Word 4(1). 18–23. https://doi.org/10.1080/00437956.1948.11659323.Search in Google Scholar

Booij, Geert Evert. 1977. Dutch morphology. A study of word formation in generative grammar. Dordrecht: Foris.10.1515/9783112327708Search in Google Scholar

Booij, Geert Evert. 2010. Construction morphology. Language and Linguistics Compass 4(7). 543–555. https://doi.org/10.1111/j.1749-818x.2010.00213.x.Search in Google Scholar

Boucher, Jerry & Charles E. Osgood. 1969. The pollyanna hypothesis. Journal of Verbal Learning and Verbal Behavior 8(1). 1–8. https://doi.org/10.1016/s0022-5371(69)80002-2.Search in Google Scholar

Breiman, Leo. 2001. Random forests. Machine Learning 45. 5–32. https://doi.org/10.1023/a:1010933404324.10.1023/A:1010933404324Search in Google Scholar

Ceccagno, Antonella & Bianca Basciano. 2007. Compound headedness in Chinese: An analysis of neologisms. Morphology 17(2). 207–231. https://doi.org/10.1007/s11525-008-9119-0.Search in Google Scholar

Chatterjee, Samprit & Ali Hadi. 2012. Regression analysis by example. New York: John Wiley & Sons.Search in Google Scholar

Corbin, Danielle. 1987. Morphologie derivationelle et structuration du lexique [Derivational morphology and lexical structure]. Tübingen: Niemeyer.10.1515/9783111358383Search in Google Scholar

Evert, Steven & Baroni, Marco. 2006. The zipfR library: Words and other rare events in R. Paper presentated at the useR! 2006: The second R user conference, Vienna, June 2006.Search in Google Scholar

Firth, John Rupert. 1957. Studies in linguistic analysis. Oxford: Wiley-Blackwell.Search in Google Scholar

Goldberg, Adele E. 2016. Partial productivity of linguistic constructions: Dynamic categorization and statistical preemption. Language and Cognition 8(3). 369–390. https://doi.org/10.1017/langcog.2016.17.Search in Google Scholar

Harris, Zellig S. 1954. Distributional structure. Word 10(2–3). 146–162. https://doi.org/10.1080/00437956.1954.11659520.Search in Google Scholar

Hay, Jennifer B. 2001. Lexical frequency in morphology: Is everything relative? Linguistics 39. 1041–1070. https://doi.org/10.1515/ling.2001.041.Search in Google Scholar

Hay, Jennifer B. & R. Harald Baayen. 2002. Parsing and productivity. In Geert Booij & Jaap Van Marle (eds.), Yearbook of morphology 2001, 203–235. Dordrecht: Kluwer Academic Publishers.10.1007/978-94-017-3726-5_8Search in Google Scholar

Huang, Chu-Ren, Shu-Kai Hsieh & Keh-Jiann Chen. 2017. Mandarin Chinese words and parts of speech: A corpus-based study. New York: Taylor & Francis.10.4324/9781315669014Search in Google Scholar

Huang, C. R., S. K. Hsieh, J. F. Hong, Y. Z. Chen, Y. L. Su, Y. X. Chen & S. W. Huang. 2010. 中文词汇网络: 跨语言知识处理基础架构的设计理念与实践 [Chinese Wordnet: Design, Implementation, and Application of an Infrastructure for Cross-lingual Knowledge Processing]. 中国语文[Study of Chinese Language] 24(2). 14–23.Search in Google Scholar

Kastovsky, Dieter. 1986. Productivity in word formation. Linguistics 24. 585–600. https://doi.org/10.1515/ling.1986.24.3.585.Search in Google Scholar

Kennedy, Christopher. 1998. On the monotonicity of polar adjectives. Amsterdam: John Benjamins Publishing.Search in Google Scholar

Kruisinga, Etsko. 1932. A handbook of present-day English, Part II: English accidence and syntax. Utrecht: Noordhoff.Search in Google Scholar

Landauer, Thomas & Susan Dumais. 1997. A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction and representation of knowledge. Psychological Review 104(2). 211–240. https://doi.org/10.1037/0033-295x.104.2.211.Search in Google Scholar

Lieber, Rochelle. 2010. Introducing morphology. Cambridge, UK: Cambridge University Press.Search in Google Scholar

Matlin, Margaret W. 2016. Pollyanna principle. In Rüdiger Pohl (ed.), Cognitive illusions: Intriguing phenomena in thinking, judgment and memory, 315–335. London: Routledge.Search in Google Scholar

McDonald, Scott & Richard Shillcock. 2001. Rethinking the word frequency effect: The neglected role of distributional information in lexical processing. Language and Speech 44. 295–323. https://doi.org/10.1177/00238309010440030101.Search in Google Scholar

Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg S. Corrado & Jeffrey Dean. 2013. Distributed representations of words and phrases and their compositionality. arXiv preprint arXiv:1310.4546, 1–9. https://arxiv.org/abs/1310.4546v1.Search in Google Scholar

Perek, Florent. 2018. Recent change in the productivity and schematicity of the way-construction: A distributional semantic analysis. Corpus Linguistics and Linguistic Theory 14(1). 65–97. https://doi.org/10.1515/cllt-2016-0014.Search in Google Scholar

Perek, Florent & Martin Hilpert. 2017. A distributional semantic approach to the periodization of change in the productivity of constructions. International Journal of Corpus Linguistics 22(4). 490–520. https://doi.org/10.1075/ijcl.16128.per.Search in Google Scholar

Plag, Ingo. 2003. Word formation in English. Cambridge, UK: Cambridge University Press.10.1017/CBO9780511841323Search in Google Scholar

Plag, Ingo. 2010. Compound stress assignment by analogy: The constituent family bias. Zeitschrift für Sprachwissenschaft 29(2). 243–282. https://doi.org/10.1515/zfsw.2010.009.Search in Google Scholar

R Core Team. 2013. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. http://www.R-project.org/.Search in Google Scholar

Riddle, Elizabeth. 1985. A historical perspective on the productivity of the suffixes -ness and -ity. In Jacek Fisiak (ed.), Historical semantics, historical word-formation, 435–461. New York: Mouton.10.1515/9783110850178.435Search in Google Scholar

Sahlgren, Magnus. 2001. Vector-based semantic analysis: Representing word meanings based on random labels. In ESSLI Workshop on Semantic Knowledge Acquistion and Categorization. Helsinki: Kluwer Academic Publishers.Search in Google Scholar

Schultink, Henk. 1961. Produktiviteit als morfologisch fenomeen [Productivity as a morphological phenomenon]. Forum der Letteren 2. 110–125.Search in Google Scholar

Shaoul, Cyrus & Chris Westbury. 2010. Exploring lexical co-occurrence space using HiDEx. Behavior Research Methods 42(2). 393–413. https://doi.org/10.3758/brm.42.2.393.Search in Google Scholar

Sichel, Herbert S. 1986. Word frequency distributions and type-token characteristics. The Mathematical Scientist 11. 45–72.Search in Google Scholar

Song, Yan, Shuming Shi, Jing Li & Haisong Zhang. 2018. Directional skip-gram: Explicitly distinguishing left and right context for word embeddings. Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: Human language technologies, Volume 2 (Short Papers), 175–180. New Orleans: Association for Computational Linguistics.10.18653/v1/N18-2028Search in Google Scholar

Torsten Hothorn, Kurt Hornik & Achim Zeileis. 2006. Unbiased recursive partitioning: A conditional inference framework. Journal of Computational & Graphical Statistics 15. 651–674. https://doi.org/10.1198/106186006x133933.Search in Google Scholar

Tse, Chi-Shing, Melvin J. Yap, Yuen-Lai Chan, Wei Ping Sze, Cyrus Shaoul & Dan Lin. 2017. The Chinese lexicon project: A megastudy of lexical decision performance for 25,000+ traditional Chinese two-character compound words. Behavior Research Methods 49(4). 1503–1519. https://doi.org/10.3758/s13428-016-0810-5.Search in Google Scholar

Wood, Simon N. 2017. Generalized additive models: An introduction with R. Boca Raton: CRC Press.10.1201/9781315370279Search in Google Scholar

Xu, Zheng. 2018. The word status of Chinese adjective-noun combinations. Linguistics 56(1). 207–256. https://doi.org/10.1515/ling-2017-0035.Search in Google Scholar

Xu, Zheng. 2019. Chinese adjective-noun combinations. In Franz Rainer, Francesco Gardani, Wolfgang U. Dressler & Hans LuschützkyChristian (eds.), Competition in inflection and word-formation, 5, 307–334. Springer.10.1007/978-3-030-02550-2_12Search in Google Scholar

Supplementary Material

The online version of this article offers supplementary material (https://doi.org/10.1515/cllt-2020-0059).

Received: 2020-09-22

Accepted: 2021-02-17

Published Online: 2021-03-10

Published in Print: 2022-10-26

Adjective–noun compounds in Mandarin: a study on productivity

Abstract

Acknowledgment

References

Supplementary Material

Journal and Issue

Articles in the same Issue