Abstract
Much of the patterned use of language occupies a poorly charted middle ground of usages that are neither frozen, one-off items listable in dictionaries nor products of the maximally general rules found in grammars. These usages likewise fly below the radar of modular theories of language that make a strict distinction between items in a lexicon and the rules of syntax for combining them. Early constructionist approaches showed this neglected territory to be teeming with conventional form-meaning pairings, i.e., lexico-grammatical constructions. While corpora have been seen as a source for investigating these constructions, they have a fundamental but seldom-noted limitation: constructions are constituted by both syntagmatic and paradigmatic relations, but corpora lack the paradigmatic dimension. Thus corpora can reveal multiword items but not the relations among them that constitute constructions. We elaborate on an alternative, illustrating how the design of an existing machine-readable language model affords discovery of lexico-grammatical constructions by capturing, from the bottom up, the relations they contract with other patterns. The network is noise-ridden, but we exploit the noise as the necessary background against which constructions can be set into relief and thus made discoverable.
Funding statement: Taiwan’s Ministry of Science and Technology (Grant/Award Number: ‘MOST105-2511-S-008-008’).
Article Note
Nai-Lung Tsao’s contribution to the research reported here was completed while he was employed full-time at National Central University.
© 2020 Walter de Gruyter GmbH, Berlin/Boston