Abstract
Traditionally, automated methods for loanword detection have not received an abundance of attention within the field of language contact. However, as research on loanwords has begun utilizing corpora with word counts in the millions, these generous quantities of data pose challenges for traditional methods of linguistic annotation. This paper presents a method for automatically detecting anglicisms within Spanish text and presents a case study, applying this method to explore the social stratification of anglicisms in Argentine media. The findings of the case study suggest that anglicisms may function as prestige markers in Argentina, which may be a logical consequence of the mode of contact: those of upper socio-economic status have greater access to outlets where loanwords seem to emerge, such as the media, Internet, and second language education.
References
Alex, Beatrice. 2006. Integrating language knowledge resources to extend the English inclusion classifier to a new language. In Proceedings of the fifth international conference on language resources and evaluation, 2431–2436. Italy: Genoa.Search in Google Scholar
Alex, Beatrice. 2007. Automatic detection of english inclusions in mixed-lingual data with an application to parsing. Edinburgh: University of Edinburgh dissertation.Search in Google Scholar
Alex, Beatrice. 2008. Comparing corpus-based to web-based lookup techniques for automatic English inclusion detection. In Proceedings of the sixth international language resources and evaluation conference, 2693–2697. Morocco: Marrakech.Search in Google Scholar
Andersen, Gisle. 2005. Assessing algorithms for automatic extraction of anglicisms in Norwegian texts. In Pernilla Danielsson & Martijn Wagenmakers (eds.), Proceedings from the corpus linguistics 2005 conference. UK: Birmingham.Search in Google Scholar
Andersen, Gisle. 2014. Pragmatic borrowing. Journal of Pragmatics 67. 17–33. https://doi.org/10.1016/j.pragma.2014.03.005.Search in Google Scholar
Baayen, R. Harald. 2010. Demythologizing the word frequency effect: A discriminative learning perspective. The Mental Lexicon 5(3). 436–461. https://doi.org/10.1075/ml.5.3.10baa.Search in Google Scholar
Backus, Ad. 2013. A usage-based approach to borrowability. In Zenner Eline & Kristiansen Gitte (eds.), New perspectives on lexical borrowing, 19–39. Boston & Berlin: De Gruyter Mouton.10.1515/9781614514305.19Search in Google Scholar
Balteiro, Isabel. 2011. A reassessment of traditional lexicographical tools in the light of new corpora: Sports anglicisms in Spanish. International Journal of English Studies 11(2). 23–52. https://doi.org/10.6018/ijes/2011/2/149631.Search in Google Scholar
Barrs, Keith. 2014. Lexical semantics of English loanwords in Japanese. In Proceedings of the 47th British association for applied linguistics annual meeting, 29–35. Coventry: University of Warwick.Search in Google Scholar
Biber, Douglas, Randi Reppen, Erin Schnur & Romy Ghanem. 2016. On the (non) utility of Juilland’s D to measure lexical dispersion in large corpora. International Journal of Corpus Linguistics 21(4). 439–464. https://doi.org/10.1075/ijcl.21.4.01bib.Search in Google Scholar
Boczkowski, Pablo J. & Martin de Santos. 2007. When more media equals less news: Patterns of content homogenization in Argentina’s leading print and online newspapers. Political Communication 24(2). 167–180. https://doi.org/10.1080/10584600701313025.Search in Google Scholar
Brysbaert, Marc & New Boris. 2009. Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a New and improved word frequency measure for American English. Behavior Research Methods 41(4). 977–990. https://doi.org/10.3758/brm.41.4.977.Search in Google Scholar
Calude, Andreea Simona, Steven Miller & Mark Pagel. 2020. Modelling loanword success – A sociolinguistic quantitative study of Māori loanwords in New Zealand English. Corpus Linguistics and Linguistic Theory 16(1). 29–66. https://doi.org/10.1515/cllt-2017-0010.Search in Google Scholar
Chan, Tak Wing & John H. Goldthorpe. 2007. Social status and newspaper readership. American Journal of Sociology 112(4). 1095–1134. https://doi.org/10.1086/508792.Search in Google Scholar
Chesley, Paula. 2010. Lexical borrowings in French: Anglicisms as a separate phenomenon. Journal of French Language Studies 20(3). 231–251. https://doi.org/10.1017/s0959269510000049.Search in Google Scholar
Crespo-Fernández, Eliecer & Carmen Luján-García. 2017. Anglicisms and word axiology in homosexual language. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 30(1). 74–103. https://doi.org/10.1075/resla.30.1.04cre.Search in Google Scholar
De Smedt, Tom & Daelemans Walter. 2012. Pattern for Python. Journal of Machine Learning Research 13(66). 2063–2067.Search in Google Scholar
Finkel, Jenny Rose, Trond Grenager & Christopher Manning. 2005. Incorporating non-local information into information extraction systems by Gibbs sampling. In Proceedings of the 43rd annual meeting of the association for computational linguistics, 363–370. Michigan: Ann Arbor.10.3115/1219840.1219885Search in Google Scholar
Fisherman, Haya. 1990. Attitudes toward foreign words in contemporary Hebrew. International Journal of the Sociology of Language 86(1). 5–40. https://doi.org/10.1515/ijsl.1990.86.5.Search in Google Scholar
Francom, Jerid, Mans Hulden & Ussishkin Adam. 2014. ACTIV-ES: A comparable, cross-dialect corpus of ‘everyday’ Spanish from Argentina, Mexico, and Spain. In The ninth international conference on language resources and evaluation, 1733–1737. Iceland: Reykjavik.Search in Google Scholar
Gries, Stefan Th. 2008. Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics 13(4). 403–437. https://doi.org/10.1075/ijcl.13.4.02gri.Search in Google Scholar
Guzman, Gualberto A., Jacqueline Serigos, Barbara E. Bullock & Jacqueline Almeida Toribio. 2016. Simple tools for exploring variation in code-switching for linguists. In Proceedings of the second workshop on computational approaches to code switching, 12–20. Austin, Texas: Association for Computational Linguistics.10.18653/v1/W16-5802Search in Google Scholar
Hassall, Tim, Elisabet Titik Murtisari, Christine Donnelly & Jeff Wood. 2008. Attitudes to western loanwords in Indonesian. International Journal of the Sociology of Language 2008(189). 55–84. https://doi.org/10.1515/ijsl.2008.003.Search in Google Scholar
Haugen, Einar. 1950. The analysis of linguistic borrowing. Language 26(2). 210–231. https://doi.org/10.2307/410058.Search in Google Scholar
Hughes, Baden, Timothy Baldwin, Steven Bird, Jeremy Nicholson & Andrew MacKinlay. 2006. Reconsidering language identification for written language resources. In Proceedings of 5th international conference on language resources and evaluation, 485–488. Italy: Genoa.Search in Google Scholar
Koo, Hahn. 2015. An unsupervised method for identifying loanwords in Korean. Language Resources and Evaluation 49(2). 1–19. https://doi.org/10.1007/s10579-015-9296-5.Search in Google Scholar
Leidig, Sebastian, Tim Schlippe & Tanja Schultz. 2014. Automatic detection of anglicisms for the pronunciation dictionary generation: A case study on our German it corpus. In 4th Workshop on spoken language technologies for under-resourced languages. Russia: Petersburg.Search in Google Scholar
Loveday, Leo. 1996. Language contact in Japan: A socio-linguistic history. Oxford studies in language contact. Oxford: Clarendon Press.Search in Google Scholar
Mansikkaniemi, André & Mikko Kurimo. 2012. Unsupervised vocabulary adaptation for morph-based language models. In Proceedings of the NAACL-HLT 2012 workshop: Will we ever really replace the N-Gram model? On the future of language modeling for HLT, 37–40. Montréal, Canada: Association for Computational Linguistics.Search in Google Scholar
Matus-Mendoza, Maríadelaluz. 2002. The English lexical loan: A class marker. Journal of Hispanic Higher Education 1(4). 329–337. https://doi.org/10.1177/153819202236977.Search in Google Scholar
Nair, Vineeth G. 2014. Getting started with beautiful soup. Birmingham, UK: Packt Publishing.Search in Google Scholar
Ngom, Fallou. 2003. The social status of Arabic, French, and English in the Senegalese speech community. Language Variation and Change 15(3). 351–368. https://doi.org/10.1017/s0954394503153045.Search in Google Scholar
Onysko, Alexander & Esme Winter-Froemel. 2011. Necessary loans – luxury loans? Exploring the pragmatic dimension of borrowing. Journal of Pragmatics 43(6). 1550–1567. https://doi.org/10.1016/j.pragma.2010.12.004.Search in Google Scholar
Rosner, Mike & Farrugia Paulseph-John. 2007. A tagging algorithm for mixed language identification in a noisy domain. In Proceedings of the eighth annual conference of the international speech communication association, 190–193. Belgium: Antwerp.10.21437/Interspeech.2007-77Search in Google Scholar
Ruellot, Viviane. 2011. English in French print advertising from 1999 to 2007. World Englishes 30(1). 5–20. https://doi.org/10.1111/j.1467-971x.2010.01654.x.Search in Google Scholar
Sanou, Rosa, Graciela Albiñana, Graciela Galli & Claudia Castañeda. 2017. Análisis Sociolingüístico: Actitudes y Empleo de Anglicismos En San Juan [Sociolinguistic analysis: Attitudes and use of anglicisms in San Juan]. Texturas 16. 19–40. https://doi.org/10.14409/texturas.v0i16.6937.Search in Google Scholar
Serigos, Jacqueline. 2016. Using distributional semantics in loanword research: A concept-based approach to quantifying semantic specificity of anglicisms in Spanish. International Journal of Bilingualism 21(5). 521–540. https://doi.org/10.1177/1367006916635836.Search in Google Scholar
Solorio, Thamar & Yang Liu. 2008. Learning to predict code-switching points. In Proceedings of the conference on empirical methods in natural language processing, 973–981. Hawaii: Honolulu.10.3115/1613715.1613841Search in Google Scholar
Varga, Dražen, Lidija Orešković Dvorski & Saša Bjelobaba. 2012. English loanwords in French and Italian daily newspapers. Studia Romanica et Anglica Zagrabiensia 56. 71–84. UDC 811.133.1’373.45:811.111.Search in Google Scholar
Varra, Rachel Marie. 2013. The social correlates of lexical borrowing in Spanish in New York City. New York: University of New York dissertation.Search in Google Scholar
Winford, Donald. 2003. An introduction to contact linguistics, 1st edn. Oxford: Wiley-Blackwell.Search in Google Scholar
Zenner, Eline, Dirk Speelman & Dirk Geeraerts. 2014. A sociolinguistic analysis of borrowing in weak contact situations: English loanwords and phrases in expressive utterances in a Dutch reality TV show. International Journal of Bilingualism 19(3). 333–346. https://doi.org/10.1177/1367006914521699.Search in Google Scholar
Zenner, Eline, Dirk Speelman & Dirk Geeraerts. 2012. Cognitive sociolinguistics meets loanword research: Measuring variation in the success of anglicisms in Dutch. Cognitive Linguistics 23(4). 749–792. https://doi.org/10.1515/cog-2012-0023.Search in Google Scholar
Zipf, George. 1935. The psychobiology of language: An introduction to dynamic philology. Cambridge, Massachusetts: M.I.T. Press.Search in Google Scholar
© 2020 Walter de Gruyter GmbH, Berlin/Boston