Collaborative writing in face-to-face settings: A substantive and methodological review
Introduction
Defined as a task in which two or more learners interact with each other throughout the writing process to co-produce one single document, collaborative writing (CW) has been thriving in the last two decades (Storch, 2019). Following early work on CW in second language (L2) settings (e.g., DiCamilla & Antón, 1997; Kowal & Swain, 1994), CW has been introduced to and investigated in diverse language classrooms. This growing interest in CW has primarily been motivated by (a) the theoretical support afforded by the interaction hypothesis (Long, 1983), the output hypothesis (Swain, 1993), and sociocultural theory (Vygotsky, 1978); (b) the increasing popularity of communicative language teaching methods and task-based language teaching (TBLT); as well as (c) the fact that CW unites elements that are critical for L2 learning and writing development (e.g., intermediate peer feedback, negotiations of form and meaning, and L2 written output).
Previous research on collaborative L2 writing mainly falls under two strands. The first strand of studies, which has focused on examining the potential of CW for L2 learning, has used either descriptive or quasi-experimental methods to investigate whether CW offers language learning opportunities, promotes the development of L2 writing proficiency, and facilitates L2 performance or learning gains (e.g., Kim, 2008; Shehadeh, 2011; Wigglesworth & Storch, 2009). Findings in this area have generally confirmed the benefits of CW as a pedagogical task. Building upon such findings, the other strand of research has explored variables that may influence the potential of CW for L2 learning. Researchers have investigated, for example, how variation in learner traits, task design, task implementation, or learning settings may influence the different aspects of CW relating to L2 learning, including co-constructed texts, learner interaction, and task perception.
The two subsequent sections first review the different variables that have been investigated in CW and then review how prior research has analyzed evidence of learning, learners’ co-constructed texts, collaborative dialogues, and task perception in CW. A third section involves the issue of how such investigations are reported. It should be noted that while CW tasks have been widely employed in both traditional and computer-mediated communication (CMC) settings, this study exclusively focused on CW tasks implemented in face-to-face settings, as research in CMC settings has been reviewed fairly recently in Li (2018).
Variables that have been investigated in face-to-face CW research, in essence, can be divided into two categories: task-related variables and learner-related variables. The former includes variables concerning the design and implementation of CW tasks (i.e., task design and task implementation), whereas the latter involves variables relating to the traits of learners (e.g., proficiency, individual differences).
In terms of task-related variables, prior CW research has probed into task design variables such as task type and task mode. For instance, García Mayo (2002) and Alegria la Colina and García Mayo (2007) compared the L2 learning opportunities in both meaning-focused (e.g., jigsaw) and form-focused tasks (e.g., dictogloss, text reconstruction). Rouhshad, Wigglesworth, and Storch (2016) and Tan, Wigglesworth, and Storch (2010) examined the role of task modality by comparing how L2 learners performed differently in CW tasks designed for face-to-face settings and those designed for CMC settings.
With regard to the implementation of CW tasks, prior research has examined a variety of variables (e.g., group sizes, group-formation methods, language used for interaction, and pre-task modeling). For instance, Fernández Dobao, 2012, Fernández Dobao, 2014 examined how group size may influence the L2 written output, peer interaction, and vocabulary acquisition of intermediate Spanish learners in CW tasks. Mozaffari (2017) compared two CW task conditions in an Iranian EFL classroom: working in self-assigned pairs and working in teacher-assigned pairs. Unlike the studies examining the grouping of students, Zhang (2018a, examined how L1 or L2 use may influence EFL learners’ linguistic performance and peer interaction in CW.
Compared with task-related variables, the number of learner-related variables that have been examined seems to be more limited. One frequently investigated learner variable is proficiency. A number of studies have investigated how learners, when paired with peers of various proficiency levels, may assume differing roles, attend to language use differently, and form different relationships (Kim & McDonough, 2008; Storch & Aldosari, 2013; Watanabe & Swain, 2007). DiCamilla and Antón (2012) examined the relationship between proficiency level and L1 use in learners’ collaborative dialogues in CW. Another learner trait that has received growing attention is a learner’s linguistic background. For instance, a range of recent studies have examined how a pair member’s linguistic background (e.g., L2 learners vs. heritage language learners) may influence their interactional features (e.g., Fernández Dobao, 2020a; Henshaw, 2013; Torres & Cung, 2019; Walls, 2013, 2018) or perceptions of CW tasks (e.g., Fernández Dobao, 2020b). Also noteworthy is recent work by García Mayo and colleagues, which examined the potential of CW for a likely underrepresented learner population (i.e., young L2 learners) in Content and Language Integrated Language (CLIL) programs (e.g., Basterrechea & García Mayo, 2013; Calzada & García Mayo, 2020).
To understand the utility of CW in the language classroom in comparison with other tasks or in relation to the aforementioned variables, prior research has often investigated learning gains, learners’ jointly written texts, learner interaction, and task perception in CW. This section provides an overview of the measurement of these four aspects in prior research.
In detecting evidence for learning gains in CW, prior research has utilized a wide range of measures. For instance, Eckerth (2008) designed pre- and post-tests to measure gains in explicit knowledge of target grammatical features resulting from CW tasks. Henshaw (2013) considered learning gains through the incorporation of negotiated L2 forms in subsequent individual writing tasks. While all focusing on vocabulary learning in CW, Kim (2008) adopted a vocabulary knowledge scale to examine learners’ self-reported word knowledge in a pre-test and post-test, whereas Fernández Dobao (2014) and García Mayo and Zeitler (2017) compared learners’ usage of chosen lexis in a pre-test and a post-test following CW tasks. On the other hand, Shehadeh’s (2011) longitudinal study focused on writing development and measured improvement in learners’ writing quality after receiving CW as a semester-long treatment.
Similar to other L2 writing research, CW research has also employed the complexity, accuracy, fluency (CAF) framework to analyze written products (Housen et al., 2012). The measures of complexity (i.e., the use of diverse and sophisticated lexical and syntactic structures) in L2 writing, particularly syntactic complexity, have involved substantial variability (Lan, Liu, & Staples, 2019), ranging from T-unit, clause-related measures, and subordination-related measures, to specific lexico-grammatical features such as nominal structures (Biber, Gray, & Poonpon, 2011; Larsson & Kaatari, 2020; Norris & Ortega, 2009). Defined as “the extent to which an L2 learner’s performance deviates from a norm” (Housen et al., 2012, p. 4), accuracy has been mainly measured in three ways: holistic scales, number of error-free units, and the number of errors (Polio, 1997). Last, fluency, while often measured by text length, has also occasionally been considered as a temporal phenomenon and operationalized as words per minute (e.g., Chenoweth & Hayes, 2001). Overall, immense variability is characteristic of the metrics of CAF in previous L2 writing research.
To better understand the nature of collaboration and interaction and their possible associations with learner writing, learners’ collaborative dialogues have also often been analyzed. Specifically, three parameters have been employed: (a) focus area (i.e., what learners discuss); (b) language learning opportunities (i.e., how learners negotiate form and meaning); and (c) the dynamics of collaboration. Regarding focus area, a predominant framework (Storch, 2005) identified both task-related focus areas (e.g., content elicitation, discussion on organization) and interaction-related focus areas (e.g., interpersonal communication, backchannels) (McDonough, Crawford, & De Vleeschauwer, 2016; Wigglesworth & Storch, 2009; Zhang, 2019a, 2019b). Language learning opportunities are often operationalized as language-related episodes (LREs), defined as any part of interaction during which learners “talk about the language they are producing, question their language use, or correct themselves or others” (Swain & Lapkin, 1998, p. 326). LREs have been further analyzed for type (e.g., Fernández Dobao, 2014), resolution (e.g., Kim, 2008), and level of engagement (e.g., Rouhshad et al., 2016). Last, prior research has often analyzed the dynamics of collaboration using Storch’s (2002, 2013) model, which distinguishes five collaboration types based upon a global analysis of learners’ involvement in and control over a CW task (e.g., Chen & Hapgood, 2019; Watanabe & Swain, 2007).
Aside from their L2 output in CW, learners’ task perception has occasionally been examined to afford additional insights on the utility of CW tasks in language classrooms from the learner perspective. Interviews and questionnaires are two frequently employed methodologies to gather such information in prior research (Fernández Dobao & Blum, 2013; Fernández Dobao, 2020b; Storch, 2005).
Aiming to understand collaborative L2 writing as a subdomain of L2 research and inform future studies in this area, one goal of the current study is to review how previous face-to-face CW research has analyzed the various L2 features of CW, including the co-constructed texts, collaborative talks, and task perception.
In addition to synthesizing the analyses in CW research (i.e., the substantive review), this paper is also interested in the methodological practices in this domain. The field of quantitative L2 research has witnessed a surge in methodological syntheses in the past decade. In addition to surveying individual research and reporting practices in quantitative L2 research (e.g., Al-Hoorie & Vitta, 2019; Derrick, 2016; Plonsky, 2013, 2014; Plonsky & Ghanbar, 2018), previous efforts have systematically examined methodological practices in a range of subdomains of L2 research, such as L2 interaction (Plonsky & Gass, 2011), task-based learner production (Plonsky & Kim, 2016), computer-mediated interaction (Ziegler, 2016), and written corrective feedback (Liu & Brown, 2015). Their findings have suggested considerable idiosyncrasies and inconsistencies in the reporting of statistical data (e.g., standard deviation, effect sizes, confidence intervals, assumption checking) that are critical for better interpreting results in primary studies and conducting further meta-analytic research (Liu & Brown, 2015; Plonsky, 2013, 2014). Derrick (2016) also found very limited reporting of instrument piloting and revision, which poses obstacles for research replications and potential threats to internal validity. Other researchers have raised concerns about the reporting of reliability estimates for both instruments and the coding of L2 features (Larson‐Hall & Plonsky, 2015; Plonsky & Derrick, 2016; Plonsky & Kim, 2016; Saito & Plonsky, 2019).
Drawing upon the argument in Plonsky (2013) that advancement in a field of inquiry relies on “sound research designs, principled data analyses, and transparent reporting practices” (p. 656), this study joins other L2 researchers’ efforts in promoting methodological rigor by reviewing the research and reporting practices in quantitative CW research. Although still a young domain of research, CW has received much attention in the past decades. A methodological and substantive review of face-to-face CW research will, we hope, elucidate the existing scope of inquiry, shed light on areas in need of more research, and provide empirically-grounded guidance to future research efforts. With these goals in mind, this paper answers four research questions (RQs):
- 1
What are the study contexts and research design features in quantitative CW research in face-to-face L2 settings?
- 2
How have the different aspects of CW (i.e., co-constructed texts, learner interaction, task perception) been investigated in quantitative CW research conducted in face-to-face L2 settings?
- 3
What variables have been investigated in quantitative CW research in face-to-face L2 settings?
- 4
What research and reporting practices have been employed in quantitative CW research in face-to-face L2 settings?
Section snippets
Data collection
In an effort to alleviate the “file drawer problem” associated with publication bias (Rosenthal, 1979), both published and unpublished studies that met all of the following criteria were retrieved: (a) collaborative L2 writing research published before March 2018; (b) studies that employed quantitative methods; (c) studies that were conducted in a face-to-face setting; and (d) studies that were written in English (which is the predominant language for academic work). Given that the current
RQ1: learner demographic characteristics, research contexts, and design features
This section reports the demographic information, research contexts, and study design features of the 94 included studies. Table 4 presents two demographic features (proficiency level, age) and two features related to research contexts. In terms of proficiency level, over half of the studies (52 %) sampled intermediate learners, whereas only 9.6 % investigated beginners. It is also noteworthy that 14.9 % of the studies did not report the proficiency levels of the participants, although most of
Discussion
Following the tradition in research synthesis of reviewing prior research in order to inform future research efforts, this study examines both the substantive and methodological aspects of quantitative CW research in face-to-face settings from 1992 to 2017. This section first discusses major findings about the substantive aspects. It then discusses the findings about the methodological aspects with reference to methodological reviews in pertinent research areas, followed by a summary of
Conclusion
This study presents a methodological and substantive review of 94 quantitative studies on collaborative L2 writing implemented in face-to-face settings, covering both observational and experimental research in this domain between 1992 and 2017. When considering the recommendations proposed above, it is critical to acknowledge the limitations of the current study. First, this review exclusively focused on studies conducted in face-to-face settings, leaving out empirical research in CMC settings
Acknowledgement
Our sincere thanks to Qi Liu, who assisted with the arduous task of coding, and to Ekaterina Sudina for her insightful comments on an earlier draft of this paper.
Meixiu Zhang (Ph.D., Northern Arizona University) is Assistant Professor of Applied Linguistics at Texas Tech University, where she teaches courses in applied linguistics. Her primary research areas include L2 writing, corpus linguistics, L2 pedagogy, and second language acquisition. Her work has appeared in journals such as Journal of Second Language Writing, Language Teaching Research, System, and Corpora.
References1 (87)
Collaborative writing tasks in the L2 classroom: Comparing group, pair, and individual work
Journal of Second Language Writing
(2012)- et al.
Collaborative writing in pairs and small groups: Learners’ attitudes and perceptions
System
(2013) Cognitive task complexity and L2 written syntactic complexity, accuracy, lexical complexity, and fluency: A research synthesis and meta-analysis
Journal of Second Language Writing
(2017)- et al.
Methodological synthesis of research on the effectiveness of corrective feedback in L2 writing
Journal of Second Language Writing
(2015) The “file drawer problem” and tolerance for null results
Psychological Bulletin
(1979)Effects and student perceptions of collaborative writing in L2
Journal of Second Language Writing
(2011)Are two heads better than one? Pair work and grammatical accuracy
System
(1999)Collaborative writing: Product, process, and students’ reflections
Journal of Second Language Writing
(2005)Collaborative writing in the EFL classroom: The effects of L1 and L2 use
System
(2018)Towards a quantitative model of understanding the dynamics of collaboration in collaborative writing
Journal of Second Language Writing
(2019)
Exploring collaboratively written L2 texts among first-year learners of German in Google Docs
Computer Assisted Language Learning
Attention to form across collaborative tasks by low-proficiency learners in an EFL setting
The seven sins of L2 research: A review of 30 journals’ statistical quality and their CiteScore, SJR, SNIP, JCR Impact Factors
Language Teaching Research
On the foundations of knowledge in applied linguistics research: Sampling bias and the problem of generalizability
Annual Review of Applied Linguistics
Collaboration or cooperation? Analyzing group dynamics and revision processes in wikis
CALICO Journal
Use of wikis to promote collaborative EFL writing
Language Learning & Technology
Language-related episodes during collaborative tasks: A comparison of CLIL and EFL learners
Should we use characteristics of conversation to measure grammatical complexity in L2 writing development?
TESOL Quarterly
The impact of collaborative writing on the writing fluency of Iranian EFL learners
Journal of Language Teaching and Research
Child learners’ reflections about EFL grammar in a collaborative writing task: When form is not at odds with communication
Language Awareness
Understanding knowledge, participation and learning in L2 collaborative writing: A metacognitive theory perspective
Language Teaching Research
Fluency in writing: Generating text in L1 and L2
Written Communication
Instrument reporting practices in second language research
TESOL Quarterly
Repetition in the collaborative discourse of L2 learners: A Vygotskian perspective
Canadian Modern Language Review
Functions of L1 in the collaborative interaction of beginning and advanced second language learners
International Journal of Applied Linguistics
Investigating consciousness‐raising tasks: Pedagogically targeted and non‐targeted learning gains
International Journal of Applied Linguistics
Attention to form in collaborative writing tasks: Comparing pair and small group interaction
Canadian Modern Language Review
Exploring interaction between heritage and second language learners in the Spanish language classroom: Opportunities for collaborative dialogue and learning
Collaborative writing in mixed classes: What do heritage and second language learners think?
Foreign Language Annals
The effectiveness of two form‐focused tasks in advanced EFL pedagogy
International Journal of Applied Linguistics
Lexical language-related episodes in pair and small group work
International Journal of English Studies
Learning opportunities and outcomes of L2-L2 and L2-HL learner interaction during a collaborative writing task
Statistical assumptions in L2 research: A systematic review
Second Language Research
The contribution of collaborative and individual tasks to the acquisition of L2 vocabulary
The Modern Language Journal
The role of tasks as vehicles for language learning in classroom interaction
The effect of interlocutor proficiency on the collaborative dialogue between Korean as a second language learners
Language Teaching Research
Using pretask modelling to encourage collaborative learning opportunities
Language Teaching Research
Using collaborative language production tasks to promote students’ language awareness
Language Awareness
Grammatical complexity: ‘What Does It Mean’ and ‘So What’ for L2 writing classrooms?
Journal of Second Language Writing
Reporting and interpreting quantitative research findings: What gets reported and recommendations for the field
Language Learning
Syntactic complexity across registers: Investigating (in)formality in student writing
Journal of English for Academic Purposes
Computer-mediated collaborative writing in L2 contexts: An analysis of empirical research
Computer Assisted Language Learning
Cited by (44)
Expanding the collaborative writing research framework: A longitudinal analysis of how collaborative and independent writers orient to writing spaces
2024, Journal of Second Language WritingCollaborative writing in L2 classrooms: A research agenda
2023, Language TeachingUsing blended learning to teach writing to Saudi EFL university students during the COVID-19 pandemic
2024, International Journal of English Language and Literature Studies
Meixiu Zhang (Ph.D., Northern Arizona University) is Assistant Professor of Applied Linguistics at Texas Tech University, where she teaches courses in applied linguistics. Her primary research areas include L2 writing, corpus linguistics, L2 pedagogy, and second language acquisition. Her work has appeared in journals such as Journal of Second Language Writing, Language Teaching Research, System, and Corpora.
Luke Plonsky is Associate Professor of Applied Linguistics at Northern Arizona University, where he teaches courses in second language acquisition and research methods. His work in these and other areas has resulted in over 80 articles, book chapters, and books. Luke is Associate Editor of Studies in Second Language Acquisition, Managing Editor of Foreign Language Annals, Co-Editor of de Gruyter Mouton's Series on Language Acquisition, and Co-Director of the IRIS Database (iris-database.org). In addition to prior appointments at Georgetown and University College London, Luke has taught in Japan, The Netherlands, Poland, Spain, and Puerto Rico.
- 1
Note: Full references of studies included in the sample are available on the IRIS Database (iris-database.org).