Rethinking interviews as representations of spoken language in learner corpora

Keywords: learner corpus research, spoken language, task, interview, representativeness

Abstract

Following the call to examine the role of learner corpora in SLA research (Bell and Payant 2021), this paper discusses spoken learner corpora ––specifically those collected through interviews–– and considers the aspects of spoken learner language that they represent. The interview is both an elicitation technique and a complex genre. The overlapping of the two conceptualisations under the same term may give rise to problems of definition about the nature of the language collected and, as a consequence, to difficulties in interpretation when assessing the characteristics of spoken learner data. In this paper, we use original research to exemplify some of the areas that need some rethinking in terms of future reconceptualisation about how spoken data are collected and analysed. This research shows the potential impact of the degree of interviewer/interviewee engagement with the task, suggesting that not enough attention has been paid to the genre of interview in learner corpus research.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

Aguado-Jiménez, Pilar, Pascual Pérez-Paredes and Purificación Sánchez. 2012. Exploring the use of multidimensional analysis of learner language to promote register awareness. System 40/1: 90–103.

Aijmer, Karin. 2018. Intensification with very, really and so in selected varieties of English. In Sebastian Hoffmann, Andrea Sand, Sabine Arndt-Lappe and Lisa Marie Dillmann eds. Corpora and Lexis. Leiden. Rodopi, 106–139.

Allwood, Jens. 2000. An activity-based approach to pragmatics. In Harry Bunt and William Black eds. Abduction, Belief and Context in Dialogue: Studies in Computational Pragmatics. Amsterdam: John Benjamins, 47–80.

Anderwald, Lieselotte and Susanne Wagner. 2007. The Freiburg English Dialect Corpus: Applying corpus-linguistic research tools to the analysis of dialect data. In John C. Beal, Karen P. Corrigan and Hermann L. Moisl eds. Creating and Digitizing Language Corpora Volume 1: Synchronic Databases. London: Palgrave Macmillan, 35–53.

Beeching, Kate. 2016. Pragmatic Markers in British English: Meaning in Social Interaction. Cambridge: Cambridge University Press.

Bell, Philippa, Laura Collins and Emma Marsden. 2021. Building an oral and written learner corpus of a school programme: Methodological issues. In Bert Le Bryn and Magali Paquot eds. Learner Corpus Research Meets Second Language Acquisition. Cambridge: Cambridge University Press, 214–242.

Bell, Philippa and Caroline Payant. 2021. Designing learner corpora: Collection, transcription, and annotation. In Nicole Tracy-Ventura and Magali Paquot eds, 53–67.

Biber, Douglas. 1995. Dimensions of Register Variation: A Cross-Linguistic Comparison. Cambridge: Cambridge University Press.

Biber, Douglas, Susan Conrad, Randi Reppen, Pat Byrd, Marie Helt, Victoria Clark, Viviana Cortes, Eniko Csomay and Alfredo Urzua. 2004. Representing Language Use in the University: Analysis of the TOEFFL 2000 Spoken and Written Academic Language Corpus. Princeton: Educational Testing Service.

Biber, Douglas, Stig Johansson, Geoffrey Leech, Susan Conrad and Edward Finegan. 1999. Longman Grammar of Spoken and Written English. Harlow: Longman.

Biber, Douglas, Stig Johansson, Geoffrey Leech, Susan Conrad and Edward Finegan. 2021. Grammar of Spoken and Written English. Amsterdam: John Benjamins.

Bunt, Harry. 2022. The multifunctionality of utterances in interactive discourse. In Zihan Yin and Elaine Vine eds. Multifunctionality in English: Corpora, Language and Academic Literacy Pedagogy. London: Routledge, 11–29.

Carter, Ronald and Michael McCarthy. 2006. Cambridge Grammar of English: A Comprehensive Guide. Cambridge: Cambridge University Press.

Carter, Ronald and Michael McCarthy. 2017. Spoken grammar: Where are we and where are we going? Applied Linguistics 38/1: 1–20.

Castello, Erik. 2023. Stance adverbials in spoken English interactions: Insights from corpora of L1 and L2 elicited conversations. Contrastive Pragmatics 4/2: 243–273

Cohen, Louis, Lawrence Manion and Keith Morrison. 2017. Research Methods in Education. New York: Routledge.

Crawford, William J. 2022. Corpora and speaking skills. In Reka R. Jablonkai and Eniko Csomay eds. The Routledge Handbook of Corpora and English Language Teaching and Learning. New York: Routledge, 89–101.

Curry, Niall, Robbie Love and Olivia Goodman. 2022. Adverbs on the move: Investigating publisher application of corpus research on recent language change to ELT coursebook development. Corpora 17/1: 1–38.

Curry, Niall and Geraldine Mark. 2023. Using corpus linguistics in materials development and teacher education. Second Language Teacher Education 22: 187–208.

De Cock, Sylvie. 2004. Preferred sequences of words in NS and NNS speech. Belgian Journal of English Language and Literatures 2: 225–246.

Friginal, Erik, Joseph J. Lee, Brittany Polat and Audrey Roberson. 2017. Exploring Spoken English Learner Language Using Corpora: Learner Talk. London: Springer.

Friginal, Eric and Brittany Polat. 2015. Linguistic dimensions of learner speech in English interviews. Corpus Linguistics Research 1: 53–82.

Fung, Loretta and Ronald Carter. 2007. Discourse markers and spoken English: Native and learner use in pedagogic settings. Applied Linguistics 28/3: 410–439.

Gablasova, Dana, Vaclav Brezina and Tony McEnery. 2019. The Trinity Lancaster Corpus: development, description and application. International Journal of Learner Corpus Research 5/2: 126–158.

Gilquin, Gaëtanelle. 2021. Learner corpora. In Magali Paquot and Stefan Th. Gries eds, 283–303.

Gilquin, Gaëtanelle, Sylvie De Cock and Sylviane Granger. 2010. The Louvain International Database of Spoken English Interlanguage. Handbook and CD-ROM. Louvain-La-Neuve: Presses universitaires de Louvain.

Gráf, Tomáš. 2017. The Story of the Learner Corpus LINDSEI CZ. Karlova: Univerzita Karlova, Filozofická fakulta. https://dspace.cuni.cz/bitstream/handle/20.500.11956/97524/1541592_tomas_graf_22-35.pdf?sequence=1&isAllowed=y

Gut, Ulrike. 2012. The LeaP corpus: A multilingual corpus of spoken learner German and learner English. In Thomas Schmidt and Kai Wörmer eds. Multilingual Corpora and Multilingual Corpus Analysis. Amsterdam: John Benjamins, 3–24.

Hanks, Elizabeth, Tony McEnery, Jesse Egbert, Tove Larsson, Douglas Biber, Randi Reppen, Paul Baker, Vaclav Brezina, Gavin Brookes, Isobelle Clarke and Raffaella Bottini. 2024. Building LANA-CASE, a spoken corpus of American English conversation: Challenges and innovations in corpus compilation. Research in Corpus Linguistics 12/2: 24–44.

Hasselgreen, Angela. 2004. Testing the Spoken English of Young Norwegians: A study of Test Validity and the Role of ‘Smallwords’ in Contributing to Pupils’ Fluency. Cambridge: Cambridge University Press.

Ishikawa, Shin’ichi. 2019. The ICNALE spoken dialogue: A new dataset for the study of Asian learners’ performance in L2 English interviews. English Teaching 74/4: 153–177.

Jones, Christian. 2022. What are the basics of analysing a corpus? In Anne O’Keeffe and Michael McCarthy eds, 126–139.

Knight, Dawn and Svenja Adolphs. 2022. Building a spoken corpus? In Anne O’Keeffe and Michael McCarthy eds, 21–34.

Knight, Dawn, Fernando Loizides, Steven Neale, Laurence Anthony and Irena Spasić. 2021. Developing computational infrastructure for the CorCenCC corpus: The national corpus of contemporary Welsh. Language Resources and Evaluation 55/1: 789–816.

Koester, Almut. 2022. Building small specialised corpora. In Anne O’Keeffe and Michael McCarthy eds, 48–61.

Larsson, Tove, Tony Berber Sardinha, Bettany Gray and Douglas Biber. 2023. Exploring early L2 writing development through the lens of grammatical complexity. Applied Corpus Linguistics 3/3: 100077. https://doi.org/10.1016/j.acorp.2023.100077

Lee, David. 2002. Genres, registers, text types, domains and styles: Clarifying the concepts and navigating a path through the BNC jungle. In Bernhard Kettemann and Georg Marko eds. Teaching and Learning by Doing Corpus Analysis: Proceedings of the Fourth International Conference on Teaching and Language Corpora. Leiden: Rodopi, 245–292.

Love, Robbie. 2020. Overcoming Challenges in Corpus Construction: The Spoken British National Corpus 2014. London: Routledge.

Mann, Steve. 2011. A critical review of qualitative interviews in applied linguistics. Applied Linguistics 32/1: 6–24.

McCarthy, Michael. 2010. Spoken fluency revisited. English Profile Journal 1. https://doi.org/10.1017/S2041536210000012.

McCarthy, Michael. 2020. Innovations and Challenges in Grammar. London: Routledge.

McCarthy, Michael and Ronald Carter. 1994. Language as Discourse: Perspectives for Language Teaching. Routledge: London.

McEnery, Tony, Robbie Love and Vaclav Brezina. 2017. Compiling and analysing the Spoken British National Corpus 2014. International Journal of Corpus Linguistics 22/3: 311–318.

McKinley, Jim and Heath Rose eds. 2017. Doing Research in Applied Linguistics: Realities, Dilemmas and Solutions. London: Routledge.

Mukherjee, Joybrato. 2009. The grammar of conversation in advanced spoken learner English. In Karin Aijmer ed. Corpora and Language Teaching. Amsterdam: John Benjamins, 203–230.

O’Keeffe, Anne and Svenja Adolphs. 2008. Response tokens in British and Irish discourse. In Klaus P. Schneider and Anne Barron eds. Variational Pragmatics: A Focus on Regional Varieties in Pluricenctric Languages. Amsterdam: John Benjamins, 69–98.

O’Keeffe, Anne and Michael McCarthy eds. 2022. The Routledge Handbook of Corpus Linguistics. London: Routledge.

Myers, Greg. 2010. Stance-taking and public discussion in blogs. Critical Discourse Studies 7/4: 263–275.

Paquot, Magali and Stefan Th. Gries eds. 2021. A Practical Handbook of Corpus Linguistics. New York: Springer International Publishing.

Paquot, Magali and Luke Plonsky. 2017. Quantitative research methods and study quality in learner corpus research. International Journal of Learner Corpus Research 3/1: 61–94.

Pérez-Paredes, Pascual. 2019. The pedagogic advantage of teenage corpora for secondary school learners. In Peter Crosthwaite ed. Data Driven Learning for the Next Generation: Corpora and DDL for Pre-tertiary Learners. London: Routledge, 67–87.

Pérez-Paredes, Pascual and Geraldine Mark. 2022. What can corpora tell us about language learning? In Anne O’Keeffe and Michael McCarthy eds, 312–327.

Pérez-Paredes, Pascual and María Sánchez-Torne. 2019. The linguistic dimension of L2 interviews: A multidimensional analysis of native speaker language. Focus on ELT Journal 1/1: 4-26.

Stubbs, Michael. 2007. On texts, corpora and models of language. In Michael Hoey, Michaela Malhberg, Michael Stubbs and Wolfgang Teubert eds. Text, Discourse and Corpora: Theory and Analysis. London: Bloomsbury, 127–161.

Tracy-Ventura, Nicole and Florence Myles. 2015. The importance of task variability in the design of learner corpora for SLA research. International Journal of Learner Corpus Research 1/1: 58–95.

Tracy-Ventura, Nicole and Magali Paquot eds. 2021. The Routledge Handbook of Second Language Acquisition and Corpora. London: Routledge.

Tracy-Ventura, Nicole, Magali Paquot and Florence Myles. 2021. The future of corpora in SLA. In Nicole Tracy-Ventura and Magali Paquot eds, 409–424.

Tyler, Andrea and Lourdes Ortega. 2018. Usage-inspired L2 instruction: An emergent, researched pedagogy. In Andrea Tyler, Lourdes Ortega, Mariko Uno and Hae In Park eds. Usage-Inspired L2 Instruction : Researched Pedagogy. Amsterdam: John Benjamins, 3–26.

Waters, Cathleen. 2013. Transatlantic variation in English adverb placement. Language Variation and Change 25/2: 179–200.

Published
2024-06-10
How to Cite
Pérez-Paredes, P., & Mark, G. (2024). Rethinking interviews as representations of spoken language in learner corpora. Research in Corpus Linguistics, 12(2), 111–145. https://doi.org/10.32714/ricl.12.02.06