Designing and building SCoPE²: A spoken corpus of Brazilian Portuguese and L2-English

Keywords: bilingual corpus, spoken corpus, L2 corpus, corpus design, corpus construction

Abstract

This paper presents the process of designing and building a bilingual spoken corpus in order to pragmatically analyse oral L2-English discourse produced by a group of Brazilian university students living in Ireland. It discusses some of the decisions made, challenges faced, and considerations taken while designing a do-it-yourself corpus with a theoretical framework grounded in Corpus Pragmatics. The main objective is to share the lessons learned by examining the steps of designing and building SCoPE², a bilingual spoken corpus, including the selection of participants, gathering data, and challenges in transcribing and coding spoken language with pragmatics in mind.

Downloads

Download data is not yet available.

References

Adolphs, Svenja and Knight, Dawn. 2010. Building a spoken corpus: what are the basics?. In Anne O’Keeffe and Michael McCarthy eds. The Routledge Handbook of Corpus Linguistics. Abingdon: Routledge, 38-52.

Aijmer, Karin and Christoph Rühlemann eds. 2015. Corpus Pragmatics: A Handbook. Cambridge: Cambridge University Press.
Cutting, Joan. 2002. Pragmatics and Discourse: A Resource Book for Students. London: Routledge.

Farr, Fiona, Brona Murphy and Anne O’Keeffe. 2004. The Limerick corpus of Irish English: design, description and application. Teanga 21: 5-29. https://ulir.ul.ie/bitstream/handle/10344/4712/Farr_2004_Limerick.pdf?sequence=2 (15/05/2019).

Granger, Sylviane. 2002. A bird’s-eye view of learner corpus research. In Granger, Sylviane, Joseph Hung and Stephanie Petch-Tyson eds. Computer Learner Corpora, Second Language Acquisition and Foreign Language Teaching. Amsterdam: John Benjamins, 3-33.

Kirk, John M and Gisle Andersen. 2016. Compilation, transcription, markup and annotation of spoken corpora. International Journal of Corpus Linguistics 21/3: 291-298.

McCarthy, Michael. 1998. Spoken Language and Applied Linguistics. Cambridge: Cambridge University Press.

McCarthy, Michael and Anne O’Keeffe. 2010. Historical perspective: what are corpora and how have they evolved?. In Anne O’Keeffe and Michael McCarthy eds. The Routledge Handbook of Corpus Linguistics. Abingdon: Routledge, 3-13.

Mello, Heliana, Tommaso Raso, Maryualê M. Mittmann, Heloísa P. Vale and Priscila O. Côrtes. 2012. Transcrição e segmentação prosódica do corpus C-ORA-BRASIL: critérios de implementação e validação. In Raso, Tommaso and Heliana Mello eds. C-ORAL-BRASIL I: Corpus de Referência do Português Falado Informal. Belo Horizonte: Editora UFMG, 125-176.

O’Keeffe, Anne, Michael McCarthy and Ronald Carter. 2007. From Corpus to Classroom: Language Use and Language Teaching. Cambridge: Cambridge University Press.

Prodromou, Luke. 2003. In search of the Successful User of English: how a corpus of non-native language could impact on EFL teaching. Modern English Teacher 12/2: 5-14.

Prodromou, Luke. 2008. English as a Lingua Franca: A Corpus-based Analysis. London: Continuum.

Romero-Trillo, Jesus ed. 2008. Pragmatics and Corpus Linguistics: A Mutualistic Entente. Berlin/New York: Mouton de Gruyter.

Rühlemann Christoph and Brian Clancy. 2018. Corpus linguistics and pragmatics. In Ilie, Cornelia and Neal R. Norrick eds. 2018. Pragmatics and its Interfaces. Amsterdam/Philadelphia: John Benjamins.
Published
2020-05-04
How to Cite
Santos, G. (2020). Designing and building SCoPE²: A spoken corpus of Brazilian Portuguese and L2-English. Research in Corpus Linguistics, 8(1), 49-64. https://doi.org/10.32714/ricl.08.01.04