Design and construction of a social media corpus: Influencers’ speech in vlogs
Abstract
This article outlines the creation of a social media corpus of Turkish vlogs on YouTube, aimed at analyzing the translanguaging practices and multimodal communication of Turkish social media influencers. It firstly describes the process of constructing the corpus, including transcription conventions and ad hoc annotation. The article then analyzes the phenomenon of translanguaging, with an emphasis on its prevalent forms and modes. Given the challenges associated with compiling a multimodally rich social media corpus, this paper provides strategies for manually transcribing and annotating linguistic and semiotic features in ELAN software, as well as strategies for managing tier-based annotations for vlog datasets. Additionally, the study presents approaches for handling non-standard linguistic codes and marked occurrences in language contact zones, illustrated through examples drawn from the vlog corpus where Turkish serves as the standard code.
Downloads
Metrics
References
Arısoy, Ebru., Doğan Can, Siddika Parlak, Hasim Sak and Murat Saraçlar. 2009. Turkish broadcast news transcription and retrieval. IEEE Transactions on Audio, Speech, and Language Processing 17/5: 874–883.
Baynham, Mike and Tong King Lee. 2019. Translation and Translanguaging. New York: Routledge.
Blackledge, Adrian and Angela Creese. 2017. Translanguaging and the body. International Journal of Multilingualism 14/3: 250–268.
Blommaert, Jan. 2008. Grassroots Literacy. New York: Routledge.
Blommaert, Jan and Piia Varis. 2011. Language and superdiversity. Diversities 13/2: 3–21.
Bokhove, Christian and Christopher Downey. 2018. Automated generation of ‘good enough’ transcripts as a first step to transcription of audio-recorded data. Methodological Innovations 11/2: 1–14.
Dynel, Marta. 2014. Participation framework underlying YouTube interaction. Journal of Pragmatics 73: 37–52.
Frobenius, Maximiliane. 2011. Beginning a monologue: The opening sequence of video blogs. Journal of Pragmatics 43/3: 814–827.
Jacquemet, Marco. 2005. Transidiomatic practices: Language and power in the age of globalization. Language and Communication 25/3: 257–277.
Knight, Dawn, David Evans, Ronald Carter and Svenja Adolphs. 2009. HeadTalk, HandTalk and the corpus: Towards a framework for multi-modal, multi-media corpus development. Corpora 4/1: 1–32.
Kramsch, Claire. 2018. Trans-spatial utopias. Applied Linguistics 39/1: 108–115.
Li, Wei. 2011. Moment analysis and translanguaging space: Discursive construction of identities by multilingual Chinese youth in Britain. Journal of Pragmatics 43/5: 1222–1235.
Love, Robbie. 2020. Overcoming Challenges in Corpus Construction: The Spoken British National Corpus 2014. New York: Routledge.
Lustig, Andrew, Gavin Brookes and Daniel Hunt. 2021. Social semiotics of gangstalking evidence videos on YouTube: Multimodal discourse analysis of a novel persecutory belief system. JMIR Mental Health 8/10: e30311. https://doi.org/10.2196/30311
Mısır, Hülya and Hale Işık Güler. 2023. Translanguaging dynamics in the digital landscape: Insights from a social media corpus. Language Awareness 32/3: 1–20.
Otheguy, Ricardo, Ofelia García and Wallis Reid. 2015. Clarifying translanguaging and deconstructing named languages: A perspective from linguistics. Applied Linguistics Review 6/3: 281–307.
Schmidt, Axel and Konstanze Marx. 2019. Multimodality as challenge: YouTube data in linguistic corpora. In Janina Wildfeuer, Jana Pflaeging, John A. Bateman, Ognyan Seizov and Chiao-I Tseng eds. Multimodality: Disciplinary Thoughts and the Challenge of Diversity. Berlin: Mouton De Gruyter, 115–144.
Wittenburg, Peter, Hennie Brugman, Albert Russel, Alex Klassmann and Han Sloetjes. 2006. ELAN: A professional framework for multimodality research. In Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk and Daniel Tapias eds. Proceedings of LREC 2006, Fifth International Conference on Language Resources and Evaluation, 1556–1559.
Zhu, Hua and Wei Li. 2020. Translanguaging, identity, and migration. In Jane Jackson ed. The Routledge Handbook of Language and Intercultural Communication. New York: Routledge, 234–248.
Copyright (c) 2024 Research in Corpus Linguistics
This work is licensed under a Creative Commons Attribution 4.0 International License.