"A matter both of curioſity and uſefulneſs": Compiling the Corpus of English Texts on Language

Keywords: Coruña Corpus; corpus compilation; Late Modern English; scientific writing


This paper describes the compilation of CETeL, the subcorpus on Language and Linguistics in the Coruña Corpus of English Scientific Writing, and discusses the challenges encountered during the selection and digitisation of material. CETeL includes forty-four samples of texts on Language, languages, and Linguistics from the period 1700–1900 and, on completion, will contain c. 400,000 words.

The paper examines the historical context of academic writing on Language in the period and the way in which this context affects the process of compilation. It also discusses the compilation criteria used for the whole of the Coruña Corpus, in order to show the extent to which these criteria have themselves shaped the compilation of CETeL and how they help make the corpus representative of the disciplinary practices of the period.

Finally, the corpus is described according to a series of parameters used to ensure representativeness and balance, namely the date of publication of the samples, their genre, and the sex and geographical origin of their authors.


How to Cite
Monaco, L. M., & Puente-Castelo, L. (2019). "A matter both of curioſity and uſefulneſs": Compiling the Corpus of English Texts on Language. Research in Corpus Linguistics, 7, 47-68. https://doi.org/10.32714/ricl.07.03