Loughborough University
Browse
MORTON_NESI_DHC_mpdf.pdf (334.68 kB)

Corpora, catalogues and correspondence: The item-level identification and digitisation of business letters for the British Telecom Correspondence

Download (334.68 kB)
chapter
posted on 2018-01-18, 11:23 authored by Ralph Morton, Hilary Nesi
This paper explores some of the challenges in working with archive material to produce language corpora. It takes as a case study the British Telecom Correspondence Corpus (BTCC) which contains a selection of the letters held in the BT Archives, housed in Holborn Telephone Exchange. One of the essential differences between a corpus and an archive is that a corpus is intended to be representative of a language variety. Material makes its way into historical archives in a variety of ways, and while they may preserve a breadth of material, archives are not generally collected to be representative, nor are they primarily designed to facilitate linguistic investigation. Work on the BTCC began as part of a Jisc-funded project to digitise the BT Archives and create a ‘research resource for the higher education sector’ (Hay, 2014:12). The BT Digital Archives became available to the public in July 2013. Our experiences using this resource inform the second half of the paper, in particular regarding the identification of corpus material and the difficulty in identifying letters at an item level. This leads to a wider discussion of how best to digitise physical archives.

History

School

  • Social Sciences

Department

  • Communication, Media, Social and Policy Studies

Published in

Proceedings of the Digital Humanities Congress 2014 Studies in the Digital Humanities

Pages

N/A - N/A (12)

Citation

MORTON, R. and NESI, H., 2015. Corpora, catalogues and correspondence: The item-level identification and digitisation of business letters for the British Telecom Correspondence. In: Mills, C., Pidd, M. and Williams, J. (eds.). Proceedings of the Digital Humanities Congress 2014. Studies in the Digital Humanities, Sheffield: HRI Online Publications.

Publisher

© The authors. Published by HRI Open Book

Version

  • VoR (Version of Record)

Publisher statement

This work is made available according to the conditions of the Creative Commons Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0) licence. Full details of this licence are available at: http://creativecommons.org/licenses/by-nd/4.0/

Publication date

2015

Notes

This is an Open Access Article. It is published by HRI Open Book under the Creative Commons Attribution-NoDerivatives 4.0 International Licence (CC BY-ND). Full details of this licence are available at: http://creativecommons.org/licenses/by-nd/4.0/

Language

  • en