Behind the Word Clouds: Electronic Text, Machine Reading and Corpus Linguistics: Tim Shortis Argues That Corpus Linguistics Is Changing Knowledge about Language, and Explains the Theory Behind It and Its Potential for the Classroom
Shortis, Tim, English Drama Media
A revolution in knowledge about language
Hyperbole comes easily in the excited discourse around the impact of ICTs and their ongoing penultimate promises. So it is with caution that I am suggesting that there is a quiet revolution going on in what counts as knowledge about language and meaningful reading and that little of this has permeated what is done in school English lessons so far. This may be about to change as we come face to face with ever-larger collections of electronically mediated text and exemplification of new methods for reading them.
The new and ever larger collections of resources are apparent, the means of reading them less so--although recent higher education research may point a way. UK Public Library memberships now offer free home access to the full Oxford English Dictionary (www.oed.com) along with digital archives of contemporary and historical newspapers. Agencies such as the National Archive, the Old Bailey, JISC and The British Library have all put significant collections of searchable text online. Some of these are plain text, some facsimiles, and some both. For example, Oxford University/JISC collaboration's magnificent World War 1 collection (http:/ /www.oucs.ox.ac.uk/ww1lit/) offers 5,000 textual artefacts mainly in facsimile form but with searchable words-only transcripts. All these collections involve engaging with a different order of textual scale and will require different kinds of literacy to being curled up in your chair reading a book under the light of an Anglepoise, although that will of course, remain important. The question then is how are we, as English teachers, as a professional community of practice specializing in the learning of literacy, to respond to these changes in the representational resources of the written word? What is our responsibility as such archives become available to students and future citizens, including our role in equipping these people in our care to understand and resist the abuse of the associated technologies of machine--reading in its aggressive forms: data-mining for commercial and political exploitation and its infringements of privacy, for example?
The data-driven study of very large collections of electronic text, assisted by the machine reading capacities of computers, or corpus linguistics as it has become known, has transformed understanding of core domains of language study, and even of the concepts …
Questia, a part of Gale, Cengage Learning. www.questia.com
Publication information: Article title: Behind the Word Clouds: Electronic Text, Machine Reading and Corpus Linguistics: Tim Shortis Argues That Corpus Linguistics Is Changing Knowledge about Language, and Explains the Theory Behind It and Its Potential for the Classroom. Contributors: Shortis, Tim - Author. Magazine title: English Drama Media. Issue: 15 Publication date: October 2009. Page number: 25+. © 2008 National Association for the Teaching of English. COPYRIGHT 2009 Gale Group.
This material is protected by copyright and, with the exception of fair use, may not be further copied, distributed or transmitted in any form or by any means.