Cited page

Citations are available only to our active members. Sign up now to cite pages or passages in MLA, APA and Chicago citation styles.

X X

Cited page

Display options
Reset

Behind the Word Clouds: Electronic Text, Machine Reading and Corpus Linguistics: Tim Shortis Argues That Corpus Linguistics Is Changing Knowledge about Language, and Explains the Theory Behind It and Its Potential for the Classroom

By: Shortis, Tim | English Drama Media, October 2009 | Article details

Look up
Saved work (0)

matching results for page

Why can't I print more than one page at a time?
While we understand printed pages are helpful to our users, this limitation is necessary to help protect our publishers' copyrighted material and prevent its unlawful distribution. We are sorry for any inconvenience.

Behind the Word Clouds: Electronic Text, Machine Reading and Corpus Linguistics: Tim Shortis Argues That Corpus Linguistics Is Changing Knowledge about Language, and Explains the Theory Behind It and Its Potential for the Classroom


Shortis, Tim, English Drama Media


A revolution in knowledge about language

Hyperbole comes easily in the excited discourse around the impact of ICTs and their ongoing penultimate promises. So it is with caution that I am suggesting that there is a quiet revolution going on in what counts as knowledge about language and meaningful reading and that little of this has permeated what is done in school English lessons so far. This may be about to change as we come face to face with ever-larger collections of electronically mediated text and exemplification of new methods for reading them.

The new and ever larger collections of resources are apparent, the means of reading them less so--although recent higher education research may point a way. UK Public Library memberships now offer free home access to the full Oxford English Dictionary (www.oed.com) along with digital archives of contemporary and historical newspapers. Agencies such as the National Archive, the Old Bailey, JISC and The British Library have all put significant collections of searchable text online. Some of these are plain text, some facsimiles, and some both. For example, Oxford University/JISC collaboration's magnificent World War 1 collection (http:/ /www.oucs.ox.ac.uk/ww1lit/) offers 5,000 textual artefacts mainly in facsimile form but with searchable words-only transcripts. All these collections involve engaging with a different order of textual scale and will require different kinds of literacy to being curled up in your chair reading a book under the light of an Anglepoise, although that will of course, remain important. The question then is how are we, as English teachers, as a professional community of practice specializing in the learning of literacy, to respond to these changes in the representational resources of the written word? What is our responsibility as such archives become available to students and future citizens, including our role in equipping these people in our care to understand and resist the abuse of the associated technologies of machine--reading in its aggressive forms: data-mining for commercial and political exploitation and its infringements of privacy, for example?

The data-driven study of very large collections of electronic text, assisted by the machine reading capacities of computers, or corpus linguistics as it has become known, has transformed understanding of core domains of language study, and even of the concepts …

The rest of this article is only available to active members of Questia

Sign up now for a free, 1-day trial and receive full access to:

  • Questia's entire collection
  • Automatic bibliography creation
  • More helpful research tools like notes, citations, and highlights
  • Ad-free environment

Already a member? Log in now.

Select text to:

Select text to:

  • Highlight
  • Cite a passage
  • Look up a word
Learn more Close
Loading One moment ...
Highlight
Select color
Change color
Delete highlight
Cite this passage
Cite this highlight
View citation

Are you sure you want to delete this highlight?