Comparing English Worldwide: The International Corpus of English

By Sidney Greenbaum | Go to book overview

11
The Survey Parser: Design and Development

ALEX CHENGYU FANG


1. INTRODUCTION

Automatic parsing aims at the decomposition of a sentence into its syntactic constituent structures, so that the relations between words and groups of words are clarified. It is a process deemed essential as a help in understanding the sentential meaning. It is also the first step towards a reversed process whereby a natural language sentence can be automatically constructed according to the specification of abstract semantic meanings. The best example to demonstrate the application of parsing is multi-lingual machine translation. A sentence in Language A is first of all parsed to help to arrive at its semantics, which are then formalized and finally represented as a corresponding sentence in Language B. Success in parsing will represent a major breakthrough in natural language processing. Nearly all the major universities around the world host research teams working on different approaches to parsing. Britain alone boasts research teams in this area at Cambridge, Edinburgh, Leeds, Nottingham, Sheffield, Sussex, and York.

Despite efforts in the past 50 years or so, however, 'the state of the art in parsing general English by computer is but primitive' ( Blacket al., 1993: 2). In 1990-2, three experiments were carried out on eleven rule-based parsers, which subsequently produced a success rate of only 33 per cent on naturally occurring sentences (cf. Blacket al., 1993). The increasingly popular stochastic approach ( Fujisaki, 1984; Garside and Leech, 1985; Atwell, 1988; Briscoe and Carroll, 1991; Fujisakiet al., 1991; Magerman, 1994), despite its advantage over the rule-based approach, suffers from incorrect analyses, especially in the attachment of constituent structures (cf. Briscoe and Carroll, 1991). SPATTER, a probabilistic parser, achieved a 78 per cent crossing-brackets score, 1 and yet only about 35 per cent of the parses exactly matched the human annotations for those sentences ( Magerman, 1994: v). Some systems try to remedy these parsing problems through man-machine interactions, but this usually proves too costly. The TOSCA Parser developed at the University of Nijmegen, Holland, for instance, requires considerable manual pre-editing of the input text in order to reduce parsing times and ambiguities.

-142-

Notes for this page

Add a new note
If you are trying to select text to create highlights or citations, remember that you must now click or tap on the first word, and then click or tap on the last word.
One moment ...
Default project is now your active project.
Project items

Items saved from this book

This book has been saved
Highlights (0)
Some of your highlights are legacy items.

Highlights saved before July 30, 2012 will not be displayed on their respective source pages.

You can easily re-create the highlights by opening the book page or article, selecting the text, and clicking “Highlight.”

Citations (0)
Some of your citations are legacy items.

Any citation created before July 30, 2012 will labeled as a “Cited page.” New citations will be saved as cited passages, pages or articles.

We also added the ability to view new citations from your projects or the book or article where you created them.

Notes (0)
Bookmarks (0)

You have no saved items from this book

Project items include:
  • Saved book/article
  • Highlights
  • Quotes/citations
  • Notes
  • Bookmarks
Notes
Cite this page

Cited page

Style
Citations are available only to our active members.
Sign up now to cite pages or passages in MLA, APA and Chicago citation styles.

(Einhorn, 1992, p. 25)

(Einhorn 25)

1

1. Lois J. Einhorn, Abraham Lincoln, the Orator: Penetrating the Lincoln Legend (Westport, CT: Greenwood Press, 1992), 25, http://www.questia.com/read/27419298.

Cited page

Bookmark this page
Comparing English Worldwide: The International Corpus of English
Table of contents

Table of contents

  • Title Page iii
  • Preface vii
  • Contents ix
  • List of Contributors xi
  • List of Figures xiii
  • List of Tables xv
  • Abbreviations xvi
  • Part I Introduction 1
  • 1: Introducing ICe 3
  • References 12
  • 2: Learner English Around the World 13
  • References 23
  • Part II Compilation and Annotation 25
  • 3: The Design of the Corpus 27
  • References 35
  • 4: Markup Systems 36
  • Notes 45
  • References 45
  • 5: The Umb Intelligent ICe Markup Assistant 54
  • References 64
  • 6: ICe Annotation Tools 65
  • 7: Developing the ICe Corpus Utility Program 79
  • 8: About the ICe Tagset 92
  • 9: Autasys: Grammatical Tagging and Cross-Tagset Mapping 110
  • 10: An Outline of the Survey's ICe Parsing Scheme 125
  • Reference 139
  • 11: The Survey Parser: Design and Development 142
  • References 157
  • Part III Problems of Implementation 161
  • 12: The New Zealand Spoken Component of ICe: Some Methodological Challenges1 163
  • References 177
  • 13: Second-Language Corpora1 182
  • References 195
  • 14: The International Corpus of English in Hong Kong 197
  • References 213
  • Part IV Applications 215
  • 15: The Corpus as A Research Domain 217
  • 16: ICe and Teaching 227
  • 17: The Sociolinguistics of English in Nigeria and the ICe Project 239
  • 18: Why A Fiji Corpus? 249
  • References 260
  • 19: Prosice: A Spoken English Database for Prosody Research 262
  • References 278
  • Index 281
Settings

Settings

Typeface
Text size Smaller Larger Reset View mode
Search within

Search within this book

Look up

Look up a word

  • Dictionary
  • Thesaurus
Please submit a word or phrase above.
Print this page

Print this page

Why can't I print more than one page at a time?

Full screen
/ 290

matching results for page

Cited passage

Style
Citations are available only to our active members.
Sign up now to cite pages or passages in MLA, APA and Chicago citation styles.

"Portraying himself as an honest, ordinary person helped Lincoln identify with his audiences." (Einhorn, 1992, p. 25).

"Portraying himself as an honest, ordinary person helped Lincoln identify with his audiences." (Einhorn 25)

"Portraying himself as an honest, ordinary person helped Lincoln identify with his audiences."1

1. Lois J. Einhorn, Abraham Lincoln, the Orator: Penetrating the Lincoln Legend (Westport, CT: Greenwood Press, 1992), 25, http://www.questia.com/read/27419298.

Cited passage

Welcome to the new Questia Reader

The Questia Reader has been updated to provide you with an even better online reading experience.  It is now 100% Responsive, which means you can read our books and articles on any sized device you wish.  All of your favorite tools like notes, highlights, and citations are still here, but the way you select text has been updated to be easier to use, especially on touchscreen devices.  Here's how:

1. Click or tap the first word you want to select.
2. Click or tap the last word you want to select.

OK, got it!

Thanks for trying Questia!

Please continue trying out our research tools, but please note, full functionality is available only to our active members.

Your work will be lost once you leave this Web page.

For full access in an ad-free environment, sign up now for a FREE, 1-day trial.

Already a member? Log in now.