Abstract The EtsaTrans machine translation system has been in development at the University of the Free State for the last four years and is currently the only machine translation system being developed in South Africa for specialised and non-general...
Abstract Automatic lemmatisation is a general normalisation procedure in text processing, where all inflected forms of a lexical word are normalised to a single lemma (i.e. a meaningful, uninflected base form from which more complex word forms could...
Wangerin, Walter. 2006. Jesus: 'n roman. Uit Engels vertaal deur George Weideman. Wellington: Lux Verbi.BM 352 p. Prys: R169,95. ISBN: 978 07963 0413 1. In Jesus: 'n roman word die verhaal van Jesus, die seun van God, soos dit in die eerste vier...
This word list is under constant development. Please send any comments, suggestions or additions to Gerhard van Huyssteen. / Hierdle woordelys word voortdurend verdet ontwikkel. Stuur asseblief enige kommentaar, voorstelle of toevoegings aan Gerhard...
Abstract Resources play a crucial role in human language technology (HLT), since research and development are largely dependent on its availability. It follows that effective management of these resources is essential to this domain and there are...
Abstract This article investigates one kind of cross-part-of-speech relation for English, Czech and Zulu lexical resources in the form of semantic networks (wordnets). Many languages have rules whereby new words are derived regularly and productively...
Abstract The Afrikaans wordnet is a lexical-conceptual network in the form of an electronic lexical database, developed at the North-West University. In this article, a methodology for a semi-automatic construction of the entries--so-called synonym...
Willemse, Hein. 2007. Aan die ander kant: swart Afrikaanse skrywers in die Afrikaanse letterkunde. Pretoria: Protea Boekhuis. (Talata-reeks 2.) 245 p. Prys: R150,00. ISBN: 978-1-86919-168-9. In die "Erkennings" wat die teks voorafgaan, meld Willemse...
Weideman, George. 2007. Bloujare se stories. Pretoria: LAPA Uitgewers. 76 p. Prys: R85,00. ISBN: 978-0-7993-3922-2. Pa siedend agter die stuurwiel, vier van ons deurgeskud en dors op die agtersitplek van die ou VW, 'n deurgestampte skokbreker en...
Vir Tertia Visser-Downie How the cry of a bird can stir us ... (...) Into interstices of this world-space, (into which the unbroken bird-cry passes, as people do into dreams -) they drive their wedges, wedges of shrieking. (...)...
Kannemeyer, J.C. 2007. "Uit puur verstrooiing": lesings, praatjies, artikels, onderhoude en besprekings. Stellenbosch: Inset Uitgewers. 459 p. ISBN: 978-0-620-388-52-8. Oor Simon Vestdijk se buitengewoon omvangryke oeuvre skryf A. Roland Holst by...
Aucamp, Hennie. 2007. 'n Vreemdeling op deurtog. Pretoria: Protea Boekhuis. 136 p. Prys: R120,00. ISBN: 978-1-86919-202-0. 'n Vreemdeling op deurtog, Hennie Aucamp se nuutste verhaalbundel is, in sy eie terme, 'n protobiografiese bundel waarin verhale,...
Walters, M.M. 2004. Satan ter sprake. Pretoria: Protea Boekhuis. 83 p. Prys: R79,95. ISBN: 1-86919-051-3. Dit klink effens vreemd om Satan ter sprake te bring: stel jy hom "aan die woord" of "bespreek" jy hom? Altwee hierdie betekenisse is ter sake...
Abstract Working with corpora in the South African Bantu languages has up till now been limited to the utilisation of raw corpora. Such corpora, however, have limited functionality. Thus the next Iogical step in any NLP application is the development...
Abstract Two methods for objectively measuring similarities and dissimilarities between the eleven official languages of South Africa are described. The first concerns the use of n-grams. The confusions between different languages in a text-based...
This special issue of Literator is dedicated to human language technologies (HLT) for South African languages and contains eleven reworked conference papers that were read during special theme sessions at two conferences in South Africa during 2007....
Abstract The development of two cote-technologies for Afrikaans, viz, a hyphenator and a compound analyser is described in this article. As no annotated Afrikaans data existed prior to this project to serve as training data for a machine learning...
Abstract In this article the development of an inflected form generator for Afrikaans is described. Two requirements are set for this inflected form generator, viz. to generate only one specific inflected form of a lemma and to generate all possible...
Abstract The development of a computational morphological analyser for Setswana necessitates the accurate modelling and implementation of, among others, compounding as a word formation process. Compounding is known to be an area of Setswana morphology...
Abstract Lexical transducers utilise a two-level finite-state network to simultaneously code morphological analysis and morphological generation rewrite rules. Multiple extensions following the verb root can be morphologically analysed as a closed...