EBSCOHost
Request Free Trial
Subscribers

African Journal of Marine Science

Southern African Linguistics and Applied Language Studies 2003, 21 (4) : 189–201
© NISC (pty) Ltd. www.nisc.co.za

Spoken language corpora for the nine official African languages of South Africa

<< Previous Abstract :: Next Abstract >>

Authors: J Allwood1 and AP Hendrikse2

1 Department of Linguistics, University of Gothenburg, Renströmparken, S-312 98 Gothenburg, Sweden
2 Department of Linguistics, University of South Africa, PO Box 392, Pretoria 0003, South Africa

Corresponding Author: AP Hendrikse (E-mail: hendrap@unisa.ac.za)


Abstract:
In this paper we give an outline of a corpus planning project which aims to develop linguistic resources for the nine official African languages of South Africa in the form of corpora, more specifically spoken language corpora. In the course of the article, we will address issues such as spoken language vs. written language, register vs. activity and normative vs. non-normative approaches to corpus planning. We then give an outline of the design of a spoken language corpus for the nine official African languages of South Africa. We consider issues such as representativity and sampling (urban–rural, dialects, gender, social class and activities), transcription standards and conventions as well as the problems emanating from widespread loans and code switching and other forms of language mix characteristic of spoken language. Finally, we summarise the status of the project at present and plans for the future.

Link to fulltext on Ingenta: Full Text on Ingenta

Newsletter
View available jobs at NISC
Jobs
Login to download your reprints
Authors
Print this page
Print
E-mail this page to someone
E-mail
NISC Databases
NISC Journals
NISC Books