124x Filetype PDF File size 0.43 MB Source: web.stanford.edu
THE REPRESENTATION OF SETSWANA DOUBLE OBJECTS IN LFG Ansu Berg Rigardt Pretorius Laurette Pretorius North-West University North-West University University of South Africa Proceedings of the LFG13 Conference Miriam Butt and Tracy Holloway King (Editors) 2013 CSLI Publications http://csli-publications.stanford.edu Abstract Setswana is a Bantu language in the south eastern zone of Bantu languages and is one of the eleven official languages of South Africa. The technological development of Setswana includes the development of a parser that covers all the salient characteristics of Setswana morphology and syntax. One such characteristic is the occurrence of two objects, both of which may be represented in the verb by object agreement morphemes. After discussing relevant typological features of Setswana, we focus on the syntactic structure of Setswana sentences with double objects and double object agreement morphemes and on how the implementation of two Setswana objects can be modelled in XLE. 1. Introduction Setswana is a Bantu language in the south eastern zone (zone S in Doke‟s classification) of Bantu languages (Cole, 1959; Guthrie, 1971) and one of the eleven official languages of South Africa. The work reported on in this article forms part of a larger project aimed at the technological development of Setswana. Previous work includes the development of a finite state morphological analyser and tokeniser (see, for example, Pretorius et al., 2010). The present work also forms part of a subproject for developing a computational grammar and parser for Setswana, making use of LFG and XLE (Berg et al., 2012). Under consideration are simple declarative sentences that are in the indicative mood, present tense, positive and have more than one object. More specifically, we ask the question: „How may such sentences and their syntactic structure be modelled in LFG and implemented in XLE?‟ The structure of the article is as follows: Section 1 briefly contextualises and states the research question. Section 2 discusses specific typological features of Setswana that are relevant for addressing the research question. In section 3 we discuss in some detail the occurrence of double object and object agreement morphemes and their modelling with LFG, while the XLE implementation is touched upon in section 4. Section 5 concludes the article. 2. Setswana typological features Setswana is an agglutinative language with a rich system of verbal inflections (Nurse, 2008: 28). Words in sentences are arranged in an SVO order. Nouns in Setswana are classified into 20 noun classes and agreement is prominent in the language. 2.1 Orthography and morphology Verbal prefixes are written disjointly, while verbal suffixes are written conjoined to the verbal root. This disjunctive writing style has significant consequences for tokenisation in that Setswana verbs cannot be tokenised on white space only. Due to the disjunctive orthography the word as unit of morphological description needs further clarification. We follow Kosch (2006), who distinguishes between an orthographic word (a unit which is separated by spaces from other units in the sentence) and a linguistic word (a unit which functions as a member of a word category, such as a noun, pronoun, verb and adverb). As is characteristic of agglutinative languages, Setswana verbal prefixes and suffixes provide essential information regarding type, mood, tense, aspect, and polarity (Cole, 1955:242-267; Krüger, 2006:198-243). Prefixes include negative morphemes, subject agreement morphemes, object agreement morphemes, aspectual morphemes and the temporal morpheme. The most frequently used suffixes include the causative, applicative, reciprocal, perfect and passive. Verbs can also take less frequently used suffixes while they always take a verbal ending (Cole, 1955:192-211; Krüger, 2006:257). Example (1) illustrates both the disjunctive orthography and the agglutinative morphology. The linguistic word (verb) ba tla thusana „they will help each other‟ comprises a verbal root -thus- to which the subject agreement morpheme ba of noun class 2 and the future tense morpheme tla have been prefixed, while the reciprocal suffix -an- and the verbal ending -a are suffixed to this verbal root. (1) ba tla thusana ba-tla-thus-an-a SC2-FUT-help-RECP-VEND „they will help each other‟ In (2) the Setswana sentence only consists of two linguistic words. The two words are the noun basimane „boys‟ and the verb ba tla re thusa „they will help us‟. The verb consists of the subject agreement morpheme ba of noun class 2, the future tense morpheme tla, the object agreement morpheme re of the first person plural, the verbal root thus- „buy‟ and the verbal ending -a. The English equivalent of this sentence consists of five linguistic words. Notice that the determiners „the‟ and „a‟ do not appear in Setswana. (2) Basimane ba tla re thusa. basimane ba tla re thusa boys they will us help ba-simane ba-tla-re-thus-a N2-boys SC2-FUT-OCP1PL-help-VEND „The boys will help us.‟ The verbal prefixes and suffixes of Setswana are integral parts of the morphological structure of verbs and have morphological status in the c- structure of LFG (Bresnan, 2001:150), as shown in the following c-structure (Figure 1): S NP VP N V basimane ba tla re thusa Figure 1: The c-structure for (2) 2.2 Word order Setswana employs the SVOX word order, where „S‟ represents the subject, „V‟ the verb, „O‟ the object and „X‟ the adjuncts (Creissels, 2000:250-252; Watters, 2000:196-205). In a simple transitive sentence in Setswana the subject appears before the verb. The object follows the verb. This is illustrated in (3) where the subject bana „children‟ precedes the verb ba bua „they speak‟ and an object Setswana „Setswana‟ appears post verbally. (3) Bana ba bua Setswana. bana ba bua Setswana children they speak Setswana ba-ana ba-bu-a se-tswana N2-children SC2-speak-VEND N7-tswana „The children speak Setswana.‟ The object Setswana „Setswana‟ in (3) may be replaced by the object agreement morpheme se which acts as an object marker. The object agreement morpheme is placed in the verbal morphology where it is prefixed immediately preceding the verbal root (further explained in section 3.2). The basic word order is then altered as illustrated in (4). This sentence consists of a subject bana „children‟ and a verb ba a se bua „they speak it‟. (4) Bana ba a se bua. bana ba a se bua children they it speak ba-ana ba-a-se-bu-a
no reviews yet
Please Login to review.