
Proceedings of the 3rd International Conference on Advances in Computing, Control and Networking (ACCN 2015)

"A RULE-BASED SETSWANA VERB LEMMATIZER"

G. A. MALEMA, M. LEFOANE, N. P. MOTLOGELWA
DOI: 10.15224/978-1-63248-082-8-08
Pages: 38-45
Authors: 3
ISBN: 978-1-63248-082-8

Abstract: Lemmatization is a pre-processing stage in several natural language processing applications, such as data retrieval. There have been a few attempts at Setswana word lemmatization, but the lemmatizers developed so far do not show in detail where lemmatization works poorly and reduces performance. This paper presents a detailed rule-based Setswana verb lemmatizer and points out the challenges in verb lemmatization by word category. The overall results show that rule-based Setswana verb lemmatization achieves a good performance of 87%. However, reflexive verbs have a significantly large percentage of exceptions.

Keywords: Setswana, verb lemmatization, rule-based lemmatization.
