Morphological Analyzer of
Minangkabau Derivational
Affixes : Theoretical Approach
By
Nur Rosita
1003-14-021004
COMPANY CONFIDENTIAL
Abstract
Austronesian
MalayoPolynesian
Western MalayoPolinesian
Minangkabau
I . Introduction
1. Background of the Study
2. Aims and Objectives
This paper has attempt to generally
introduce Minangkabau language in relation
to its derivational affixes
and provide
theoretical base in developing morphological
analyzer for its derivatinoal affixes
3. Material and Method
Material
1. Primary source : source texts provided online such as books,
journals, and articles
2. Secondary source : conversation with friends and feed back
with the writer as a native speaker
Method
1. Collecting data
2. Formulating Minangkabau derivational affixes
3. Constructing finite state automation of Minangkabau
derivational affixes
4. Scope and Limitation
The scope of the study is about derivational affixes
in Minangkabau language. It is limited to the
theoretical approach and framework for developing
morphological analyzer of Minangkabau derivational
affixes to be applicable in any further research
related to computational linguistics core of natural
language processing
5. Significance of the Study
This study can hopefully be essential as the
theoretical framework to be applied practically in
constructing
morphological
analyzer
for
Minangkabau language
6. Definition of Key Terms
1. Minangkabau language : a native language of
Minangkabaunese who mainly live in West Sumatera,
Indonesia
2. Morphological Analyzer : a program for analyzing the
morphology of an input word by giving the output, it
detects morphemes of any text.
3. Derivational Affixes : an affix by means of which
one word is formed (derived) from another that is
often of a different word class from the original
II. Review of Related
Literature
1. Minangkabau Language
Speaker : around 9 million
speakers
Dialect : Padang Panjang, Agam,
Jamee, BatusangkarPariangan, KerinciMinangkabau, Orang Mamak,
Payokumbuah, Muko-Muko,
Penghulu, Si Junjung,
Singkarak, Tanah, Ulu,
Agam-Tanah Datar.
Word Order : SVO
2. Morphological Analyzer
1. Turing Test Model of
algorithmic computation
was introduced in 1936
2. bottom-up affixstripping approach for
Ancient Greek by
Packard in 1973
3. KIMMO based in
Koskenniemis models in
1983
4. 20 s century: Turkish,
Indian Languages,
Korean, Japan, Chinese,
Arabic, French, German,
Indonesia
In conclusion : A morphological analyzer is the automated
implementation of human ability to analyze a language which
always returns a morpheme with the suffix associated with
it. It segments the word into morphemes. A Morphological
Analyzer is the computational implementation of human
ability to analyze a language. It is a computer program that
analyses words belong to Natural Languages and produces its
grammatical structure as output. The computer takes word
as an input and analyses it using the given resources and
algorithm.
3. Derivational Morphology
Jurafsky and Martin (2007) stated that derivational morphology is
the combination of a word stem with a grammatical morpheme,
usually resulting in a word of a different class with a meaning hard to
predict exactly
Verhaar (1999:107) classifies affixes into four. They are (a) prefix;
it is added in the beginning of word or base. (b) Suffix; it is added in
the final position of word or base. (c) Infix; it is inserted into the
word or base. (d) Confix (also called simulfix, ambifix, or sirkumfix);
it is added in the beginning and the final position of the words.
Chaer (2003:177-195) says that in affixation process there are
three elements that are involved; the base form, affixes, and
grammatical meaning resulted.
Continue
Minangkabau language
Hoa Nio (1979) says that there are two morphological processes in the
Minangkabau language: affixation and reduplication.
Moussay (1998:66-68) there are 24 prefixes in the Minangkabau language; ba-,
bar-, di-, ka-, maN-, pa-, paN-, par-, sa-, ta-, tar-, baka-, baku-, bapa-, bapar-,
basi-, dipa-, dipar-, mampa-, mampar-, mampasi-, tapa-, tapar-, tasi -.For
example in the wordsba-salang, bar-anak, di-agiah, ka-andak.In addition
there are also 5 suffixes in the Minangkabaunesse. They are an, -i, -kan, -lah,
-nyo. For example arebuai-an, sakik-i, kalah-kan, io-lah, elok-nyo. There are
also 50 confixes in the Minangkabauness. While there are 2 infixes in the
Minangkabu
language;-am-,
-um-.For
instancePamuncakcomes
frompuncak,turun tumuruncomes from turun.
Derivational Affixes Formulation
Verb to Noun
Makan : makan + an : makanan
Eat food
2. Noun to Verb
Pakaian : ba + pakaian : bapakaian
Clothes
wear
3. Adjective to Noun
Sadiah : ka + sadiah + an : kasadiahan
Sad
sadness
4. Adjective to Verb (active)
Gadang : mampa + gadang : mampagadang
Big
make bigger
5. Adjective to verb (passive)
Ketek : dipa + ketek : dipaketek
Small
become smaller
1.
Continue
Continue
In conclusion, Derivational affixes are prefixes and suffixes
used to create a new word which does not change the function
of the word but only change the grammatical categories, as
opposed to inflectional affixes, which are used to inflect an
existing word.
III. Data Presentation
and Conclusion
It uses positional tagset with 3 different morphological tags and a special
lemma tag that directly follows lemma.
Ph. wakmambantunyo (ph. I help
him/her)
Lemma position
Awak<p>_PS1
+
inyo<p>_PS3
proclitic
enclitic
mam+bantu<v>+_VSA
morphological
I
him/her
tag
help
Continue
ph.Wakmangirimannyo
(ph. I deliver him/her)
Yields
Awak<p>_PS1+maN+kirim<v>+kan_VSA+inyo<p>_PS3
Continue
In some derivational case, the lemma lexical category can be different than the
lexical category of the whole surface form, as shown in the example below:
v.
kirim
(v. deliver)
yields
kirim<v>_VSA
v.
mangirim
(v. deliver)
yields
maN+kirim<v>_VSA
n.
kiriman
(n. package)
yields
kirim<v>+an_NSD
n.
pangiriman
(n. delivery)
yields
paN+kirim<v>+an_NSD
continue
Below given the plural surface word form. There are also several
special plural cases using infix, which hardly coded in the
dictionary:
n.
gerigi
(n. teeth) yields
gerigi<n>_NPD
n.
gigi-gigi
(n. teeth) yields
gigi<n>_NPD
Below is given the example of numeral-noun agreement:
n.
2 kaco
(n. 2 mirrors)
yields
2<c>_CC- mirror<n>_NSD
(lit n. *2 mitrror)
duo<c>_CCn.
duo kaco
(n. two mirrors)
yields
kaco<n>_NSD>
(lit n. *two mirror)
n.
Kaco-kaco
(n.mirrors)
yields
kaco<n>NPD
n.
*2 kaco-kaco
(lit n. 2 mirrors)
yields
2<c>_CC- kaco<n>_NPS
Below given the example of numeral alternation:
num.
(num. 2)
yields
2<c>_CC-
num.
duo
(num. two)
yields
duo<c>_CC-
num.
ke-2
(num. second)
yields
ke+2<c>_CO-
num.
kaduo
(num. second)
yields
ka+duo<c>_CO-
IV. Summary and Conclusion
Summary :
It can be concluded that the analysis of Minangkabau language as
one of local languages in Indonesia is till deficient in the studies of
computational linguistics. From this brief theoretical review, the
wider and deeper morphological analyzer will be possible to be done
with a good selection of the lexical entries by choosing the most
frequent and productive lemma providing output with morphemic
segmentation, lemma morpheme position, lexical category, and
morphological feature.
Thank you for your attention