Up: 3.16. Hungarian Specifications Next: 3.16.2. Hungarian Category Index
Table of contents
Without handling derivation a satisfactory morphological analysis is not possible for Hungarian. The HUMOR system, a general purpose morphological analyzer, handles it and the results can be converted to Multext format since on syntactic level the morphological origin of stems are generally irrelevant. The resulting word class is defined by the rightmost derivational suffix. The suffix characters are literally attached to the word.
Here are the derivations that the analyser recognizes but instead of the origin we place only the resulting class to the output. (Suffix tags used in HUMOR are in upper case, actual suffixes are in lower case.)
Type | Form | Example | Gloss |
DIM | cska, acska, ecske, öcske, ocska | utcá+cska | little_street |
FEM | né | Kovács+né | Mrs._Kovács |
Type | Form | Example | Gloss |
IF | ás, és | olvas+ás | read+ing_(gerund) |
DES | hatnék, hetnék | olvas+hatnék | the_intention_of_reading |
Type | Form | Example | Gloss |
FAK | ít | szép+ít | make_it_pretty_(in_compounds_only) |
MI | od, ed, öd | vállas+od+ik | becomes_strong |
MIGY | kod, ked, köd | okos+kod+ik | plays_the_smart_(frequently) |
If 'szemetelés' (littering, action of throwing away litter) is not in the dictionary we derive it from the verb 'litter' szemetel[V] + és[IF] (where IF=Verb2Noun).
Instead of giving the extra attribute to the verb expressing that it has a derivational suffix we simply give the result of the analysis+conversion: szemetelés[N].
In Hungarian some derivation may follow the inflectional suffix. For these derivations the suffix+derivation together forms a compound derivation. Then a new stem is generated from the stem + inflection + derivation segments and the resulting part of speech is determined by the derivation.
Type | Form | Example | Gloss |
Nc-sn--ns1-+FAM | ék | apá+m+ék | some_people_with_my_father |
Nc-su--u---+IKEP | i | asztal+onként+i | sg._done_by_each_every_table |
Afc-sn--n----+KIEM | ik | nagy+obb+ik | the_bigger_one |
There are adverbs that may get case endings. Since case inflections derive adverbs from nouns these constructions can be handled as derivations. That means the stem is the stem + inflection combination and the part of speech is adverb.
Compounding is handled in a very similar way to derivation. The rightmost word class is always the resulting one. If it contains some derivation as well then the result is the word class that the derivation determines.
Up: 3.16. Hungarian Specifications Next: 3.16.2. Hungarian Category Index