MULTEXT-East Morphosyntactic Specifications

2.3. Attributes and values

Up: 2. Common MULTEXT Specifications Previous: 2.2. Categories Next: 2.4. Attribute Index

Table of contents

The common MULTEXT-East tables of attribute values are given for all categories above and have a rigid structure, which makes them suitable for automatically verifying the conformance of a particular morphosyntactic description with the tables, or for expanding a morphosyntactic description into its more verbose form.

This formal part is given as a table, having the following columns:

  1. Position gives the position of the attribute in the string of the morphosyntactic description;
  2. Attribute gives the name of the attribute;
  3. Value gives the name and one-letter code of the attribute-value;
  4. a column for each of the languages. For easier comparison between them they have been grouped by language family.

The language columns define (by marking with 'x'), in the first line, whether the category is used by the language, and in subsequent lines, which attribute-values a particular language uses.

2.3.1. Noun

Common specifications for Noun
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYNounNenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typecommoncenroplcsskslsl-rozajhbssr-torruukmkbgsqfaethuceka
properpenroplcsskslsl-rozajhbssr-torruukmkbgsqfaethuceka
gerundgplka
2Gendermasculinemenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsq
femininefenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsq
neuternenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsq
commoncruuk
3Numbersingularsenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
pluralpenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
dualdcsslsl-rozajbg-dam
counttmkbg
collectivelsl-rozaj
4Casenominativenplcsskslsl-rozajhbssr-torruukmkbgbg-damsqethuceka
genitivegplcsskslsl-rozajhbssr-torruukbg-damsqfaethuceka
dativedplcsskslsl-rozajhbssr-torruukbg-damsqhuceka
accusativeaplcsskslsl-rozajhbssr-torruukbg-damsqhu
vocativevroplcsskhbssr-torruukmkbgbg-damfaka
locativelplcsskslsl-rozajhbssr-torruukbg-damce
instrumentaliplcsskslsl-rozajhbssr-torruukbg-damhuceka
directrro
obliqueoromkbg-dam
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethuce
adessive3ethu
ablativebsqethu
translative4et
terminative9ethu
essivewethuka
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
ergativezceka
lativejce
comparison8ce
5Definitenessnonromkbgsqfa
yesyromkbgsqfa
short-artsbg
full-artfbg
proximalpmk
distaldmk
6Cliticnonroka
yesyroka
7Animatenonplcsskslsl-rozajhbssr-torruukbg-damka
yesyplcsskslsl-rozajhbssr-torruukbg-damka
8Owner_Numbersingularshu
pluralphu
9Owner_Personfirst1hu
second2hu
third3hu
10Owned_Numbersingularshu
pluralphu
11Case2partitivepru
locativelru
12Humannonpl
yesypl
13Aspectprogressiveppl
perfectiveepl
14Negationnonpl
yesypl
15Classbubce
vuvce
dudce
jujce
16Articlet-formtsr-tor
v-formvsr-tor
n-formnsr-tor
2.3.1.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.
  • In the Macedonian case system the value 'oblique' conflates archaic forms of 'genitive', 'dative' and 'accusative'.

2.3.2. Verb

Common specifications for Verb
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYVerbVenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typemainmenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
auxiliaryaenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
modaloenrocssksl-rozajmksqfaet
copulacrocssksl-rozajhbssr-torfa
baseben
lightlfa
2VFormindicativeienroplcssksl-rozajruukmkbgbg-damsqfaethuceka
subjunctivesrosl-rozajsqfaka
imperativemroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
conditionalcenplcsskslrubg-damethuka
infinitivenenroplcsskslsl-rozajhbssr-torruukbg-damethuce
participlepenrocsskslsl-rozajhbssr-torrubgbg-damsqfaetce
gerundgroplruukbgetce
supineuslsl-rozajet
transgressivetcssk
quotativeqet
impersonalopluk
presentrslhbssr-tor
futurefslhbssr-tor
interrogativevce
realistic_conditionalkce
unrealistic_conditionalhce
causativezceka
potentialxce
aoristahbssr-tor
imperfectehbssr-tor
admirativedsq
optativeysq
3Tensepresentpenroplcssksl-rozajruukmkbgbg-damsqfaethuceka
imperfectirosl-rozajmkbgbg-damsqetceka
futurefplcssksl-rozajruukceka
pastsenroplcssksl-rozajruukbgfaethu
pluperfectlroceka
aoristasl-rozajmkbgbg-damsqka
recent_pastrce
evident_pastece
perfective_pasttce
compoundcmk
perfectnka
4Personfirst1enroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethu
second2enroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethu
third3enroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethu
5Numbersingularsenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethu
pluralpenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethu
dualdslsl-rozajbg-dam
collectivelsl-rozaj
6Gendermasculinemroplcsskslsl-rozajhbssr-torruukmkbg
femininefroplcsskslsl-rozajhbssr-torruukmkbg
neuternroplcsskslsl-rozajhbssr-torruukmkbg
7Voiceactiveacssl-rozajrubgsqetka
passivepcssl-rozajrubgsqetka
medialmru
autoactivecka
inactiveika
mediopassivedka
8Negativenoncsskslsl-rozajhbssr-tormkfaet
yesycsskslsl-rozajhbssr-tormkfaet
9Definitenessnonbgsqhu
yesybghu
short-artsplrubg
full-artfplrubg
1s2s2hu
1sd0sq
1sd3sa1sq
1sd3pa3sq
3sd4sq
3sd3sa5sq
1pd6sq
3pd7sq
3sa8sq
3pd3sa9sq
10Cliticnonroplfaka
yesyroplfaka
agglutinantapl
demandingdpl
11Casenominativenru
genitivegru
dativedru
accusativearu
locativelru
instrumentaliru
illativexet
inessive2et
elativeeet
translative4et
abessive5et
12Animatenoncs
yesycs
13Clitic_snoncs
yesycs
14Aspectprogressivepplskslsl-rozajruukmkfaka
perfectiveeplskslsl-rozajruukmkbg-damka
biaspectualbslruukmkce
ambivalentask
iterativerce
semelfactivefce
imperfectiveibg-dam
15Courtesynonsl-rozajfa
yesysl-rozajfa
16Transitivenonfa
yesyfa
17Humannonpl
yesypl
18Classbubce
vuvce
dudce
jujce
19Subject_Personfirst1ka
second2ka
third3ka
20Direct_Object_Personfirst1ka
second2ka
third3ka
21Indirect_Object_Personfirst1ka
second2ka
third3ka
22Subject_Numbersingularska
pluralpka
23Direct_Object_Numbersingularska
pluralpka
24Indirect_Object_Numbersingularska
pluralpka
25Subject_Casenominativenka
ergativezka
dativedka
26Direct_Object_Casenominativenka
dativedka
27Indirect_Object_Casedativedka

2.3.3. Adjective

Common specifications for Adjective
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYAdjectiveAenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typequalificativefenroplcssksl-rozajruukmkfahu
indefinitei
possessivescsskslsl-rozajhbssr-torrumk
ordinalosl-rozajukmk
participlepplslhbssr-torukmkka
generalgslhbssr-tormksqka
preposedrsq
2Degreepositivepenroplcsskslsl-rozajhbssr-torruukmkfaethuceka
comparativecenroplcsskslsl-rozajhbssr-torruukmkfaethuceka
superlativesenroplcsskslsl-rozajhbssr-torruukmkfaethuka
elativeesl-rozaj
diminutivedsl-rozajka
3Gendermasculinemroplcsskslsl-rozajhbssr-torruukmkbgbg-damsq
femininefroplcsskslsl-rozajhbssr-torruukmkbgbg-damsq
neuternroplcsskslsl-rozajhbssr-torruukmkbgbg-dam
commoncuk
4Numbersingularsroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqethuka
pluralproplcsskslsl-rozajhbssr-torruukmkbgbg-damsqethuka
dualdcsslsl-rozajbg-dam
collectivelsl-rozaj
5Casenominativenplcsskslsl-rozajhbssr-torruukbg-damsqethuceka
genitivegplcsskslsl-rozajhbssr-torruukbg-damsqfaethuka
dativedplcsskslsl-rozajhbssr-torruukbg-damsqhuka
accusativeaplcsskslsl-rozajhbssr-torruukbg-damsqhu
vocativevrocsskhbssr-torbg-damka
locativelplcsskslsl-rozajhbssr-torruukbg-dam
instrumentaliplcsskslsl-rozajhbssr-torruukbg-damhuka
directrro
obliqueorobg-dam
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethu
adessive3ethu
ablativebsqethu
translative4et
terminative9ethu
essivewethuka
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
other6ce
ergativezka
6Definitenessnonroslsl-rozajhbssr-tormkbgbg-damsqfa
yesyroslsl-rozajhbssr-tormkbgbg-damsqfa
short-artsplruukbg
full-artfplruukbg
proximalpmk
distaldmk
7Cliticnonroka
yesyroka
8Animatenonplcssksl-rozajhbssr-toruk
yesyplcssksl-rozajhbssr-toruk
9Formationnominalncs
compoundccs
10Owner_Numbersingularshu
pluralphu
11Owner_Personfirst1hu
second2hu
third3hu
12Owned_Numbersingularshu
pluralphu
13Aspectprogressiveppluk
perfectiveepluk
biaspectualbuk
14Voiceactiveapluk
passiveppluk
15Tensepresentpuk
pastsuk
16Humannonpl
yesypl
17Negationnonpl
yesypl
18Classbubce
vuvce
dudce
jujce
19Articulationarticulatedasq
unarticulatedusq
20Articlet-formtsr-tor
v-formvsr-tor
n-formnsr-tor
2.3.3.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.
  • For Macedonian, the definiteness attributes can take the values: non definite (no), generally definite (yes), definite at short visible distance (proximal), and definite at longer visible distance (distal).

2.3.4. Pronoun

Common specifications for Pronoun
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYPronounPenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typepersonalpenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
demonstrativedroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
indefiniteiroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
possessivesenroplcsskslsl-rozajhbssr-torruukbgsqethuceka
interrogativeqenplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
relativerenplcsslsl-rozajhbssr-torruukmkbgbg-damsqethuka
exclamativee
reflexivexenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuce
reciprocalyfaethuka
negativezroplcsskslsl-rozajruukmkbgbg-damceka
generalgenplcsskslsl-rozajukmkbg
int-relwro
determinalmetka
ex-thereten
nonspecificnru
emphatichuk
definitefce
2Personfirst1enroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
second2enroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
third3enroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
3Gendermasculinemenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsq
femininefenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsq
neuternenroplcsskslsl-rozajhbssr-torruukmkbgbg-dam
4Numbersingularsenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
pluralpenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
dualdcsslsl-rozajbg-dam
paucalc
collectivelsl-rozaj
5Casenominativenenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqethuceka
genitivegroplcsskslsl-rozajhbssr-torruukbg-damsqfaethuceka
dativedroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqhuceka
accusativeaenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfahu
vocativevroskhbssr-torruka
locativelplcsskslsl-rozajhbssr-torruukbg-damce
instrumentaliplcsskslsl-rozajhbssr-torruukbg-damhuceka
directrro
obliqueorobg-dam
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethuce
adessive3ethu
ablativebsqethu
translative4et
terminative9ethu
essivewethuka
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
ergativezceka
lativejce
comparison8ce
6Owner_Numbersingularsenrocsskslsl-rozajsqhu
pluralpenrocsskslsl-rozajsqhu
dualdslsl-rozaj
7Owner_Gendermasculinemencsslsl-rozajsq
femininefencsslsl-rozajsq
neuterncsslsl-rozajsq
8Cliticnonroplcssksl-rozajmkbgsqfaka
yesyroplcsskslsl-rozajmkbgsqfaka
boundbslsl-rozajsq
agglutinantapl
9Referent_Typepersonalpplcssksl-rozajbg
possessivesplcssksl-rozajukbg
attributiveabg
quantitativeqbg
10Syntactic_Typenominalnplcssksl-rozajruuksq
adjectivalaplcssksl-rozajruuksq
adverbialrplruuk
11Definitenessnonsl-rozajmkbgsq
yesysl-rozajmkbgsq
short-artsplbg
full-artfplbg
proximalpmksq
distaldmksq
12Animatenonplcssksl-rozajhbssr-torruuk
yesyplcssksl-rozajhbssr-torruuk
13Clitic_syesycs
noncs
14Pronoun_Formstrongsrosq
weakwrosq
15Owner_Personfirst1hu
second2hu
third3hu
16Owned_Numbersingularshu
pluralphu
17Wh_Typerelativeren
questionqen
18Courtesynonfa
yesyfa
19Humannonpl
yesypl
20Inclusioninclusiveice
exclusiveece
21Articlet-formtsr-tor
v-formvsr-tor
n-formnsr-tor
2.3.4.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.
  • For Macedonian, the definiteness attributes can take the values: non definite (no), generally definite (yes), definite at short visible distance (proximal), and definite at longer visible distance (distal).

2.3.5. Determiner

Common specifications for Determiner
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYDeterminerDenrofa
1Typedemonstrativedenrofa
indefiniteienrofa
possessivesenro
interrogativeqfa
relativer
exclamativeefa
articleafa
generalgen
int-relwro
negativezro
emphatichro
exceptionalxfa
2Personfirst1enro
second2enro
third3enro
3Gendermasculinemro
femininefro
neuternro
4Numbersingularsenrofa
pluralpenrofa
5Casedirectrro
obliqueoro
6Owner_Numbersingularsenro
pluralpenro
7Owner_Gendermasculinemen
femininefen
neuternen
8Cliticnonro
yesyro
9Modific_Typeprenominero
postnominoro
10Wh_Typerelativeren
questionqen

Notes: In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.6. Article

Common specifications for Article
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYArticleTrosl-rozajsqhu
1Typedefinitefrosl-rozajhu
indefiniteirosl-rozajsqhu
possessivesrosq
demonstrativedro
nominalnsq
adjectivalasq
numericalmsq
pronominalpsq
2Gendermasculinemrosl-rozaj
femininefrosl-rozaj
neuternrosl-rozaj
3Numbersingularsrosl-rozaj
pluralprosl-rozaj
dualdsl-rozaj
collectivelsl-rozaj
4Casenominativensl-rozaj
genitivegsl-rozaj
dativedsl-rozaj
accusativeasl-rozaj
locativelsl-rozaj
instrumentalisl-rozaj
directrro
obliqueoro
5Cliticnonro
yesyro
6Animatenonsl-rozaj
yesysl-rozaj

Notes: In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.7. Adverb

Common specifications for Adverb
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYAdverbRenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typegeneralgrocsslsl-rozajhbssr-tormkbghu
particleprohu
causalohuka
negativezro
adjectivalamkbg
verbalvmkhu
modifiermenrosqhuka
specifiersensqka
int-relwro
portmanteaucro
interrogativeqsqhuka
participlerslhbssr-tor
modaldmk
locallka
temporaltka
quantitativeuka
relativeeka
2Degreepositivepenroplcsskslsl-rozajhbssr-torruukmksqfahu
comparativecenroplcsskslsl-rozajhbssr-torruukmkfahu
superlativesenroplcsskslsl-rozajhbssr-torruukmksqhu
elativeesl-rozaj
3Cliticnonroplhuka
yesyroplhuka
agglutinantapl
burkinostkaupl
4Numbersingularshu
pluralphu
5Personfirst1hu
second2hu
third3hu
6Wh_Typerelativeren
questionqen
7Casegenitivegfa

2.3.8. Adposition

Common specifications for Adposition
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYAdpositionSenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typeprepositionpenroplcssksl-rozajruukmkbgsqfaet
postpositiontenfaethuceka
2Formationsimplesrocssksl-rozajruukmkfa
compoundcrocssksl-rozajruukmkfa
3Casenominativenslsl-rozajsqka
genitivegroplcsskslsl-rozajhbssr-torruukbg-damka
dativedroplcsskslsl-rozajhbssr-torruukbg-damka
accusativearoplcsskslsl-rozajhbssr-torruukbg-damsq
locativelplcsskslsl-rozajhbssr-torruukbg-dam
instrumentaliplcsskslsl-rozajhbssr-torruukbg-damka
ablativebsq
essivewka
ergativezka
vocativevka
4Cliticnonroka
yesyroka

2.3.9. Conjunction

Common specifications for Conjunction
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYConjunctionCenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typecoordinatingcenrocsskslsl-rozajhbssr-torruukmkbgsqfaethuka
subordinatingsenrocsskslsl-rozajhbssr-torruukmkbgsqfaethuka
portmanteaurro
2Formationsimplesrosl-rozajhbsruukmkbgfahu
compoundcrosl-rozajhbsruukmkbgfahu
3Coord_Typesimplesro
repetitrro
correlatcro
sentencepruhu
wordswruhu
initialien
non-initialnen
4Sub_Typenegativezroru
positiveproru
5Cliticnonro
yesyro
6Numbersingularscs
pluralpcs
7Personfirst1cs
second2cs
third3cs

2.3.10. Numeral

Common specifications for Numeral
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYNumeralMenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typecardinalcenroplcsskslsl-rozajhbssr-torruukmkbgsqfaethuceka
ordinaloenroplcsskslsl-rozajhbssr-torruukbgfaethuceka
fractalfrosqfahuka
multiplemrocsskhbsrusqka
collectlroplrusq
specialscsskslhbssr-tormk
ordinal2rfa
pronominalpslmksq
approximativeaka
2Gendermasculinemroplcsskslsl-rozajhbssr-torruukmkbgsq
femininefroplcsskslsl-rozajhbssr-torruukmkbgsq
neuternroplcsskslsl-rozajhbssr-torruukmkbg
3Numbersingularsroplcsskslsl-rozajhbssr-torruukbgsqethuceka
pluralproplcsskslsl-rozajhbssr-torruukbgsqethuceka
dualdcsslsl-rozaj
collectivelsl-rozaj
4Casenominativenplcsskslsl-rozajhbssr-torruuksqethuceka
genitivegplcsskslsl-rozajhbssr-torruuksqfaethuceka
dativedplcsskslsl-rozajhbssr-torruuksqhuceka
accusativeaplcsskslsl-rozajhbssr-torruuksqhu
vocativevskhbssr-torka
locativelplcsskslsl-rozajhbssr-torruukce
instrumentaliplcsskslsl-rozajhbssr-torruukhuceka
directrro
obliqueoro
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethuce
adessive3ethu
ablativebsqethu
translative4et
terminative9ethu
essivewethuka
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
multiplicative6hu
ergativezceka
lativejce
comparison8ce
5Formdigitdroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqethuceka
romanrroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqethuceka
letterlroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqethuceka
bothbrosq
m-formmbg
approxabg
alphabeticcbg-damka
6Definitenessnonroslmkbgfa
yesyroslmkbgfa
short-artsbg
full-artfbg
proximalpmk
distaldmk
7Cliticnonroka
yesyroka
8Classdefinitefplcssk
definite11cssk
definite22cs
definite343plcs
definite2344sk
demonstrativedcssk
indefiniteicssk
interrogativeqcssk
relativercs
9Animatenonplcssksl-rozajhbssr-torruuk
yesyplcssksl-rozajhbssr-torruuk
10Owner_Numbersingularshu
pluralphu
11Owner_Personfirst1hu
second2hu
third3hu
12Owned_Numbersingularshu
pluralphu
13Humannonpl
yesypl
14Articlet-formtsr-tor
v-formvsr-tor
n-formnsr-tor

Notes: In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.11. Particle

Common specifications for Particle
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYParticleQroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqceka
1Typenegativezrohbssr-torbgbg-damsq
infinitivenrosq
subjunctivesrosq
aspectaro
futurefro
generalgbgbg-damsq
comparativecbgbg-damsq
verbalvbgsq
interrogativeqhbssr-torbgbg-damsq
modalohbssr-torbgbg-dam
affirmativerhbssr-torsq
definitivedbg-damsq
2Formationsimplesrumkbg
compoundcrumkbg
3Cliticnonropl
yesyropl
agglutinantapl
demandingdpl

2.3.12. Interjection

Common specifications for Interjection
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYInterjectionIenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
1Typemoodmsqhu
otherosqhu
2Formationsimplesrubg
compoundcrubg

2.3.13. Abbreviation

Common specifications for Abbreviation
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYAbbreviationYenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuka
1Syntactic_Typenominalnroruet
verbalvroet
adjectivalaroet
adverbialrroruet
pronominalpro
2Gendermasculinemroru
femininefroru
neuternroru
3Numbersingularsroruet
pluralproruet
paucalcru
4Casenominativenruet
genitivegruet
dativedru
accusativearu
locativelru
instrumentaliru
directrro
obliqueoro
vocativevro
partitive1et
illativexet
inessive2et
elativeeet
allativetet
adessive3et
ablativebet
translative4et
terminative9et
essivewet
abessive5et
komitativeket
aditive7et
5Definitenessyesyro
nonro
2.3.13.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.14. Residual

Common specifications for Residual
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYResidualXenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuce
1Typeforeignfslhbssr-tormksqce
typotslmksq
programpslmksq
webwslhbssr-tormksq
emoeslhbssr-tormksq
hashtaghslhbssr-tormksq
ataslhbssr-tormksq
2.3.14.1. Notes
  • For Slovene the Type attribute has been introduced on Residual, which distinguishes the values of "foreign", to mark a words in a strech of foreign language text, "typo", a mis-typed word, and "program", where the tokenisation program made a mistake. The second, and esp. the third value are useful for hand-annotation of corpora.

2.3.15. Punctuation

Common specifications for Punctuation
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianSerbo-CroatianTorlak dialect of SerbianRussianUkrainianMacedonianBulgarianDamaskiniAlbanianPersianEstonianHungarianChechenGeorgian
0CATEGORYPunctuationZenroplcsskslsl-rozajhbssr-torruukmkbgbg-damsqfaethuceka
2.3.15.1. Notes
  • Due to popular demand (i.e. Vladimir Benko:), punctuation has been introduced into the specifications in Version 5. Even though MULTEXT(-East) specifications and the MSDs are meant to describe the morphosyntax of words rather than act directly as corpus tags, they are nevetheless often used for exactly this purpose. This means that punctuation was tagged in each corpus differently,leading to inconsistences. Therefore, the Z category was introduced.
Up: 2. Common MULTEXT Specifications Previous: 2.2. Categories Next: 2.4. Attribute Index
Date: 2022-06-24
This work is licensed under the Creative Commons Attribution-ShareAlike 4.0 International.