MULTEXT-East Morphosyntactic Specifications, Version 5 (draft)

2.3. Attributes and values

Up: 2. Common MULTEXT Specifications Previous: 2.2. Categories Next: 2.4. Attribute Index

Table of contents

The common MULTEXT-East tables of attribute values are given for all categories above and have a rigid structure, which makes them suitable for automatically verifying the conformance of a particular morphosyntactic description with the tables, or for expanding a morphosyntactic description into its more verbose form.

This formal part is given as a table, having the following columns:

  1. Position gives the position of the attribute in the string of the morphosyntactic description;
  2. Attribute gives the name of the attribute;
  3. Value gives the name and one-letter code of the attribute-value;
  4. a column for each of the languages. For easier comparison between them they have been grouped by language family: first English as the 'hub' language, then Romance (Romanian), followed by Slavic (Slovene, Czech, Bulgarian), and finaly Finno-Ungric (Estonian, Hungarian). Croatian, Serbian and Resian have been added later, and not yet properly grouped.

The language columns define (by marking with 'x'), in the first line, whether the category is used by the language, and in subsequent lines, which attribute-values a particular language uses.

2.3.1. Noun

Common specifications for Noun
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYNounNenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typecommoncenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
properpenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
gerundgpl
2Gendermasculinemenroplcsskslsl-rozajhrsrbsruukmkbg
femininefenroplcsskslsl-rozajhrsrbsruukmkbg
neuternenroplcsskslsl-rozajhrsrbsruukmkbg
commoncruuk
3Numbersingularsenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
pluralpenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
dualdcsslsl-rozaj
counttsrmkbg
collectivelsl-rozaj
4Casenominativenplcsskslsl-rozajhrsrbsruukmkbgethuce
genitivegplcsskslsl-rozajhrsrbsruukfaethuce
dativedplcsskslsl-rozajhrsrbsruukhuce
accusativeaplcsskslsl-rozajhrsrbsruukhu
vocativevroplcsskhrsrbsruukmkbgfa
locativelplcsskslsl-rozajhrsrbsruukce
instrumentaliplcsskslsl-rozajhrsrbsruukhuce
directrro
obliqueoromk
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethuce
adessive3ethu
ablativebethu
translative4et
terminative9ethu
essivewethu
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
ergativezce
lativejce
comparison8ce
5Definitenessnonromkbgfa
yesyromkbgfa
short-artsbg
full-artfbg
proximalpmk
distaldmk
6Cliticnonro
yesyro
7Animatenonplcsskslsl-rozajhrsrbsruuk
yesyplcsskslsl-rozajhrsrbsruuk
8Owner_Numbersingularshu
pluralphu
9Owner_Personfirst1hu
second2hu
third3hu
10Owned_Numbersingularshu
pluralphu
11Case2partitivepru
locativelru
12Humannonpl
yesypl
13Aspectprogressiveppl
perfectiveepl
14Negationnonpl
yesypl
15Classbubce
vuvce
dudce
jujce
2.3.1.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.
  • In the Macedonian case system the value 'oblique' conflates archaic forms of 'genitive', 'dative' and 'accusative'.

2.3.2. Verb

Common specifications for Verb
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYVerbVenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typemainmenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
auxiliaryaenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
modaloenrocssksl-rozajsrmkfaet
copulacrocssksl-rozajhrsrbsfa
baseben
lightlfa
2VFormindicativeienroplcssksl-rozajsrruukmkbgfaethuce
subjunctivesrosl-rozajfa
imperativemroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
conditionalcenplcsskslsrruethu
infinitivenenroplcsskslsl-rozajhrsrbsruukethuce
participlepenrocsskslsl-rozajhrsrbsrumkbgfaetce
gerundgroplsrruukbgetce
supineuslsl-rozajet
transgressivetcssk
quotativeqet
impersonalopluk
presentrslhrbs
futurefslhrbs
interrogativevce
realistic_conditionalkce
unrealistic_conditionalhce
causativezce
potentialxce
aoristahrbs
imperfectehrbs
3Tensepresentpenroplcssksl-rozajsrruukmkbgfaethuce
imperfectirosl-rozajsrmkbgetce
futurefplcssksl-rozajsrruukce
pastsenroplcssksl-rozajsrruukbgfaethu
pluperfectlrosrce
aoristasl-rozajsrmkbg
recent_pastrce
evident_pastece
perfective_pasttce
4Personfirst1enroplcsskslsl-rozajhrsrbsruukmkbgfaethu
second2enroplcsskslsl-rozajhrsrbsruukmkbgfaethu
third3enroplcsskslsl-rozajhrsrbsruukmkbgfaethu
5Numbersingularsenroplcsskslsl-rozajhrsrbsruukmkbgfaethu
pluralpenroplcsskslsl-rozajhrsrbsruukmkbgfaethu
dualdslsl-rozaj
collectivelsl-rozaj
6Gendermasculinemroplcsskslsl-rozajhrsrbsruukmkbg
femininefroplcsskslsl-rozajhrsrbsruukmkbg
neuternroplcsskslsl-rozajhrsrbsruukmkbg
7Voiceactiveacssl-rozajsrrumkbget
passivepcssl-rozajsrrumkbget
medialmru
8Negativenoncsskslsl-rozajhrsrbsmkfaet
yesycsskslsl-rozajhrsrbsmkfaet
9Definitenessnonbghu
yesybghu
short-artsplrubg
full-artfplrubg
1s2s2hu
10Cliticnonroplsrfa
yesyroplsrfa
agglutinantapl
demandingdpl
11Casenominativenru
genitivegru
dativedru
accusativearu
locativelru
instrumentaliru
illativexet
inessive2et
elativeeet
translative4et
abessive5et
12Animatenoncs
yesycs
13Clitic_snoncs
yesycs
14Aspectprogressivepplskslsl-rozajsrruukmkfa
perfectiveeplskslsl-rozajsrruukmk
biaspectualbslruukce
ambivalentask
iterativerce
semelfactivefce
15Courtesynonsl-rozajfa
yesysl-rozajfa
16Transitivenonfa
yesyfa
17Humannonpl
yesypl
18Classbubce
vuvce
dudce
jujce

2.3.3. Adjective

Common specifications for Adjective
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYAdjectiveAenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typequalificativefenroplcssksl-rozajsrruukmkfahu
indefinitei
possessivescsskslsl-rozajhrsrbsrumk
ordinalosl-rozajsrukmk
participlepplslhrbsuk
generalgslhrbs
2Degreepositivepenroplcsskslsl-rozajhrsrbsruukmkfaethuce
comparativecenroplcsskslsl-rozajhrsrbsruukmkfaethuce
superlativesenroplcsskslsl-rozajhrsrbsruukmkfaethu
elativeesl-rozajsrmk
diminutivedsl-rozaj
3Gendermasculinemroplcsskslsl-rozajhrsrbsruukmkbg
femininefroplcsskslsl-rozajhrsrbsruukmkbg
neuternroplcsskslsl-rozajhrsrbsruukmkbg
commoncuk
4Numbersingularsroplcsskslsl-rozajhrsrbsruukmkbgethu
pluralproplcsskslsl-rozajhrsrbsruukmkbgethu
dualdcsslsl-rozaj
collectivelsl-rozaj
5Casenominativenplcsskslsl-rozajhrsrbsruukethuce
genitivegplcsskslsl-rozajhrsrbsruukfaethu
dativedplcsskslsl-rozajhrsrbsruukhu
accusativeaplcsskslsl-rozajhrsrbsruukhu
vocativevrocsskhrsrbs
locativelplcsskslsl-rozajhrsrbsruuk
instrumentaliplcsskslsl-rozajhrsrbsruukhu
directrro
obliqueoro
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethu
adessive3ethu
ablativebethu
translative4et
terminative9ethu
essivewethu
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
other6ce
6Definitenessnonroslsl-rozajhrsrbsmkbgfa
yesyroslsl-rozajhrsrbsmkbgfa
short-artsplruukbg
full-artfplruukbg
proximalpmk
distaldmk
7Cliticnonro
yesyro
8Animatenonplcssksl-rozajhrsrbsuk
yesyplcssksl-rozajhrsrbsuk
9Formationnominalncs
compoundccs
10Owner_Numbersingularshu
pluralphu
11Owner_Personfirst1hu
second2hu
third3hu
12Owned_Numbersingularshu
pluralphu
13Aspectprogressiveppluk
perfectiveepluk
biaspectualbuk
14Voiceactiveapluk
passiveppluk
15Tensepresentpuk
pastsuk
16Humannonpl
yesypl
17Negationnonpl
yesypl
18Classbubce
vuvce
dudce
jujce
2.3.3.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.
  • For Macedonian, the definiteness attributes can take the values: non definite (no), generally definite (yes), definite at short visible distance (proximal), and definite at longer visible distance (distal).

2.3.4. Pronoun

Common specifications for Pronoun
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYPronounPenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typepersonalpenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
demonstrativedroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
indefiniteiroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
possessivesenroplcsskslsl-rozajhrsrbsruukmkbgethuce
interrogativeqenplcsskslsl-rozajhrsrbsruukmkbgfaethuce
relativerenplcsslsl-rozajhrsrbsruukmkbgethu
exclamativee
reflexivexenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
reciprocalyfaethu
negativezroplcsskslsl-rozajsrruukmkbgce
generalgenplcsskslsl-rozajsrukmkbg
int-relwro
determinalmet
ex-thereten
nonspecificnru
emphatichuk
definitefce
2Personfirst1enroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
second2enroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
third3enroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
3Gendermasculinemenroplcsskslsl-rozajhrsrbsruukmkbg
femininefenroplcsskslsl-rozajhrsrbsruukmkbg
neuternenroplcsskslsl-rozajhrsrbsruukmkbg
4Numbersingularsenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
pluralpenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
dualdcsslsl-rozajsr
paucalcsr
collectivelsl-rozaj
5Casenominativenenroplcsskslsl-rozajhrsrbsruukmkbgethuce
genitivegroplcsskslsl-rozajhrsrbsruukfaethuce
dativedroplcsskslsl-rozajhrsrbsruukmkbghuce
accusativeaenroplcsskslsl-rozajhrsrbsruukmkbgfahu
vocativevroskhrsrbsru
locativelplcsskslsl-rozajhrsrbsruukce
instrumentaliplcsskslsl-rozajhrsrbsruukhuce
directrro
obliqueoro
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethuce
adessive3ethu
ablativebethu
translative4et
terminative9ethu
essivewethu
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
ergativezce
lativejce
comparison8ce
6Owner_Numbersingularsenrocsskslsl-rozajsrmkhu
pluralpenrocsskslsl-rozajsrmkhu
dualdslsl-rozaj
7Owner_Gendermasculinemencsslsl-rozajsrmk
femininefencsslsl-rozajsrmk
neuterncsslsl-rozajsrmk
8Cliticnonroplcssksl-rozajsrmkbgfa
yesyroplcsskslsl-rozajsrmkbgfa
boundbslsl-rozaj
agglutinantapl
9Referent_Typepersonalpplcssksl-rozajbg
possessivesplcssksl-rozajukbg
attributiveabg
quantitativeqbg
10Syntactic_Typenominalnplcssksl-rozajsrruuk
adjectivalaplcssksl-rozajsrruuk
adverbialrplsrruuk
11Definitenessnonsl-rozajmkbg
yesysl-rozajmkbg
short-artsplbg
full-artfplbg
proximalpmk
distaldmk
12Animatenonplcssksl-rozajhrsrbsruuk
yesyplcssksl-rozajhrsrbsruuk
13Clitic_syesycs
noncs
14Pronoun_Formstrongsro
weakwro
15Owner_Personfirst1hu
second2hu
third3hu
16Owned_Numbersingularshu
pluralphu
17Wh_Typerelativeren
questionqen
18Courtesynonfa
yesyfa
19Humannonpl
yesypl
20Inclusioninclusiveice
exclusiveece
2.3.4.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.
  • For Macedonian, the definiteness attributes can take the values: non definite (no), generally definite (yes), definite at short visible distance (proximal), and definite at longer visible distance (distal).

2.3.5. Determiner

Common specifications for Determiner
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYDeterminerDenrofa
1Typedemonstrativedenrofa
indefiniteienrofa
possessivesenro
interrogativeqfa
relativer
exclamativeefa
articleafa
generalgen
int-relwro
negativezro
emphatichro
exceptionalxfa
2Personfirst1enro
second2enro
third3enro
3Gendermasculinemro
femininefro
neuternro
4Numbersingularsenrofa
pluralpenrofa
5Casedirectrro
obliqueoro
6Owner_Numbersingularsenro
pluralpenro
7Owner_Gendermasculinemen
femininefen
neuternen
8Cliticnonro
yesyro
9Modific_Typeprenominero
postnominoro
10Wh_Typerelativeren
questionqen

Notes: In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.6. Article

Common specifications for Article
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYArticleTrosl-rozajhu
1Typedefinitefrosl-rozajhu
indefiniteirosl-rozajhu
possessivesro
demonstrativedro
2Gendermasculinemrosl-rozaj
femininefrosl-rozaj
neuternrosl-rozaj
3Numbersingularsrosl-rozaj
pluralprosl-rozaj
dualdsl-rozaj
collectivelsl-rozaj
4Casenominativensl-rozaj
genitivegsl-rozaj
dativedsl-rozaj
accusativeasl-rozaj
locativelsl-rozaj
instrumentalisl-rozaj
directrro
obliqueoro
5Cliticnonro
yesyro
6Animatenonsl-rozaj
yesysl-rozaj

Notes: In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.7. Adverb

Common specifications for Adverb
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYAdverbRenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typegeneralgrocsslsl-rozajhrsrbsmkbghu
particleprohu
causalohu
negativezrosr
adjectivalasrmkbg
verbalvsrmkhu
modifiermenrohu
specifiersen
int-relwro
portmanteaucro
interrogativeqsrhu
participlerslhrbs
2Degreepositivepenroplcsskslsl-rozajhrsrbsruukmkfahu
comparativecenroplcsskslsl-rozajhrsrbsruukmkfahu
superlativesenroplcsskslsl-rozajhrsrbsruukmkhu
elativeesl-rozajsrmk
3Cliticnonroplhu
yesyroplhu
agglutinantapl
burkinostkaupl
4Numbersingularshu
pluralphu
5Personfirst1hu
second2hu
third3hu
6Wh_Typerelativeren
questionqen
7Casegenitivegfa

2.3.8. Adposition

Common specifications for Adposition
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYAdpositionSenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typeprepositionpenroplcssksl-rozajsrruukmkbgfaet
postpositiontenfaethuce
2Formationsimplesrocssksl-rozajsrruukmkfa
compoundcrocssksl-rozajsrruukmkfa
3Casenominativenslsl-rozaj
genitivegroplcsskslsl-rozajhrsrbsruuk
dativedroplcsskslsl-rozajhrsrbsruuk
accusativearoplcsskslsl-rozajhrsrbsruuk
locativelplcsskslsl-rozajhrsrbsruuk
instrumentaliplcsskslsl-rozajhrsrbsruuk
4Cliticnonro
yesyro

2.3.9. Conjunction

Common specifications for Conjunction
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYConjunctionCenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typecoordinatingcenrocsskslsl-rozajhrsrbsruukmkbgfaethu
subordinatingsenrocsskslsl-rozajhrsrbsruukmkbgfaethu
portmanteaurro
2Formationsimplesrosl-rozajhrsrbsruukmkbgfahu
compoundcrosl-rozajhrsrbsruukmkbgfahu
3Coord_Typesimplesro
repetitrro
correlatcro
sentencepsrruhu
wordswsrruhu
initialien
non-initialnen
4Sub_Typenegativezrosrru
positiveprosrru
5Cliticnonro
yesyro
6Numbersingularscs
pluralpcs
7Personfirst1cs
second2cs
third3cs

2.3.10. Numeral

Common specifications for Numeral
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYNumeralMenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typecardinalcenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
ordinaloenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
fractalfrofahu
multiplemrocsskhrsrbsru
collectlroplsrrumk
specialscsskslhrsrbsmk
ordinal2rfa
pronominalpsl
2Gendermasculinemroplcsskslsl-rozajhrsrbsruukmkbg
femininefroplcsskslsl-rozajhrsrbsruukmkbg
neuternroplcsskslsl-rozajhrsrbsruukmkbg
3Numbersingularsroplcsskslsl-rozajhrsrbsruukmkbgethuce
pluralproplcsskslsl-rozajhrsrbsruukmkbgethuce
dualdcsslsl-rozajsr
collectivelsl-rozaj
4Casenominativenplcsskslsl-rozajhrsrbsruukethuce
genitivegplcsskslsl-rozajhrsrbsruukfaethuce
dativedplcsskslsl-rozajhrsrbsruukhuce
accusativeaplcsskslsl-rozajhrsrbsruukhu
vocativevskhrbs
locativelplcsskslsl-rozajhrsrbsruukce
instrumentaliplcsskslsl-rozajhrsrbsruukhuce
directrro
obliqueoro
partitive1et
illativexethu
inessive2ethu
elativeeethu
allativetethuce
adessive3ethu
ablativebethu
translative4et
terminative9ethu
essivewethu
abessive5et
komitativeket
aditive7et
temporalismhu
causalischu
sublativeshu
delativehhu
sociativeqhu
factiveyhu
superessivephu
distributiveuhu
essive-formalfhu
multiplicative6hu
ergativezce
lativejce
comparison8ce
5Formdigitdroplcsskslsl-rozajhrsrbsruukmkbgethuce
romanrroplcsskslsl-rozajhrsrbsruukmkbgethuce
letterlroplcsskslsl-rozajhrsrbsruukmkbgethuce
bothbro
m-formmbg
approxabg
6Definitenessnonroslmkbgfa
yesyroslmkbgfa
short-artsbg
full-artfbg
proximalpmk
distaldmk
7Cliticnonro
yesyro
8Classdefinitefplcssk
definite11cssk
definite22cs
definite343plcs
definite2344sk
demonstrativedcssk
indefiniteicssk
interrogativeqcssk
relativercs
9Animatenonplcssksl-rozajhrsrbsruuk
yesyplcssksl-rozajhrsrbsruuk
10Owner_Numbersingularshu
pluralphu
11Owner_Personfirst1hu
second2hu
third3hu
12Owned_Numbersingularshu
pluralphu
13Humannonpl
yesypl

Notes: In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.11. Particle

Common specifications for Particle
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYParticleQroplcsskslsl-rozajhrsrbsruukmkbgce
1Typenegativezrohrsrbsbg
infinitivenro
subjunctivesro
aspectaro
futurefro
generalgbg
comparativecbg
verbalvbg
interrogativeqhrsrbsbg
modalohrsrbsbg
affirmativerhrsrbs
2Formationsimplesrumkbg
compoundcrumkbg
3Cliticnonropl
yesyropl
agglutinantapl
demandingdpl

2.3.12. Interjection

Common specifications for Interjection
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYInterjectionIenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typemoodmhu
otherohu
2Formationsimplessrrubg
compoundcsrrubg

2.3.13. Abbreviation

Common specifications for Abbreviation
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYAbbreviationYenroplcsskslsl-rozajhrsrbsruukmkbgfaethu
1Syntactic_Typenominalnrosrruet
verbalvroet
adjectivalaroet
adverbialrrosrruet
pronominalpro
2Gendermasculinemrosrru
femininefrosrru
neuternrosrru
3Numbersingularsrosrruet
pluralprosrruet
paucalcsrru
4Casenominativensrruet
genitivegsrruet
dativedsrru
accusativeasrru
locativelsrru
instrumentalisrru
directrro
obliqueoro
vocativevro
partitive1et
illativexet
inessive2et
elativeeet
allativetet
adessive3et
ablativebet
translative4et
terminative9et
essivewet
abessive5et
komitativeket
aditive7et
5Definitenessyesyro
nonro
2.3.13.1. Notes
  • In the Romanian case system the value 'direct' conflates 'nominative' and 'accusative', while the value 'oblique' conflates 'genitive' and 'dative'.

2.3.14. Residual

Common specifications for Residual
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYResidualXenroplcsskslsl-rozajhrsrbsruukmkbgfaethuce
1Typeforeignfslhrbsce
typotsl
programpsl
webwslhrbs
emoeslhrbs
hashtaghslhrbs
ataslhrbs
2.3.14.1. Notes
  • For Slovene the Type attribute has been introduced on Residual, which distinguishes the values of "foreign", to mark a words in a strech of foreign language text, "typo", a mis-typed word, and "program", where the tokenisation program made a mistake. The second, and esp. the third value are useful for hand-annotation of corpora.

2.3.15. Punctuation

Common specifications for Punctuation
PAttributeValueCodeEnglishRomanianPolishCzechSlovakSloveneResianCroatianSerbianBosnianRussianUkrainianMacedonianBulgarianPersianEstonianHungarianChechen
0CATEGORYPunctuationZslhrbsce
1Typepunctuationpce
sentence_endsce
2.3.15.1. Notes
  • Due to popular demand (i.e. Vladimir Benko:), punctuation has been introduced into the specifications in Version 5. Even though MULTEXT(-East) specifications and the MSDs are meant to describe the morphosyntax of words rather than act directly as corpus tags, they are nevetheless often used for exactly this purpose. This means, that punctuation is tagged in each corpus differently, and these specifications have so far given no explanatin how. Therefore, this Z category is provisionaly introduced; it is currently used only by a few langauges.
  • The sentence end is meant for TreeTagger and RFTagger by Helmut Schmid. However, this type might be reconsidered, as end of sentence is better expressed with XML elements, e.g. <s>
Up: 2. Common MULTEXT Specifications Previous: 2.2. Categories Next: 2.4. Attribute Index
Date: 2016-06-20
This work is licensed under the Creative Commons Attribution-ShareAlike 4.0 International.