MULTEXT-East

TEI header

§file description
§title statement
§title MULTEXT-East Morphosyntactic Specifications
§statement of responsibility
§name, proper noun Tomaž Erjavec
§responsibility Editor
§statement of responsibility
§name, proper noun Dalian Zogaj
§name, proper noun Philipp Wasserscheidt
§responsibility Albanian specifications
§statement of responsibility
§name, proper noun Radoslav Pavlov
§name, proper noun Ludmila Dimitrova
§name, proper noun Lydia Sinapova
§name, proper noun Kiril Simov
§responsibility Bulgarian specifications
§statement of responsibility
§name, proper noun Dietmar Fiesel
§responsibility Chechen specifications
§statement of responsibility
§name, proper noun Vladimír Petkevič
§responsibility Czech specifications
§statement of responsibility
§name, proper noun Ivan Šimko
§responsibility Damaskini specifications
§statement of responsibility
§name, proper noun Nancy Ide
§name, proper noun Greg Priest-Dorman
§name, proper noun Tomaž Erjavec
§name, proper noun Tamas Varadi
§responsibility English specifications
§statement of responsibility
§name, proper noun Heiki-Jaan Kaalep
§responsibility Estonian specifications
§statement of responsibility
§name, proper noun Irina Lobzhanidze
§responsibility Georgian specifications
§statement of responsibility
§name, proper noun Laszlo Tihanyi
§name, proper noun Tamas Varadi
§responsibility Hungarian specifications
§statement of responsibility
§name, proper noun Katerina Zdravkova
§responsibility Macedonian specifications
§statement of responsibility
§name, proper noun Behrang QasemiZadeh
§responsibility Persian specifications
§statement of responsibility
§name, proper noun Natalia Kotsyba
§name, proper noun Ivan Derzhanski
§name, proper noun Adam Radziszewski
§responsibility Polish specifications
§statement of responsibility
§name, proper noun Han Steenwijk
§responsibility Resian specifications
§statement of responsibility
§name, proper noun Dan Tufiş
§name, proper noun Anna Maria Barbu
§responsibility Romanian specifications
§statement of responsibility
§name, proper noun Serge Sharoff
§name, proper noun Mikhail Kopotev
§name, proper noun Tomaž Erjavec
§name, proper noun Anna Feldman
§name, proper noun Dagmar Divjak
§responsibility Russian specifications
§statement of responsibility
§name, proper noun Tomaž Erjavec
§responsibility Serbo-Croatian specifications
§statement of responsibility
§name, proper noun Teodora Vuković
§responsibility Torlak specifications
§statement of responsibility
§name, proper noun Radovan Garabík
§responsibility Slovak specifications
§statement of responsibility
§name, proper noun Tomaž Erjavec
§name, proper noun Simon Krek
§name, proper noun Peter Holozan
§name, proper noun Vojko Gorjanc
§name, proper noun Marko Stabej
§responsibility Slovene specifications
§statement of responsibility
§name, proper noun Natalia Kotsyba
§name, proper noun Igor Shevchenko
§name, proper noun Ivan Derzhanski
§responsibility Ukrainian specifications
§funding body EU FP5 Copernicus Project COP106 "MULTEXT-East"
§funding body EU FP5 Copernicus Concerted Action "TELRI"
§funding body NFS Grant "TEI SGML to XML migration"
§funding body EU FP5 Copernicus Project PL96-1142 "Concede"
§funding body Slovene-Serbian and Slovene-Macedonian bi-lateral projects
§funding body EU FP7 Project "MONDILEX"
§funding body EU Research Infrastructure CLARIN
§funding body Individual partners' grants and contracts.
§edition statement

Version 6 "CLARIN"

§publication statement
§distributor MULTEXT-East Web site
§address http://nl.ijs.si/ME/V6/mte-msd
https://github.com/clarinsi/mte-msd
§availability

This work is licenced under the Attribution-ShareAlike 4.0 International.

§date 2021-08-20
§source description
§fully-structured bibliographic citation
§title statement
title MULTEXT-East Morphosyntactic Specifications, Versions 4 & 5
§publication statement
distributor MULTEXT-East Web site
address http://nl.ijs.si/ME/
date
when = 2004-05-13
May 13th, 2004
date
when = 2010-05-02
May 2nd, 2010
date
when = 2016-06-20
June 20th, 2016
§encoding description
§project description

The MULTEXT-East resources are a multilingual dataset for language engineering research and development. This dataset contains, for Bosnian, Bulgarian, Chechen, Czech, Damaskini, English, Estonian, Georgian, Hungarian, Macedonian, Persian, Polish, Resian, Romanian, Russian, Serbo-Croatian, Slovak, Slovene, Torlak, and Ukrainian, some, or all of the following language resources: the MULTEXT-East morphosyntactic specifications, lexica, and annotated "1984" corpus; the MULTEXT-East parallel and comparable text and speech corpora; and associated documentation.

§text-profile description
§language usage
§language
id = en ident = en
English
§language
id = ro ident = ro
Romanian
§language
id = sq ident = sq
Albanian
§language
id = ru ident = ru
Russian
§language
id = uk ident = uk
Ukrainian
§language
id = pl ident = pl
Polish
§language
id = cs ident = cs
Czech
§language
id = sk ident = sk
Slovak
§language
id = sl ident = sl
Slovenian
§language
id = sl-rozaj ident = sl-rozaj
Resian
§language
id = hbs ident = sh
Serbo-Croatian
§language
id = sr-tor ident = sr-tor
Torlak
§language
id = mk ident = mk
Macedonian
§language
id = bg ident = bg
Bulgarian
§language
id = bg-dam ident = bg-dam
Damaskini
§language
id = fa ident = fa
Persian
§language
id = ka ident = ka
Georgian
§language
id = et ident = et
Estonian
§language
id = hu ident = hu
Hungarian
§language
id = ce ident = ce
Chechen
§revision description
§change
when = 2021-08-20
ET, updated Georgian MSDs.
§change
when = 2021-07-29
ET, add Georgian MSDs, update handle for Damaskini.
§change
when = 2021-03-17
ET, add draft Georgian.
§change
when = 2020-11-08
ET, remove Croatian & Serbian, V6 goes live.
§change
when = 2020-11-04
ET, add Damaskini.
§change
when = 2020-08-31
ET, add Torlak.
§change
when = 2020-08-27
ET, incorporate changes to Macedonian.
§change
when = 2019-11-24
ET, add Albanian (iso code = sq).
§change
when = 2019-02-08
ET, revert Croatian to V4, change Bosnian to Serbo-Croatian.
§change
when = 2018-11-11
ET, add punctuation and change format of lang.spec. sections.
§change
when = 2018-11-06
ET, start working on V6.
§change
when = 2016-06-20
ET, details.
§change
when = 2016-06-15
ET, new version of Slovene specifications, added Bosnian.
§change
when = 2016-06-13
ET, added new X tags, new version of Croatian specifications.
§change
when = 2014-10-07
ET, new version of Croatian specifications.
§change
when = 2014-09-01
ET, new category "Z" for punctuation.
§change
when = 2013-12-06
ET, Chechen.
§change
when = 2010-05-02
ET, some more corrections to Serbian, Slovak, Polish specifications.
§change
when = 2010-03-17
ET, corrections to Slovak, Slovene, Polish, Ukrainian specifications.
§change
when = 2009-10-06
ET, updated English, Czech, Romanian, Serbian, Macedonian, Bulgarian, Estonian, Hungarian, with WFL-generated MSD index.
§change
when = 2009-09-30
ET, added Slovak, merged with common tables.
§change
when = 2009-09-19
ET, new version of Polish and Ukrainian, merged with common tables.
§change
when = 2009-08-27
ET, added Polish specifications - not yet merged with common tables
§change
when = 2009-05-13
ET, updated and merged Hungarian, Farsi; added first version of Ukraninian; tried to merge Slovene, forced to do JOS 1.1 (eventualy)
§change
when = 2009-04-07
ET, updated Resian.
§change
when = 2009-02-09
ET, minor changes.
§change
when = 2007-07-09
ET, various changes, switch to TEI P5.
§change
when = 2006-11-12
ET, initial conversion from LaTeX to TEI P4.
: 2021-08-20
This work is licenced under the Creative Commons Attribution 4.0 licence.