MULTEXT-East Morphosyntactic Specifications Version 4 http://nl.ijs.si/ME/V4/msd/ This directory contains the "tabular" files automatically derived from the XML source. They map MSD tags into various other formats, e.g. to attribute-value pairs. The directory contains the following files, where LL is the particular language: 00README.txt This file msd-fslib-LL.xml File with the language-particular specifications and MSD list encoded as a TEI P5 feature(-structure) library. This library is also included with the annotated parallel 1984 corpus. msd-canon-LL.tbl File with canonical expansions and translation of MSDs. The first column contains the complete list of MSDs in the language-particular (English) encoding. The second column contains the expansion of the MSDs into attribute-value pairs, for all attributes defined in the language-particular specifications, regardless of the category (PoS). The third column contains the MSDs in the MULTEXT-East common encoding. The fourth gives the common MSD expansions into AV pairs for all the MULTEXT-East defined attributes. Note that for many languages the language-particular and common MSDs are identical. msd-human-LL.tbl File with expansions of MSDs mostly useful for humans. Each MSD is given its collating sequence (for sorting MSDs in their specifications-defined order) in column one. The second column contains the language-particular MSD. The third column gives the expansion of the MSD as a list of values; this is the the shortest human readable form of the MSDs. The specifications for a few language (sl, uk, sk) contain also localisation information. For these, the fourth column contains the localised MSD (where MSDs have not been localised, the English ones are given). The fifth column contains the localised short expansion, and the sixth the localised attribute-value expansion. The .tbl files are encoded in UTF-8 with TAB (^I) as record separator and SPACE as the secondary separator. File uses Unix-type end-of-lines (^J). The files were automatically produced by using the MULTEXT-East XSLT scripts from the source TEI XML MULTEXT-East morphosyntactic specifications, http://nl.ijs.si/ME/V4/msd/ ==================================================================================== Tomaz Erjavec, JSI 2010-05-09