<cesHeader
version="4.1"
type="text"
lang=en
creator=HJK
status="update"
date.created="1997-11-28"
date.updated="1997-12-21"
>
<filedesc>
<titlestmt>
<h.title>Multext-East cesAna: Nineteen Eighty-Four, Estonian</h.title>
<respstmt>
<respname>Heiki-Jaan Kaalep</respname>
<resptype>Overall Responsibility</resptype>
<respname>Kadri Muischnek</respname>
<resptype>Hand-tagging of part 1, chapter 1-4;
part 2 chapter 9</resptype>
<respname>Andriela Rääbis</respname>
<resptype>Hand-tagging of part 1, chapter 5-7;
part 3 chapter 1, 3, 4</resptype>
<respname>Heili Orav</respname>
<resptype>Hand-tagging of part 1, chapter 8;
part 3 chapter 2, 5, 6</resptype>
<respname>Helen Potter</respname>
<resptype>Hand-tagging of part 2, chapter 1-7</resptype>
<respname>Külli Habicht</respname>
<resptype>Hand-tagging of part 2, chapter 8</resptype>
<respname>Vladimír Petkevič</respname>
<resptype>Conversion to cesAna DTD </resptype>
</respstmt>
</titlestmt>
<editionstmt version="1.0">MTE Final Release</editionstmt>
<extent>
<wordCount>75433</wordCount>
<byteCount units="MB">18.7 MB</byteCount>
<extnote>wordCount represents he number of TOK TYPE=WORD
elements in the text. byteCount is in megaBytes</extnote>
</extent>
<publicationstmt>
<distributor>
TÜ arvutuslingvistika uurimisgrupp
</distributor>
<pubaddress>Tiigi 78-232, Tartu, Estonia</pubaddress>
<eaddress type="email">hkaalep@psych.ut.ee</eaddress>
<eaddress type="www">http://www.cl.ut.ee</eaddress>
<availability status="free">
Freely available
</availability>
<pubDate value="1998-01-01">January 1st, 1998</pubDate>
</publicationstmt>
<sourcedesc>
<biblfull>
<titlestmt>
<h.title>Multext-East CES1: Nineteen Eighty-Four, Estonian</h.title>
</titlestmt>
<publicationstmt>
<distributor>
TÜ arvutuslingvistika uurimisgrupp
</distributor>
<pubaddress>Tiigi 78-232, Tartu, Estonia</pubaddress>
<eaddress type="email">hkaalep@psych.ut.ee</eaddress>
<eaddress type="www">http://www.cl.ut.ee</eaddress>
<availability status="free">
Freely available
</availability>
<pubDate value="1997-10-01">October 1, 1997</pubDate>
</publicationstmt>
<sourcedesc>
<biblstruct>
<monogr>
<h.title>1984</h.title>
<h.author>George Orwell</h.author>
<h.author>Translator: Elias Treeman</h.author>
<imprint>
<pubdate>1990</pubdate>
<publisher>Loomingu Raamatukogu nr. 48-51</publisher>
<publisher>Perioodika</publisher>
<pubplace>Tallinn</pubplace>
</imprint>
</monogr>
</biblstruct>
</sourcedesc>
</biblfull>
</sourcedesc>
</filedesc>
<encodingdesc>
<projectdesc>
MULTEXT-East:
Multilingual Text Tools and Corpora for Central and Eastern
European Languages.
EU Copernicus Project COP106
</projectdesc>
<editorialdecl>
<transduction>
In the cesDoc to cesAna conversion, DIV, QUOTE tags and
HEAD, POEM, LIST elements have been omitted. cesDoc P
elements are encoded as PAR, and S as S.
Q tags have been encoded as punctuation symbols.
cesDoc sub-S level tags are omitted: DATE, NAME, ABBR, etc.
</transduction>
<quotation>
QUOTE tags from the cesDoc source not retained.
</quotation>
<segmentation>
S segmentation same as in cesDoc source (hand-validated).
TOK segmentation performed with mtseg and manually corrected,
</segmentation>
</editorialdecl>
<tagsdecl>
<tagusage gi=chunkList occurs=1>
Element corresponds to TEXT of the cesDoc source
</tagusage>
<tagusage gi=chunk occurs=1>
Element corresponds to BODY of the cesDoc source
</tagusage>
<tagusage gi=par occurs=1266>
Elements correspond to P elements of the cesDoc source.
The FROM attribute gives the reference to the ID of the
corresponding cesDoc P element.
</tagusage>
<tagusage gi=s occurs=6478>
Elements correspond to S elements of the cesDoc source
The FROM attribute gives the reference to the ID of the
corresponding cesDoc S element.
</tagusage>
<tagusage gi=tok occurs=94906>
Tokens are of TYPE=WORD or PUNCT, with the CLASS attribute
giving the mtseg class of the token.
</tagusage>
<tagusage gi=orth occurs=94906>
Contains the orthography of the token, as found in the
cesDoc source.
</tagusage>
<tagusage gi=disamb occurs=75433>
Contains disambiguated lexical information.
</tagusage>
<tagusage gi=lex occurs=147542>
Contains undisambiguated lexical information.
</tagusage>
<tagusage gi=base occurs=222975>
Base or lemmma of a token.
</tagusage>
<tagusage gi=msd occurs=222975>
Morphosyntactic description of a token.
</tagusage>
<tagusage gi=ctag occurs=94906>
Corpus tag.
</tagusage>
</tagsdecl>
</encodingdesc>
<profiledesc>
<creation date="1997-11-28">
</creation>
<langusage>
<![ %ONECOMPONENT [ &ISOlang; ]]>
</langusage>
</profiledesc>
<revisiondesc>
<change>
<changedate>1997-11-28</changedate>
<respname>Heiki-Jaan Kaalep</respname>
<h.item>Initial header</h.item>
</change>
<change>
<changedate>1997-12-21</changedate>
<respname>Tomaz Erjavec, IJS</respname>
<h.item>Modified EDITIONSTMT and changed ... to …</h.item>
</change>
</revisiondesc>
</cesheader>