<cesHeader
version="4.1"
type="text"
lang=en
creator=LD
status="update"
date.created="1997-11-30"
date.updated="1997-12-21"
>
<filedesc>
<titlestmt>
<h.title>Multext-East cesAna: Nineteen Eighty-Four, Bulgarian</h.title>
<respstmt>
<respname>Ludmila Dimitrova, Lydia Sinapova</respname>
<resptype>Overall Responsibility</resptype>
<respname>Ludmila Dimitrova, Kiril Simov</respname>
<resptype>Hand-tagging of first chapter first part</resptype>
<respname>Ludmila Dimitrova</respname>
<resptype>Hand-tagging of second chapter first part, first
chapter second part</resptype>
<respname>Vladimír Petkevič</respname>
<resptype>Conversion to cesAna DTD </resptype>
</respstmt>
</titlestmt>
<editionstmt version="1.0">MTE Final Release</editionstmt>
<extent>
<wordCount>86020</wordCount>
<byteCount units="MB">29.9</byteCount>
<extnote>wordCount represents the number of TOK TYPE=WORD
elements in the text.
</extnote>
</extent>
<publicationstmt>
<distributor>
Institute of Mathematics and Informatics,
Bulgarian Academy of Sciences, Sofia
</distributor>
<pubaddress>
Acad G. Bonchev st. bl.8
1113 Sofia, Bulgaria
</pubaddress>
<eaddress type="email">ludmila@ling.math.acad.bg</eaddress>
<availability status="restricted">
Available for research purposes upon receipt of signed
agreement
</availability>
<pubDate value="1998-01-01">January 1st, 1998</pubDate>
</publicationstmt>
<sourcedesc>
<biblfull>
<titlestmt>
<h.title>Multext-East CES1: Nineteen Eighty-Four, Bulgarian
</h.title>
</titlestmt>
<publicationstmt>
<distributor>
Institute of Mathematics and Informatics,
Bulgarian Academy of Sciences, Sofia
</distributor>
<pubaddress>
Acad G. Bonchev st. bl.8
1113 Sofia, Bulgaria
</pubaddress>
<eaddress type="email">ludmila@ling.math.acad.bg</eaddress>
<availability status="restricted">
Available for research purposes upon receipt of signed
agreement
</availability>
<pubDate value="1997-10-01">October 1, 1997</pubDate>
</publicationstmt>
<sourcedesc>
<biblfull>
<titlestmt>
<h.title> Electronic form of 1984 by George Orwell in
Bulgarian
</h.title>
<respstmt>
<respname>
Ludmila Dimitrova (BAS), Lydia Sinapova (BAS),
Kiril Simov(BAS)
</respname>
<resptype>
Typing-in 1984.
</resptype>
</respstmt>
</titlestmt>
<publicationstmt>
<distributor>
Institute of Mathematics and Informatics,
Bulgarian Academy of Sciences, Sofia
</distributor>
<pubaddress>
Acad G. Bonchev st. bl.8
1113 Sofia, Bulgaria
</pubaddress>
<availability status=restricted>
Available for research purposes upon receipt of signed
agreement
</availability>
<pubdate>1997</pubdate>
</publicationstmt>
<sourcedesc>
<biblstruct>
<monogr>
<h.title>1984)</h.title>
<h.author>George Orwell</h.author>
<h.author>Translator: Lydia Bozhilova</h.author>
<imprint>
<pubdate>1989</pubdate>
<publisher>Profizdat</publisher>
<pubplace>Sofia, Bulgaria</pubplace>
</imprint>
</monogr>
</biblstruct>
</sourcedesc>
</biblfull>
</sourcedesc>
</biblfull>
</sourcedesc>
</filedesc>
<encodingdesc>
<projectdesc>
MULTEXT-East:
Multilingual Text Tools and Corpora for Central and Eastern
European Languages.
EU Copernicus Project COP106
</projectdesc>
<editorialdecl>
<transduction>
In the cesDoc to cesAna conversion, DIV, QUOTE, Q tags and
HEAD, POEM, LIST elements have been omitted. cesDoc P
elements are encoded as PAR, and S as S.
cesDoc sub-S level tags are omitted: DATE, NAME, ABBR, etc.
</transduction>
<quotation>
Q and QUOTE tags from the cesDoc source not retained.
</quotation>
<segmentation>
S segmentation same as in cesDoc source (hand-validated).
TOK segmentation performed with mtseg and manually corrected,
</segmentation>
</editorialdecl>
<tagsdecl>
<tagusage gi=chunklist occurs=1>
Element corresponds to TEXT of the cesDoc source
</tagusage>
<tagusage gi=chunk occurs=1>
Element corresponds to BODY of the cesDoc source
</tagusage>
<tagusage gi=par occurs=1322>
Elements correspond to P elements of the cesDoc source.
The FROM attribute gives the reference to the ID of the
corresponding cesDoc P element.
</tagusage>
<tagusage gi=s occurs=6682>
Elements correspond to S elements of the cesDoc source
The FROM attribute gives the reference to the ID of the
corresponding cesDoc S element.
</tagusage>
<tagusage gi=tok occurs=101173>
Tokens are of TYPE=WORD or PUNCT, with the CLASS attribute
giving the mtseg class of the token.
</tagusage>
<tagusage gi=orth occurs=101173>
Contains the orthography of the token, as found in the
cesDoc source.
</tagusage>
<tagusage gi=disamb occurs=86020>
Contains disambiguated lexical information.
</tagusage>
<tagusage gi=lex occurs=156002>
Contains undisambiguated lexical information.
</tagusage>
<tagusage gi=base occurs=242022>
Base or lemma of a token.
</tagusage>
<tagusage gi=msd occurs=156002>
Morphosyntactic description of a token.
</tagusage>
<tagusage gi=ctag occurs=257175>
Corpus tag.
</tagusage>
</tagsdecl>
</encodingdesc>
<profiledesc>
<creation date="1997-11-27">
</creation>
<langusage>
<![ %ONECOMPONENT [ &ISOlang; ]]>
<language id=ns-bg iso639=bg>Newspeak Bulgarian</language>
</langusage>
</profiledesc>
<revisiondesc>
<change>
<changedate>1997-12-19</changedate>
<respname>Vladimír Petkevič, ÚTKL FFUK,
Prague</respname>
<h.item>Filled in tags' usage, wordcount and bytecount</h.item>
</change>
<change>
<changedate>1997-12-21</changedate>
<respname>Tomaz Erjavec, IJS</respname>
<h.item>Converted from ISO Cyrillic to SGML entities</h.item>
<h.item>Changed ... to …</h.item>
<h.item>Modified EDITIONSTMT, BYTECOUNT</h.item>
</change>
</revisiondesc>
</cesheader>