This document is an HTML 3.2 rendering of a Corpus Encoding Specification DTD document, produced in the scope of the MULTEXT-East project, by Fred.
Note that this HTML translation does not contain all the information from the cesHeader.
CES header
Creator: HJK
Created: 1995-10-18
Updated: 1997-09-25
File Description
- Title Statement
- Title:
- Multext-East CES1: Newspapers, Estonian
- Responsibility
- Urve Talvik (entered the text)
Riina Mosna (entered the text)
Heiki-Jaan Kaalep (supervised the work)
Heiki-Jaan Kaalep (modified the header for version 4)
- Edition:
- MTE Final Release
- Extent:
- 112003 words
1506247 bytes
Note:
WordCount represents the number of words in this
text exclusive of tags and header information.
ByteCount reflects the approximate size of the
file containing the doctype and cesDoc element
including all text, tags and header information.
- Publication Statement
- Distributor:
- TÜ arvutuslingvistika uurimisgrupp
- Address:
- Tiigi 78-232, Tartu, Estonia
- Electronic address:
- hkaalep@psych.ut.ee
- Availability:
- Freely available
- Publication date:
- October 1, 1997
- Source Description
- Structured Bibliography
- Analytic
- Title:
- Õhtuleht 25/04/1985
- Title:
- Noorte Hääl 02/11/1985
- Title:
- Noorte Hääl 26/12/1985
- Title:
- Õhtuleht 26/12/1985
- Title:
- Rahva Hääl 21/03/1985
- Title:
- Rahva Hääl 15/05/1985
- Title:
- Rahva Hääl 19/05/1985
- Title:
- Noorte Hääl 28/05/1985
- Title:
- Noorte Hääl 29/05/1985
- Title:
- Punane Täht 11/06/1985
- Title:
- Sirp ja Vasar 20/09/1985
- Monography
- Title:
-
- Imprint
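The Extent note in the File Description defines its two figures only informally. As a rough illustration, the Python sketch below computes both for a marked-up string. The function name `extent`, the whitespace tokenization, the regex-based tag stripping, and the default encoding are all assumptions made for this example; they are not the procedure the project actually used.

```python
import re

def extent(document, encoding="utf-8"):
    """Return (word_count, byte_count) for a marked-up document string."""
    # ByteCount: size of the whole document, tags and header included.
    byte_count = len(document.encode(encoding))
    # WordCount: drop the header first, then every remaining tag.
    body = re.sub(r"<cesHeader\b.*?</cesHeader>", " ", document,
                  flags=re.DOTALL | re.IGNORECASE)
    text_only = re.sub(r"<[^>]+>", " ", body)
    # Assumption: a "word" is any whitespace-delimited token.
    word_count = len(text_only.split())
    return word_count, byte_count

# Hypothetical miniature document, not taken from the corpus.
sample_doc = (
    "<cesDoc><cesHeader><title>Ignored header words</title></cesHeader>"
    "<text><body><p><s>Tere tulemast Tartusse!</s></p></body></text></cesDoc>"
)
words, size = extent(sample_doc)
print(f"{words} words")
print(f"{size} bytes")
```

A real tokenizer would treat punctuation and abbreviations differently, so figures produced this way would only approximate the counts recorded above.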
Encoding Description
- Project Description:
- MULTEXT-East: Multilingual Text Tools and Corpora for Central and
Eastern European Languages. EU Copernicus Project COP106
- Tag declaration:
- abbr = 1864
- author = 168
- bibl = 168
- body = 1
- byline = 333
- corr = 1
- date = 11
- distinct = 131
- div = 388
- docauthor = 333
- foreign = 4
- head = 356
- hi = 1008
- item = 244
- list = 39
- name = 7629
- note = 2
- num = 344
- p = 2423
- q = 385
- quote = 89
- ref = 3
- s = 7758
- text = 1
- title = 333
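Tag usage figures like those in the Tag declaration can be tallied mechanically. The following Python sketch shows one way to do it, assuming every element has an explicit start tag (SGML tag omission is not handled). The name `count_tags` and the sample string are hypothetical and not part of the corpus tooling.

```python
import re
from collections import Counter

# Matches a start tag and captures its element name. Closing tags,
# comments and declarations are skipped because the character right
# after "<" must be a letter.
START_TAG = re.compile(r"<([A-Za-z][A-Za-z0-9.\-]*)[^>]*>")

def count_tags(markup):
    """Count start tags by element name, case-insensitively."""
    return Counter(name.lower() for name in START_TAG.findall(markup))

# Hypothetical fragment, not taken from the corpus.
sample_markup = (
    '<text><body><div type="article"><p><s>Tere <name>Tartu</name>!</s>'
    "<s>Teine lause.</s></p></div></body></text>"
)
tag_counts = count_tags(sample_markup)
for tag in sorted(tag_counts):
    print(f"{tag} = {tag_counts[tag]}")
```

The output follows the same `tag = count` layout as the list above, sorted alphabetically by element name.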
Revision Description
- Date: 1996-10-31
Heiki-Jaan Kaalep, UT
- Changed the header to conform to the new CES version
- Date: 1997-03-20
Tomaž Erjavec, IJS
- Normalisation of corpus component CESHEADER elements:
CESHEADER, EDITIONSTMT, TITLESTMT/H.TITLE
- ISO LANGUAGEs implemented as marked section PUBLIC ent
- Language (WSDs) implemented as PUBLIC entities
- Date: 1997-09-25
Tomaž Erjavec
- Changed editionStmt, byteCount, pubDate
to final form