This document is an HTML 3.2 rendering of a
Corpus Encoding Specification
DTD document, produced in the scope of the
MULTEXT-East
project, by
Fred.
Note that this HTML translation does not contain all the information from the cesHeader.
CES header
Creator: CO
Created: 1996-04-20
Updated: 1997-09-25
File Description
- Title Statement
- Title:
- Multext-East CES1: Newspapers, Hungarian
- Responsibility
- Csaba Oravecz
(CES1 conformant tagging)
- Edition:
- MTE Final Release
- Extent:
- 92233 words
1264874 bytes
- Publication Statement
- Distributor:
-
Research Institute for Linguistics, Hungarian Academy of Sciences
- Address:
-
Budapest, Színház u. 5-9.
- Electronic address:
-
- Electronic address:
-
- Availability:
-
Available for research purposes upon receipt of signed agreement
- Publication date:
- October 1, 1997
- Source Description
- Full Bibliography
- Title Statement
- Title:
-
Magyar Hírlap: Pre-edited ASCII text version
- Publication Statement
- Distributor:
-
Magyar Hírlap Publishing House Ltd.
- Address:
-
Budapest, Kerepesi út 29/b.
- Availability:
-
Available for research purposes upon agreement
- Publication date:
- Unknown
- Source Description
- Structured Bibliography
- Analytic
- Title:
- Magyar Hírlap, 25/01/1996 issue
- Title:
- Magyar Hírlap, 31/01/1996 issue
- Title:
- Magyar Hírlap, 22/01/1996 issue
- Monograph
- Title:
- MULTEXT-East Hungarian Newspaper Corpus
- Imprint
Encoding Description
- Project Description:
-
MULTEXT-East:
Multilingual Text Tools and Corpora for Central and
Eastern European Languages.
EU Copernicus Project COP106
- Tag declaration:
- abbr = 289
- body = 1
- byline = 341
- caption = 2
- div = 316
- docAuthor = 179
- foreign = 2
- head = 545
- hi = 7
- mentioned = 131
- name = 1110
- note = 19
- p = 1293
- q = 218
- ref = 15
- sp = 145
- text = 1
- title = 11
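
The tag declaration above records how often each element occurs in the corpus text. A minimal sketch of how such counts could be produced is given below, assuming the text is available as well-formed XML (the original corpus is SGML, which the Python standard library does not parse). The function name `count_tags` and the `SAMPLE` string are hypothetical and not taken from the corpus or the MULTEXT-East tools.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_tags(xml_text: str) -> Counter:
    """Count how often each element name occurs in an XML string."""
    root = ET.fromstring(xml_text)
    # root.iter() yields the root element and all of its descendants.
    return Counter(elem.tag for elem in root.iter())


# Hypothetical miniature sample for illustration; not taken from the corpus.
SAMPLE = (
    "<text><body><div><head>Title</head>"
    "<p>One <name>Budapest</name>.</p><p>Two.</p>"
    "</div></body></text>"
)

if __name__ == "__main__":
    # Print the counts in the same "tag = n" form as the tag declaration.
    for tag, n in sorted(count_tags(SAMPLE).items()):
        print(f"{tag} = {n}")
```

Sorting by tag name mirrors the alphabetical order used in the tag declaration.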
Revision Description
- Date: 1996-10-20
Csaba Oravecz
- corrected a number of typos
- Date: 1996-10-25
Csaba Oravecz
- modified header according to ME template
- Date: 1997-03-20
Tomaž Erjavec, IJS
- Normalisation of corpus component CESHEADER elements:
CESHEADER, EDITIONSTMT, TITLESTMT/H.TITLE
- ISO LANGUAGEs implemented as marked section PUBLIC ent
- Language (WSDs) implemented as PUBLIC entities
- Date: 1997-09-25
Tomaž Erjavec
- Changed editionStmt, Extent, pubDate, Availability
to final form