This document is an HTML 3.2 rendering of a
Corpus Encoding Specification
DTD document, produced in the scope of the
MULTEXT-East
project, by
Fred.
Note that this HTML translation does not contain all the information from the cesHeader.
CES header
Creator: VP
Created: 1996-05-10
Updated: 1997-09-25
File Description
- Title Statement
- Title:
- Multext-East CES1: Newspapers, Czech
- Responsibility
-
Vladimír Petkevič
(
Checked the markup and modified it for correctness, down
to the subparagraph level
)
- Edition:
- MTE Final Release
- Extent:
- 90683 words
1210768 bytes
- Publication Statement
- Distributor:
-
Institute of Theoretical and Computational Linguistics,
Faculty of Philosophy, Charles University, Czech Republic
UTKL FFUK
- Address:
-
Celetná 13,
Prague, Czech Republic
- Electronic address:
-
Vladimir.Petkevic@ff.cuni.cz
- Electronic address:
-
ftp: ucnk.ff.cuni.cz directory: pub/corpora/ME
- Availability:
-
Available for research purposes upon receipt of signed agreement
- Publication date:
- October 1, 1997
- Source Description
- Full Bibliography
- Title Statement
- Title:
-
Lidové noviny - collection of articles, 1991-1994;
Obtained in electronic form (WordPerfect format) by
the Institute of Theoretical and Computational Linguistics,
Faculty of Philosophy, Charles University, Czech Republic
UTKL FFUK
- Responsibility
-
publisher: Lidové noviny, Praha
(
entered in electronic form (WordPerfect format)
)
- Publication Statement
- Distributor:
-
publisher: Lidové noviny, Praha
distributor of the paper version of newspaper
articles
The electronic texts were made available to the
Institute of Theoretical and Computational Linguistics,
Faculty of Philosophy, Charles University, Czech Republic
UTKL FFUK for research purposes
- Address:
-
Prague, Czech Republic
- Availability:
-
Electronic form available for non-profit purposes
It was made available to:
Institute of Theoretical and Computational Linguistics,
Faculty of Philosophy, Charles University, Czech Republic
UTKL FFUK
- Publication date:
-
1991-1994
- Source Description
- Structured Bibliography
- Monograph
- Title:
-
Lidové noviny - collection of 451 articles from the
1991-1994 period
- Author:
-
various newspapermen
- Imprint
- Publication date:
-
1991-1994
- Publisher:
-
Lidové noviny
- Place:
-
Praha
Encoding Description
- Project Description:
-
MULTEXT-East:
Multilingual Text Tools and Corpora for Central and Eastern
European Languages.
EU Copernicus Project COP106
- Tag declaration:
- abbr = 1241
- body = 1
- byline = 76
- date = 547
- dateline = 189
- div = 450
- foreign = 9
- head = 537
- hi = 59
- name = 802
- num = 1222
- opener = 189
- p = 1360
- q = 588
- text = 1
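The tag declaration above lists each element name with the number of times it occurs in the corpus component. As a rough illustration of how such a tally could be reproduced, here is a minimal Python sketch. It is not the tool the project used: the `count_tags` helper and the sample markup are hypothetical, and the regular expression assumes start tags are written in full, without SGML tag minimisation.

```python
import re
from collections import Counter

# Matches a start tag such as <p> or <div type="article"> and captures the
# element name. End tags (</p>) are skipped because a letter must follow "<".
START_TAG = re.compile(r"<([A-Za-z][A-Za-z0-9.\-]*)(?=[\s>/])")


def count_tags(markup: str) -> Counter:
    """Count start tags by element name, lower-cased (SGML names are case-insensitive here)."""
    return Counter(name.lower() for name in START_TAG.findall(markup))


# Hypothetical sample text, not taken from the corpus.
sample = (
    "<text><body><div><head>Title</head>"
    "<p>On <date>1.10.1997</date> in <name>Praha</name></p>"
    "<p>Second paragraph</p></div></body></text>"
)

counts = count_tags(sample)
for tag in sorted(counts):
    # Same "name = count" layout as the tag declaration list.
    print(f"{tag} = {counts[tag]}")
```

Run over the full corpus file, a tally of this kind could be compared against the figures in the tag declaration as a consistency check.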
Revision Description
- Date: 1996-05-10
Vladimír Petkevič, UTKL FFUK, Praha
-
1) Corrected some spelling errors
2) Marked up to CES1 compliance:
3) created header
4) inserted DIV (one level)
5) inserted subparagraph tags such as P, Q, NAME,
ABBR, FOREIGN, BYLINE, etc.
- Date: 1996-10-22
Vladimír Petkevič, UTKL FFUK, Praha
-
Corrected the header so as to meet the
requirements imposed by creating the corpus
containing all corpus components as one SGML
document
- Date: 1997-03-17
Tomaž Erjavec, IJS
- Normalisation: DIV/COMPLETE=Y deleted, since it is the default
- Date: 1997-03-20
Tomaž Erjavec, IJS
- Normalisation of corpus component CESHEADER elements:
CESHEADER, EDITIONSTMT, TITLESTMT/H.TITLE
- ISO LANGUAGEs implemented as marked section PUBLIC entities
- Language (WSDs) implemented as PUBLIC entities
- Date: 1997-09-25
Tomaž Erjavec
- Changed editionStmt, Extent, pubDate, Availability
to final form