TEI Header

§file description
§title statement
§title
id = mten-hu.title
Multext-East cesDoc corpus: Newspapers, Hungarian
§statement of responsibility
§name Csaba Oravecz
§responsibility CES1 conformant tagging
§statement of responsibility
§name Tomaž Erjavec
§responsibility Conversion to XML/TEI P5
§edition statement
§edition MULTEXT-East, Version 4
§extent
§measure
type = words
92233
§publication statement
§address http://nl.ijs.si/ME/V4/
§distributor Research Institute for Linguistics, Hungarian Academy of Sciences
§address Budapest, Színház u. 5-9.
§date
when = 2010-05-09
2010-05-09
§source description
§fully-structured bibliographic citation
§title statement
§title Multext-East CES1: Newspapers, Hungarian
§statement of responsibility
name Csaba Oravecz
responsibility CES1 conformant tagging
§edition statement

MTE Final Release

§publication statement
§distributor Research Institute for Linguistics, Hungarian Academy of Sciences
§address Budapest, Színház u. 5-9.
§availability

Available for research purposes upon receipt of signed agreement

§date
when = 1997-10-01
October 1, 1997
§source description
§fully-structured bibliographic citation
title statement
title Magyar Hírlap: Pre-edited ASCII text version
publication statement
distributor Magyar Hírlap Publishing House Ltd.
address Budapest, Kerepesi út 29/b.
availability

Available for research purposes upon agreement

date Unknown
source description
citation list
bibliographic citation
title Magyar Hírlap, 25/01/1996 issue
bibliographic citation
title Magyar Hírlap, 31/01/1996 issue
bibliographic citation
title Magyar Hírlap, 22/01/1996 issue
§encoding description
§project description

MULTEXT-East: Multilingual Text Tools and Corpora for Central and Eastern European Languages. EU Copernicus Project COP106

§editorial practice declaration
§normalization

Corpus Encoding Standard, Version 4.0 CES LEVEL: 1

§tagging declaration
§namespace
name = http://www.tei-c.org/ns/1.0
§tag usage
gi = abbr occurs = 289
abbreviation
§tag usage
gi = body occurs = 1
text body
§tag usage
gi = byline occurs = 341
byline
§tag usage
gi = caption occurs = 2
caption
§tag usage
gi = div occurs = 316
text division
§tag usage
gi = docauthor occurs = 179
docauthor
§tag usage
gi = foreign occurs = 2
foreign
§tag usage
gi = head occurs = 545
heading
§tag usage
gi = hi occurs = 7
highlighted
§tag usage
gi = mentioned occurs = 131
mentioned
§tag usage
gi = name occurs = 1110
name
§tag usage
gi = note occurs = 19
note
§tag usage
gi = p occurs = 1293
paragraph
§tag usage
gi = q occurs = 218
separated from the surrounding text with quotation marks
§tag usage
gi = ref occurs = 15
reference
§tag usage
gi = sp occurs = 145
speech
§tag usage
gi = text occurs = 1
text
§tag usage
gi = title occurs = 11
title
§text-profile description
§text classification
§category reference
target = news
§revision description
§change 10/20/1996<date>Csaba Oravecz<name>corrected a number of typos
§change 10/25/1996<date>Csaba Oravecz<name>modified header according to ME template
§change 1997-03-20<date>Tomaz Erjavec, IJS<name>Normalisation of corpus component CESHEADER elements: CESHEADER, EDITIONSTMT, TITLESTMT/H.TITLE
§change 1997-03-20<date>Tomaz Erjavec, IJS<name>ISO LANGUAGEs implemented as marked section PUBLIC ent
§change 1997-03-20<date>Tomaz Erjavec, IJS<name>Language (WSDs) implemented as PUBLIC entities
§change 1997-09-25<date>Tomaž Erjavec<name>Changed editionStmt, Extent, pubDate, Availability to final form
§change 2004-05-10<date>Tomaž Erjavec<name>Converted to TEI P4, prepared for MTE V3
§change 2010-05-09<date>Tomaž Erjavec<name>Conversion to MULTEXT-East TEI P5.