Next:
Introduction
Up:
Multext-East D2.1 F
Previous:
Multext-East D2.1 F
Contents
Contents
Introduction
Background
Description of the Corpus
License agreements
Corpus Encoding
Multilingual Parallel: Orwell's ``1984''
Overview
Structure of the Corpus
The languages of ``1984''
Sentence segmentation
ID marking
English
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Bulgarian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Czech
Description of the Corpus
Structure of the Corpus
Markup Process
Estonian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Hungarian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Romanian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Slovene
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Multilingual Comparable 1: Fiction
Bulgarian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Czech
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Estonian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Hungarian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Romanian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Slovene
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Multilingual Comparable 2: Newspapers
Bulgarian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Czech
Description of the corpus
Structure of the corpus
Structure of the original
The markup
Estonian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Hungarian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Romanian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Slovene
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Multilingual Parallel Speech Corpus
Organisation of the MULTEXT-East EUROM Corpus
Structure of the CES Speech Corpus
TELRI Appendix 1: Additional ``1984''
Overview
Corpus Encoding
Sentence segmentation
ID marking
HTML rendering
Latvian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Lithuanian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Serbo-Croatian
Description of the Corpus
Structure of the Corpus
Structure of the Original
Markup Process
Multext-East