This document is an HTML 3.2 rendering of a Corpus Encoding Specification DTD document,
by Fred, using the ceshdr2html_tmap.fred translation map.
Note that this HTML translation does not contain all the information
from the original document.
Uses ISO 8859-1 (Latin-1) encoding.
CES header
Version: 4.1, Type: text, Language: en,
Creator: CK, Status: update, Created: 1997-12-03, Updated: 1997-12-28
File Description
- Title Statement
- Title:
- TELRI CES: Nineteen Eighty-Four, Serbo-Croatian
- Responsibility
- Cvetana Krstev
(Error correction, CES1 conformance,
and supporting awk script.)
Dusko Vitas
(Text acquisition from the Oxford Text Archive
and consulting.)
Tomaz Erjavec
(CES1 conformance, encoding harmonisation
with the MULTEXT-East '1984' corpus.)
- Edition:
- TELRI Final Release
- Extent:
- 89749 words, 863 kb
Note:
WordCount represents the number of words in this text exclusive
of tags and header information. ByteCount reflects the
size of the file containing the doctype and cesDoc element including
all text, tags and header information.
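The note above distinguishes a word count that excludes markup from a byte count that covers the whole file. A minimal Python sketch of that distinction follows. The function names, the regular expressions, and the treatment of entities (letter entities such as &cy; count as part of a word, while mdash and hellip count as separators) are assumptions made for illustration, not the procedure actually used to produce the figures in this header.

```python
import re

# Any SGML start-tag, end-tag, or comment that contains no '>' internally.
TAG_RE = re.compile(r"<[^>]*>")
# Entity references such as &cy; or &mdash;.
ENTITY_RE = re.compile(r"&([A-Za-z0-9.]+);")
# Assumption: these entities are punctuation, not letters.
PUNCT_ENTITIES = {"mdash", "hellip"}


def _entity_repl(match):
    # Punctuation entities separate words; other entities stand for one letter.
    return " " if match.group(1) in PUNCT_ENTITIES else "x"


def count_words(sgml_body):
    """Count whitespace-separated tokens after removing tags and comments."""
    text = TAG_RE.sub(" ", sgml_body)
    text = ENTITY_RE.sub(_entity_repl, text)
    return len(text.split())


def count_bytes(document, encoding="latin-1"):
    """Size of the full document (text, tags, header) in the given encoding."""
    return len(document.encode(encoding))
```

For example, `count_words('<p><s>Re&cy;e on.</s> <s>Da &mdash; ne.</s></p>')` counts four words under these assumptions, while `count_bytes` measures the same string with all of its markup included.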
- Publication Statement
- Distributor:
-
Computer Science Department
Faculty of Mathematics
- Address:
- Studentski trg 16, 11000 Belgrade, Yugoslavia (Serbia)
- Electronic address:
- email: cvetana@matf.bg.ac.yu
- Availability:
-
Available for research purposes upon receipt of signed agreement
- Publication date:
- January 1st, 1998
- Source Description
- Full Bibliography
- Title Statement
- Title:
- Orwell's 1984: electronic edition
- Responsibility
- Oxford Text Archive
(
The four versions of Orwell's 1984 in the OTA
were all prepared by the OUCS KDEM service in
1985 for Dr David C Bennett of the School of
Oriental And African Studies at London
University. The texts here have not been
encoded or proofread in any way since they were
produced (other than the English text, which was
converted to an SGML-like encoding by John
Price-Wilkin, and subsequently automatically
converted to conform to the OTA's DTD by myself
and Alan Morrison). The other languages were
converted to TEI-conformant SGML by the ECI
project 1993. --LB, Nov 1992
)
- Edition:
-
Public Domain TEI edition prepared at the Oxford Text
Archive
- Publication Statement
- Distributor:
- Oxford Text Archive
- Address:
-
Oxford University Computing Service
13 Banbury Road
Oxford OX2 6NN UK
archive@ox.ac.uk
- Availability:
-
Freely available for non-commercial
use provided that this header is included in its
entirety with any copy distributed
- Publication date:
- 19 Nov 1992
- Source Description
- Structured Bibliography
- Monograph
- Title:
- 1984
- Author:
- George Orwell
- Author:
- Translator: Vlada Stoji&lx;kovi&cx;
- Edition:
- Second edition
- Imprint
- Publication date:
- 1984
- Publisher:
- Beogradski izdava&cy;ko-grafi&cy;ki zavod
- Place:
- Beograd
Encoding Description
- Project Description:
- TELRI
- Editorial declaration:
- Conformance:
-
Corpus Encoding Specification, Version 4.3
- Correction:
-
Typographical mistakes corrected while preparing the electronic
edition, though not systematically.
- Quotation:
-
Rendition attribute values on HI, Q and QUOTE tags
are adapted from ISOpub and ISOnum standard entity set names.
The 'default' rendition of Q (PRE mdash) has not been included in Q.
- Hyphenation:
-
All end-of-line hyphenation removed.
- Segmentation:
-
Marked up to the level of paragraph: P, QUOTE, LIST, POEM
plus marking of particular sub-paragraph elements: NAME, Q.
Page breaks left in the document as comments.
- Hyphenation:
- End-of-line hyphenation present in the
OTA digital original.
- Tag declaration:
- abbr = 14
-
- body = 1
-
- date = 39
-
- div = 28
-
- foreign = 7
-
- head = 27
-
- hi = 323
-
- item = 4
-
- l = 32
-
- list = 1
-
- name = 1371
-
- note = 1
-
- p = 1282
-
- poem = 10
-
- q = 2245
-
Q tags with an attribute of "type=MI" have
been inserted automatically after S insertion.
- quote = 35
-
- s = 6643
-
S tags have been inserted automatically using the awk script
written for this purpose and then cleaned up by hand.
- text = 1
-
- title = 4
-
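The tag declaration above lists how often each element occurs in the text. A count of this kind can be sketched in Python as follows. The function name and the regular expression are assumptions for illustration. The sketch counts start-tags only and lower-cases element names, which may differ from how the original figures were obtained.

```python
import re
from collections import Counter

# A start-tag: '<', an element name, optional attributes, '>'.
# End-tags ('</p>') and comments ('<!-- -->') do not match.
START_TAG_RE = re.compile(r"<([A-Za-z][A-Za-z0-9.]*)(?:\s[^>]*)?>")


def tag_usage(sgml):
    """Return a Counter mapping lower-cased element names to occurrences."""
    return Counter(m.group(1).lower() for m in START_TAG_RE.finditer(sgml))


if __name__ == "__main__":
    sample = '<p id="Oshs.1.1.1"><s>A <name>B</name>.</s><s>C.</s></p>'
    for name, count in sorted(tag_usage(sample).items()):
        print(f"{name} = {count}")
```

The printed lines follow the same "name = count" layout as the declaration above, which makes a recomputed table easy to compare against the header after a revision.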
Revision Description
- Date: 1997-12-10 (Cvetana Krstev, Faculty of Mathematics, Belgrade)
- Corrected the header: number of words
and data about the publisher -- the right edition
was finally retrieved.
- A number of systematic errors were automatically corrected.
The errors were of the following types:
1. a word immediately followed by a start-tag;
2. an end-tag immediately followed by a word;
3. an entity (mdash or hellip) immediately followed by
a start-tag;
4. an end-tag immediately followed by an entity
(mdash or hellip).
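The four error types listed above are all cases of two items touching with no separator between them. A minimal Python sketch for locating such places is given below. The pattern labels, the function name, and the regular expressions are assumptions for illustration. The sketch only reports positions, since the header does not say how the automatic correction repaired them, and it does not catch a word that ends in a letter entity such as &cy;.

```python
import re

# One pattern per error type from the list; lookaheads keep matches non-overlapping.
PATTERNS = {
    "word before start-tag": re.compile(r"\w(?=<[A-Za-z])"),
    "end-tag before word": re.compile(r"</[^>]+>(?=\w)"),
    "entity before start-tag": re.compile(r"&(?:mdash|hellip);(?=<[A-Za-z])"),
    "end-tag before entity": re.compile(r"</[^>]+>(?=&(?:mdash|hellip);)"),
}


def find_adjacency_errors(sgml):
    """Return sorted (offset, label) pairs for each suspicious adjacency."""
    hits = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(sgml):
            hits.append((match.start(), label))
    return sorted(hits)


if __name__ == "__main__":
    sample = "rekao<q>Da</q>on &mdash;<q>Ne</q>&hellip;"
    for offset, label in find_adjacency_errors(sample):
        print(offset, label)
```

Reporting offsets rather than rewriting the text leaves the decision about each repair to a later, possibly manual, step.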
- Date: 1997-12-10 (Tomaz Erjavec)
- IDed structural elements: they are prefixed with
'Oshs' to distinguish them from the other, 'Croatian'
Serbo-Croatian translation, also contained in OTA
- Corrected some markup in body (chapter titles of 'The Book')
to better correspond to English edition
- Updated BYTECOUNT and DATE.UPDATED
- Date: 1997-12-18 (Tomaz Erjavec)
- H.TITLE, EDITIONSTMT, AVAILABILITY to final form
- Date: 1997-12-25 (Cvetana Krstev)
- Correction of some typographic errors:
1. 'upi&cx;uje' changed to 'upu&cx;uje' in sentence
id="Oshs.1.6.68.11" (scanning error);
2. A full stop inserted after 'besmislice' in sentence
id="Oshs.2.5.30.3" and new sentence tag inserted (scanning
error);
3. A blank inserted before a left parenthesis in sentence
id="Oshs.2.10.27.13.7";
4. A whole missing line inserted in sentence
id="Oshs.3.3.4.8". After 'u preponu, u' added:
'mo&sx;nice, u trti&cy;nu kost. Bilo je trenutaka kadaje'.
New sentence tag inserted.
5. A full stop missing after 're&cy;e on' in sentence
id="Oshs.3.5.27.1" and new sentence tag inserted.
6. 'o&cy;ajjno' changed to 'o&cy;ajno' in sentence
id="Oshs.3.7.13.5" (scanning error);
7. A full stop deleted after 'mrm&lx;aju&cx;i' in sentence
id="Oshs.2.10.33.11" and sentence tag deleted (error in
printed edition);
8. A full stop changed to a comma after 'Drug Oglivi' in
sentence id="Oshs.1.5.27.5" and a sentence tag deleted
(scanning error);
9. id attribute of p tag changed from "Oshs..44.14" to
"Oshs.2.44.14".
- As a result of hand-validation of the
Serbo-Croatian/English alignment, the following changes to
sentence tags were made:
1. Tag id="Oshs.1.3.12.2" inserted after a question mark in
sentence id="Oshs.1.3.12.1";
2. Tag id="Oshs.1.3.21.2" inserted after a question mark in
sentence id="Oshs.1.3.21.1";
3. Tag id="Oshs.1.6.10.2" inserted after a question mark in
sentence id="Oshs.1.6.10.1";
4. Tag id="Oshs.1.5.18.1.2" deleted after a date tag.
5. Tag id="Oshs.1.5.24.2" deleted before a left parenthesis
in sentence id="Oshs.1.5.24.1";
6. Tag id="Oshs.1.5.24.4" deleted after a right parenthesis
in sentence id="Oshs.1.5.24.3";
7. Tag id="Oshs.1.8.4.5" deleted after exclamation mark in
sentence id="Oshs.1.8.4.4";
8. Tag id="Oshs.2.3.16.4" deleted after exclamation mark in
sentence id="Oshs.2.3.16.3";
9. Tag id="Oshs.2.3.27.5" deleted after question mark in
sentence id="Oshs.2.3.27.4";
10. Tag id="Oshs.2.3.47.6" deleted after exclamation mark in
sentence id="Oshs.2.3.47.5";
11. Tag id="Oshs.2.5.8.2" deleted after exclamation mark in
sentence id="2.5.8.1";
12. Tag id="Oshs.2.11.31.3" deleted after a colon in
sentence id="Oshs.2.11.31.2". The type attribute with value "MI"
was removed from the following q tag;
13. Tag id="Oshs.3.2.25.2" deleted after exclamation mark in
sentence id="Oshs.3.2.25.1";
14. Tag id="Oshs.3.2.70.3" deleted after exclamation mark
in sentence id="Oshs.3.2.70.2";
15. Tag id="Oshs.3.4.13.2" deleted after exclamation mark
in sentence id="Oshs.3.4.13.1";
16. Tag id="Oshs.3.5.22.7" deleted after exclamation mark
in sentence id="Oshs.3.5.22.6";
17. Tag id="Oshs.3.5.22.10" deleted after exclamation mark
in sentence id="Oshs.3.5.22.9";
18. Tag id="Oshs.4.14.8" deleted after an abbreviation 'itd.'
in sentence id="Oshs.4.14.7" (more abbr tags added for
'itd.');
19. Tag id="Oshs.4.15.3" deleted after an abbreviation 'tj.'
in sentence id="Oshs.4.15.3" (more abbr tags added for
'tj.');
- Updated WORDCOUNT, BYTECOUNT, TAGUSAGE and DATE.UPDATED
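Several of the corrections in this revision entry concern element ids, including one id with an empty component ("Oshs..44.14"). A check for such malformed ids can be sketched in Python as below. The id shape used here (the 'Oshs' prefix followed by one or more dot-separated numbers) is an assumption inferred from the ids quoted in this log, and the function name is hypothetical.

```python
import re

# Assumed id shape: 'Oshs' followed by one or more '.<number>' components.
ID_RE = re.compile(r"Oshs(?:\.[0-9]+)+")


def is_well_formed_id(value):
    """True if the whole string matches the assumed id shape."""
    return ID_RE.fullmatch(value) is not None


if __name__ == "__main__":
    for candidate in ("Oshs.2.44.14", "Oshs..44.14", "Oshs.2.10.27.13.7"):
        print(candidate, is_well_formed_id(candidate))
```

A check like this only tests the shape of an id. It says nothing about whether the id is unique or whether its numbering matches the position of the element in the text.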
- Date: 1997-12-28 (Tomaz Erjavec)
- Revision clash; re-did header elements (H.TITLE, EDITIONSTMT, AVAILABILITY)
- Removed redundant whitespace from tags, after 'ID="..." '
- Corrected wrong sr entities in 'mocx;', '&;&cy;;'
Meta-Made by et