type: | corpus |
---|---|
id: | ijs-elan-sample.h |
language: | slen |
updated: | 1999-06-24 |
This parallel aligned corpus is freely available, provided that the sources described in this header or in the headers of its tei.2 text elements are acknowledged.
Ta vzporedni poravnani korpus je prosto dostopen, pod pogojem, da se citira njegove vire, dokumentirane v tej glavi ali v glavah njegovih tei.2 besedil.
This corpus is composed of 15 texts:
This IJS-ELAN corpus is part of the LJU1 site contribution to the EU MLIS project ELAN: European Language Activity Network. For more information, see the IJS-ELAN homepage <http://nl.ijs.si/elan/> and the ELAN project homepage <http://solaris3.ids-mannheim.de/elan/>.
All formatting removed from originals.
Only ASCII characters and SGML entities used: see the DTD for defined entities.
List bullets normalised to -, or left as *
Quotation marks converted to " or '
Start / end quote is indicated with the open / close values of the TYPE attribute of C.
Segmentation into translation units and segments performed semi-automatically with various tools.
Tokenisation performed with MULTEXT mtseg; results corrected with Perl & Emacs.
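The quote and bullet normalisation described above could be sketched roughly as follows. This is an illustration only, not the procedure actually used for the corpus: the function name `normalise`, the character mapping `QUOTE_MAP`, and the set of bullet characters are all assumptions, and the header does not say which typographic characters occurred in the originals.

```python
import re

# Assumed mapping from typographic quotes to ASCII " or ' (hypothetical;
# the characters present in the original texts are not documented).
QUOTE_MAP = {
    "\u201c": '"', "\u201d": '"', "\u201e": '"', "\u00ab": '"', "\u00bb": '"',
    "\u2018": "'", "\u2019": "'", "\u201a": "'",
}
_QUOTES = str.maketrans(QUOTE_MAP)

# Assumed bullet characters at the start of a line are normalised to "-".
# A leading "*" is not in the set, so it is left as it is.
_BULLET = re.compile(r"^(\s*)[\u2022\u2013\u2014\u00b7]\s+", re.MULTILINE)


def normalise(text: str) -> str:
    """Convert typographic quotes to ASCII and list bullets to '-'."""
    text = text.translate(_QUOTES)
    return _BULLET.sub(r"\1- ", text)
```

The sketch does not mark quotes as opening or closing; in the corpus that distinction is recorded with the TYPE attribute of C, which would need a separate pass.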