Slovene Dependency Treebank
Version 0.4


TEI Header

type: corpus
created: ET
date: 2005-12-15 (created) 2006-05-17 (updated)

File Description

Title Statement:
Title:
Slovene Dependency Treebank
Responsibility Statement:
Tomaž Erjavec
Jožef Stefan Institute, Ljubljana
tomaz.erjavec@ijs.si
Project coordination, all things TEI
Responsibility Statement:
Nina Ledinek
Faculty of Arts, Ljubljana
nina.ledinek@siol.net
Syntactic annotation
Responsibility Statement:
Zdeněk Žabokrtský
Center for Computational Linguistics, Prague
zabokrtsky@ckl.mff.cuni.cz
TrEd macros for annotation of Slovene, parser of Slovene
Responsibility Statement:
Petr Pajas
Center for Computational Linguistics, Prague
pajas@ckl.mff.cuni.cz
Author of tree editor TrEd (and its backend to TEI-XML)
Responsibility Statement:
Andreja Žele
Fram Ramovš Institute of the Slovenian Language
Scientific Research Center of the Slovene Academy of Sciences and Arts
Ljubljana
Annotation advisor
Responsibility Statement:
Sašo Džeroski
Jožef Stefan Institute, Ljubljana
saso.dzeroski@ijs.si
Project advisor
Edition Statement:
Edition:
Version 0.4
Publication Statement:
Place of publication:
http://nl.ijs.si/sdt/
Availiability (restricted) :

Available for purposes of academic research and teaching only. When using SDT, please acknowledge this in your publications, making reference to the following paper:
Sašo Džeroski , Tomaž Erjavec , Nina Ledinek , Petr Pajas , Zdenek Žabokrtský , Andreja Žele : Towards a Slovene Dependency Treebank . In: Proceedings of the Fifth International Conference on Language Resources and Evaluation, LREC'06, Genoa, Italy, 2006.
Please check http://nl.ijs.si/sdt/ for a copy of this paper as well as more recent publications on the subject.

Date:
May 17th, 2006
Source Description:
  • Multext-East cesAna: Nineteen Eighty-Four, Slovene
    Edition: MULTEXT-East, Version 3 First third of novel (Part I.) http://nl.ijs.si/ME/

Encoding Description

Project description:

SDT: Slovene Dependency Treebank

http://nl.ijs.si/sdt/

Tags declaration:
text = 1
body = 1
div = 9
p = 362
s = 1998
gap = 1
w = 29991
c = 6563

Profile Description

Language use:
sl = Slovene
en = English

Revision Description


TEI Header

type: afun-library
created: et
date: 2005-12-15 (created) 2006-05-17 (updated)

File Description

Title Statement:
Title:
Slovene Dependency Treebank: Specifications of Analytical Functions
Responsibility Statement:
Tomaž Erjavec, IJS
Editor
Edition Statement:

Version 0.2

Publication Statement:
Distributor:
Slovene Dependency Treebank Web site

http://nl.ijs.si/sdt/
Availiability (free) :

Freely available.

Date:
May 17th, 2006
Source Description:
Monograph:
Title:
ANNOTATIONS AT ANALYTICAL LEVEL
Title:
Instructions for annotators
Eva Hajičová, Petr Sgall (ed.)
Imprint:
Place of publication:
http://ufal.mff.cuni.cz/pdt2.0/doc/manuals/en/a-layer/html/index.html
11.10.1999

Encoding Description

Project description:

Slovene Dependency Treebank Project

Editorial declaration:
This document expresses PDT analytical functions as TEI features and feature-structures.

Revision Description


TEI Header

type: msd-library
created: et
date: 2000-10-30 (created) 2006-05-17 (updated)

File Description

Title Statement:
Title:
MULTEXT-East Morphosyntactic Specifications
Responsibility Statement:
Tomaž Erjavec, IJS
Editor
Responsibility Statement:
Nancy Ide, Vassar
English data
Radoslav Pavlov, L.Dimitrova, Ludmila Sinapova, Kiril Simov
Bulgarian specification
Vladimír Petkevič
Czech specification
Heiki-Jaan Kaalep
Estonian specification
Nancy Ide, Greg Priest-Dorman, Tomaž Erjavec, Tamas Varadi
English specification
Laszlo Tihanyi, Tamas Varadi
Hungarian specification
Dan Tufiş, Anna Maria Barbu
Romanian specification
Tomaž Erjavec, Peter Holozan, Vojko Gorjanc, Marko Stabej
Slovene specification
Marko Tadić
Croatian specification
Cvetana Krstev, Duško Vitas
Serbian specification
Han Steenwijk
Resian specification
EU Copernicus Project COP106 "MULTEXT-East" EU Copernicus Project PL96-1142 "Concede" Individual partners' grants and contracts.
Edition Statement:

MULTEXT-East Morphosyntactic Specifications, Version 3

Publication Statement:
Distributor:
MULTEXT-East Web site

http://nl.ijs.si/ME/V3/msd/
Availiability (free) :

Freely available.

Date:
May 5th, 2004
Source Description:
Monograph:
Title:
MULTEXT-East Morphosyntactic Specifications, Concede Edition
Tomaž Erjavec (ed.)
Imprint:
Place of publication:
http://nl.ijs.si/ME/V2/msd/
2001-04-09

Encoding Description

Project description:

MULTEXT-East: Multilingual Text Tools and Corpora for Central and Eastern European Languages. EU Copernicus Project COP106

Concede: Consortium for Central European Dictionary Encoding. EU Copernicus Project PL96-1142

Tags declaration:
fLib = 14
Library for Morphosyntactic Specifications, one for each category. Attributes are:
  • type: gives the name of the category (part-of-speech) that the contained features are describing
f = 524
Defined features, one for each category, and one for each defined attribute-value pair. Attributes are:
  • id: the identifier of the feature composed of the part-of-speech code, the feature number, a period and the feature code, e.g. id="A1.f".
  • select: the languages that the feature-value is appropriate for
  • name: the name of the attribute; "PoS" for category.
sym = 524
A feature-value. Attributes are:
  • value: the name of the value
fsLib = 14
Library of Morphosyntactic Descriptions, one for each category. Attributes are:
  • type: gives the name of the category (part-of-speech) that the contained MSDs belong to
fs = 6278
Valid morphosyntactic description. Attributes are:
  • id: the lexical/corpus MSD
  • select: the languages that the MSD is appropriate for
  • feats: references to the definitions of a attribute-values.

Revision Description


TEI Header

type:
created: ET
date: 2005-12-15 (created) 2006-05-17 (updated)

File Description

Title Statement:
Title:
Slovene Dependency Treebank: Nineteen Eighty-Four, Part I.
Edition Statement:
Edition:
Version 0.4
Publication Statement:
Place of publication:
http://nl.ijs.si/sdt/
Availiability (restricted) :

Available for purposes of academic research and teaching only. When using SDT, please acknowledge this in your publications, making reference to the following paper:
Sašo Džeroski , Tomaž Erjavec , Nina Ledinek , Petr Pajas , Zdenek Žabokrtský , Andreja Žele : Towards a Slovene Dependency Treebank . In: Proceedings of the Fifth International Conference on Language Resources and Evaluation, LREC'06, Genoa, Italy, 2006.
Please check http://nl.ijs.si/sdt/ for a copy of this paper as well as more recent publications on the subject.

Date:
May 17th, 2006
Source Description:
Title Statement:
Title:
Multext-East cesAna: Nineteen Eighty-Four, Slovene
Responsibility Statement:
Tomaž Erjavec
Overall Responsibility
Edition Statement:
Edition:
MULTEXT-East, Version 3
Publication Statement:
Distributor:
Dept. of Knowledge Technologies, Jožef Stefan Institute

Jamova 39
SI-1000 Ljubljana
Slovenia

tomaz.erjavec at ijs.si
http://nl.ijs.si/ME/
Availiability (restricted) :

Freely available for non-commercial use provided that this Header is included in its entirety with any copy distributed

Date:
November 1st, 2000
Source Description:
Title Statement:
Title:
Multext-East cesAna: Nineteen Eighty-Four, Slovene
Responsibility Statement:
Tomaž Erjavec
Overall Responsibility
Aleksandra Bizjak, Primož Jakopin
Tagging
Tomaž Erjavec
Tagging correction
Conversion to TEI
Edition Statement:
Edition:
MULTEXT-East Final Release
Publication Statement:
Distributor:
Dept. for Intelligent Systems, Jožef Stefan Institute

Jamova 39
SI-1000 Ljubljana,
Slovenia

tomaz.erjavec at ijs.si
http://nl.ijs.si/ME/
Availiability (restricted) :

Available for research purposes upon receipt of signed agreement.

Date:
January 1st, 1998
Source Description:
Title Statement:
Title:
Multext-East CES1: Nineteen Eighty-Four, Slovene
Publication Statement:
Distributor:
Dept. for Intelligent Systems, Jozef Štefan Institute

Jamova 39
SI-1000 Ljubljana
Slovenia

tomaz.erjavec@ijs.si
http://nl.ijs.si/ME/
Date:
October 1, 1997
Source Description:
Title Statement:
Title:
The European Corpus Initiative Multilingual Corpus 1: 1984 by George Orwell (Slovene)
Responsibility Statement:
Association for Computational Linguistics
Converted from OTA's DTD to ECI DTD
Publication Statement:
Distributor:
ACL

ACL
Date:
1994
Source Description:
Title Statement:
Title:
Orwell's 1984: electronic edition
Responsibility Statement:
Oxford Text Archive
The four versions of Orwell's 1984 in the OTA were all prepared by the OUCS KDEM service in 1985 for Dr David C Bennett of the School of Oriental And African Studies at London University. The texts here have not been encoded or proofread in any way since they were produced (other than the English text, which was converted to an SGML like encoding by John Price-Wilkin, and subsequently automatically converted to conform to the OTA's dtd by myself and Alan Morrison. The other languages were converted to TEI conformant SGML by the ECI project 1993.) --LB, Nov 1992
Edition Statement:

Public Domain TEI edition prepared at the Oxford Text Archive

Publication Statement:
Distributor:
Oxford Text Archive

Oxford University Computing Service
13 Banbury Road
Oxford OX2 6NN UK
archive@ox.ac.uk
Availiability (restricted) :

Freely available for non-commercial use provided that this Header is included in its entirety with any copy distributed

Date:
19 Nov 1992
Source Description:
Monograph:
Title:
1984
George Orwell Translator: Alenka Puhar
Imprint:
1983
Publisher:
Knjižnica Kondor
Publisher:
Mladinska knjiga
Place of publication:
Ljubljana

Revision Description