Also included here are the additional languages, whose resources have been added due to the TELRI Concerted action.
English is the meta-language of the project.
ISO designations:
ISO 639 code: | en |
---|---|
ISO 8859 character set: | ISO 8859-1 (Latin 1) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
HTML resources:
1984 | Report | Header | Sampler |
---|---|---|---|
Speech | Report | Header | Sampler |
Lexicon | Report | Sampler | |
Morphosyntax | Report |
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-en.* |
---|---|
Speech Corpus: | corp/spch/*-en.* |
Word-Form Lexicon: | lexi/*-en.* |
MULTEXT tool resources: | tool/Multext/en |
Data on the language from the Ethnologue. http://www.ethnologue.com/show_language.asp?code=SLV
ISO designations:
ISO 639 code: | bg |
---|---|
ISO 8859 character set: | ISO 8859-5 (ISO Cyrillic) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Russian Cyrillic//EN |
ISO 8879-1986//ENTITIES Non Russian Cyrillic//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|---|---|---|---|
Fiction | Report | Header | Sampler | |
Newspapers | Report | Header | Sampler | |
Speech | Report | Header | Sampler | |
Lexicon | Report | Sampler | ||
Morphosyntax | Report |
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-bg.* |
---|---|
Comparable Corpus: | corp/comp/*-bg.* |
Speech Corpus: | corp/spch/*-bg.* |
Word-Form Lexicon: | lexi/*-bg.* |
MULTEXT tool resources: | tool/Multext/bg |
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | cs |
---|---|
ISO 8859 character set: | ISO 8859-2 (Latin 2) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|---|---|---|---|
Fiction | Report | Header | Sampler | |
Newspapers | Report | Header | Sampler | |
Speech | Report | Header | Sampler | |
Lexicon | Report | Sampler | ||
Morphosyntax | Report |
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-cs.* |
---|---|
Comparable Corpus: | corp/comp/*-cs.* |
Speech Corpus: | corp/spch/*-cs.* |
Word-Form Lexicon: | lexi/*-cs.* |
MULTEXT tool resources: | tool/Multext/cs |
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | et |
---|---|
ISO 8859 character set: | ISO 8859-10 (ISO Latin 6) |
sloppily: | ISO 8859-2 (ISO Latin 2) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|---|---|---|---|
Fiction | Report | Header | Sampler | |
Newspapers | Report | Header | Sampler | |
Speech | Report | Header | Sampler | |
Lexicon | Report | Sampler | ||
Morphosyntax | Report |
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-et.* |
---|---|
Comparable Corpus: | corp/comp/*-et.* |
Speech Corpus: | corp/spch/*-et.* |
Word-Form Lexicon: | lexi/*-et.* |
MULTEXT tool resources: | tool/Multext/et |
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | hu |
---|---|
ISO 8859 character set: | ISO 8859-2 (Latin 2) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|---|---|---|---|
Fiction | Report | Header | Sampler | |
Newspapers | Report | Header | Sampler | |
Speech | Report | Header | Sampler | |
Lexicon | Report | Sampler | ||
Morphosyntax | Report |
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-et.* |
---|---|
Comparable Corpus: | corp/comp/*-et.* |
Speech Corpus: | corp/spch/*-et.* |
Word-Form Lexicon: | lexi/*-et.* |
MULTEXT tool resources: | tool/Multext/et |
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | ro |
---|---|
ISO 8859 character set: | ISO 8859-2 (Latin 2) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|---|---|---|---|
Fiction | Report | Header | Sampler | |
Newspapers | Report | Header | Sampler | |
Speech | Report | Header | Sampler | |
Lexicon | Report | Sampler | ||
Morphosyntax | Report |
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-ro.* |
---|---|
Comparable Corpus: | corp/comp/*-ro.* |
Speech Corpus: | corp/spch/*-ro.* |
Word-Form Lexicon: | lexi/*-ro.* |
MULTEXT tool resources: | tool/Multext/ro |
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | sl |
---|---|
ISO 8859 character set: | ISO 8859-2 (Latin 2) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|---|---|---|---|
Fiction | Report | Header | Sampler | |
Newspapers | Report | Header | Sampler | |
Speech | Report | Header | Sampler | |
Lexicon | Report | Sampler | ||
Morphosyntax | Report |
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-sl.* |
---|---|
Comparable Corpus: | corp/comp/*-sl.* |
Speech Corpus: | corp/spch/*-sl.* |
Word-Form Lexicon: | lexi/*-sl.* |
MULTEXT tool resources: | tool/Multext/sl |
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | lv |
---|---|
ISO 8859 character set: | ISO 8859-10 (ISO Latin 6) |
sloppily: | ISO 8859-2 (ISO Latin 2) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | lt |
---|---|
ISO 8859 character set: | ISO 8859-10 (ISO Latin 6) |
sloppily: | ISO 8859-2 (ISO Latin 2) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-lt.* |
---|
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | sh |
---|---|
ISO 8859 character set: | ISO 8859-2 (ISO Latin 2) |
or | ISO 8859-5 (ISO Cyrillic) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
ISO 8879-1986//ENTITIES Added Latin 2//EN | |
or | ISO 8879-1986//ENTITIES Russian Cyrillic//EN |
ISO 8879-1986//ENTITIES Non Russian Cyrillic//EN |
HTML resources:
1984 | Report | Header | Sampler | Alignment |
---|
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-sc.* |
---|
Data on the language from the Ethnologue.
ISO designations:
ISO 639 code: | ru |
---|---|
ISO 8859 character set: | ISO 8859-5 (ISO Cyrillic) |
ISO 8879 entities: | ISO 8879-1986//ENTITIES Russian Cyrillic//EN |
HTML resources:
1984 | Header | Sampler |
---|
Resources (WWW access restricted):
'1984' Corpus: | corp/1984/*-ru.* |
---|