Also included here are the additional languages, whose resources have been added due to the TELRI Concerted action.
English is the meta-language of the project.
ISO designations:
| ISO 639 code: | en |
|---|---|
| ISO 8859 character set: | ISO 8859-1 (Latin 1) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
HTML resources:
| 1984 | Report | Header | Sampler |
|---|---|---|---|
| Speech | Report | Header | Sampler |
| Lexicon | Report | Sampler | |
| Morphosyntax | Report |
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-en.* |
|---|---|
| Speech Corpus: | corp/spch/*-en.* |
| Word-Form Lexicon: | lexi/*-en.* |
| MULTEXT tool resources: | tool/Multext/en |
Data on the language from the Ethnologue. http://www.ethnologue.com/show_language.asp?code=SLV
ISO designations:
| ISO 639 code: | bg |
|---|---|
| ISO 8859 character set: | ISO 8859-5 (ISO Cyrillic) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Russian Cyrillic//EN |
| ISO 8879-1986//ENTITIES Non Russian Cyrillic//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|---|---|---|---|
| Fiction | Report | Header | Sampler | |
| Newspapers | Report | Header | Sampler | |
| Speech | Report | Header | Sampler | |
| Lexicon | Report | Sampler | ||
| Morphosyntax | Report |
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-bg.* |
|---|---|
| Comparable Corpus: | corp/comp/*-bg.* |
| Speech Corpus: | corp/spch/*-bg.* |
| Word-Form Lexicon: | lexi/*-bg.* |
| MULTEXT tool resources: | tool/Multext/bg |
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | cs |
|---|---|
| ISO 8859 character set: | ISO 8859-2 (Latin 2) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|---|---|---|---|
| Fiction | Report | Header | Sampler | |
| Newspapers | Report | Header | Sampler | |
| Speech | Report | Header | Sampler | |
| Lexicon | Report | Sampler | ||
| Morphosyntax | Report |
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-cs.* |
|---|---|
| Comparable Corpus: | corp/comp/*-cs.* |
| Speech Corpus: | corp/spch/*-cs.* |
| Word-Form Lexicon: | lexi/*-cs.* |
| MULTEXT tool resources: | tool/Multext/cs |
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | et |
|---|---|
| ISO 8859 character set: | ISO 8859-10 (ISO Latin 6) |
| sloppily: | ISO 8859-2 (ISO Latin 2) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
| ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|---|---|---|---|
| Fiction | Report | Header | Sampler | |
| Newspapers | Report | Header | Sampler | |
| Speech | Report | Header | Sampler | |
| Lexicon | Report | Sampler | ||
| Morphosyntax | Report |
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-et.* |
|---|---|
| Comparable Corpus: | corp/comp/*-et.* |
| Speech Corpus: | corp/spch/*-et.* |
| Word-Form Lexicon: | lexi/*-et.* |
| MULTEXT tool resources: | tool/Multext/et |
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | hu |
|---|---|
| ISO 8859 character set: | ISO 8859-2 (Latin 2) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|---|---|---|---|
| Fiction | Report | Header | Sampler | |
| Newspapers | Report | Header | Sampler | |
| Speech | Report | Header | Sampler | |
| Lexicon | Report | Sampler | ||
| Morphosyntax | Report |
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-et.* |
|---|---|
| Comparable Corpus: | corp/comp/*-et.* |
| Speech Corpus: | corp/spch/*-et.* |
| Word-Form Lexicon: | lexi/*-et.* |
| MULTEXT tool resources: | tool/Multext/et |
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | ro |
|---|---|
| ISO 8859 character set: | ISO 8859-2 (Latin 2) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|---|---|---|---|
| Fiction | Report | Header | Sampler | |
| Newspapers | Report | Header | Sampler | |
| Speech | Report | Header | Sampler | |
| Lexicon | Report | Sampler | ||
| Morphosyntax | Report |
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-ro.* |
|---|---|
| Comparable Corpus: | corp/comp/*-ro.* |
| Speech Corpus: | corp/spch/*-ro.* |
| Word-Form Lexicon: | lexi/*-ro.* |
| MULTEXT tool resources: | tool/Multext/ro |
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | sl |
|---|---|
| ISO 8859 character set: | ISO 8859-2 (Latin 2) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|---|---|---|---|
| Fiction | Report | Header | Sampler | |
| Newspapers | Report | Header | Sampler | |
| Speech | Report | Header | Sampler | |
| Lexicon | Report | Sampler | ||
| Morphosyntax | Report |
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-sl.* |
|---|---|
| Comparable Corpus: | corp/comp/*-sl.* |
| Speech Corpus: | corp/spch/*-sl.* |
| Word-Form Lexicon: | lexi/*-sl.* |
| MULTEXT tool resources: | tool/Multext/sl |
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | lv |
|---|---|
| ISO 8859 character set: | ISO 8859-10 (ISO Latin 6) |
| sloppily: | ISO 8859-2 (ISO Latin 2) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
| ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | lt |
|---|---|
| ISO 8859 character set: | ISO 8859-10 (ISO Latin 6) |
| sloppily: | ISO 8859-2 (ISO Latin 2) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
| ISO 8879-1986//ENTITIES Added Latin 2//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-lt.* |
|---|
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | sh |
|---|---|
| ISO 8859 character set: | ISO 8859-2 (ISO Latin 2) |
| or | ISO 8859-5 (ISO Cyrillic) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Added Latin 1//EN |
| ISO 8879-1986//ENTITIES Added Latin 2//EN | |
| or | ISO 8879-1986//ENTITIES Russian Cyrillic//EN |
| ISO 8879-1986//ENTITIES Non Russian Cyrillic//EN |
HTML resources:
| 1984 | Report | Header | Sampler | Alignment |
|---|
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-sc.* |
|---|
Data on the language from the Ethnologue.
ISO designations:
| ISO 639 code: | ru |
|---|---|
| ISO 8859 character set: | ISO 8859-5 (ISO Cyrillic) |
| ISO 8879 entities: | ISO 8879-1986//ENTITIES Russian Cyrillic//EN |
HTML resources:
| 1984 | Header | Sampler |
|---|
Resources (WWW access restricted):
| '1984' Corpus: | corp/1984/*-ru.* |
|---|