Natural Language Server
nl.ijs.si
Here you can find services and data resources primarily for Slovene
but also for other languages. The resources have been produced at JSI
or in projects where JSI was a partner; they are made as freely
avaiable as possible, which in often means it is possible to download
the complete resource (such as a corpus) under a Creative Commons
licence.
 |
OAI-PMH
Basic Metadata Descriptions
for language resources @ nl.ijs.si:
OLAC
or
CLARIN
format
|
c.f.
FLaReNet/CLARIN Harvesting Day
Resources
- MULTEXT-East:
Multilingual corpus, lexical resources and morphosyntactic specifications
- JOS:
Linguistic annotation of Slovene: corpora and services for linguistic annotatiom
- eZISS:
Scholarly digital editions of Slovene literature
- AHLib:
Digital library and corpus of XIXth century Slovene books
- SDT:
Slovene dependency treebank
- SVEZ-IJS:
Slovene-English parallel corpus of EU legal texts
- IJS-ELAN:
Slovene-English parallel corpus
Services
- ToTaLe:
on-line lemmatisation and MSD tagging of Slovene texts.
- Concordances:
- SBL:
the Slovenian Biographical Lexicon
- jaSlo:
Japanese - Slovene on-line learner's dictionary
Intitiatives
- SDJT:
the Slovenian Language Technologies Society
- GNUsl: an Open Source
effort for Slovene localisation (a bit dated)
The language resources on nl.ijs.si
are typically stored in XML, according to the
Text Encoding Initiative
Guidelines.
This server is operational since 1994 and was among the first
http servers in Slovenia; it is running Linux, with Apache and
Fedora Commons, and many many CGI scripts.
Related
Tomaž Erjavec, 2011-05-31