Research on Natural Language
@ the Dept. of Knowledge Technologies
Jozef Stefan Institute


Language related research at the Department

The department is involved in various areas of computational linguistics, natural language processing and Human Language Technologies, which focus, to a large extent, on the Slovene language. Areas of expertise include standards for text encoding, linguistic annotation of textual data, development and processing of mono- and multilingual language corpora, machine learning of language structure, text mining, information retrieval and extraction, terminology extraction, computer-aided translation, computational lexicography and production of complex digital editions.

We actively promote the development of HLT for the Slovene language; we are among the founding members of the Slovenian Language Technologies Society, which organises bi-annual conferences, while the language resources we produce are encoded according to international standards (in particular, TEI) and freely downloadable for research use.

Related areas of research at the department are Text and Web Mining and Learning Language in Logic.

E8 JSI HLT Resources and Services

The department has been involved in numerous projects that deal with the compilation of language resources, mainly for Slovene in a multilingual setting. Whenever possible, we make the results publicly available.

People

The main people at the department who are involved in various areas of HLT are:


Language Related Projects at the Department

Slovene projects:

EU projects:

Bilateral projects:

Old projects

Slovene projects:

EU projects:

Organisation of HLT-related events


History

Research into Natural Language Processing has been carried out at the Jozef Stefan Institute since the 1970s, originally at the Dept. for Computer Systems, E4. The head of the E4 Laboratory for Natural Language was Dr. Peter Tancig. In 1995, the Lab was merged with the Artificial Intelligence Laboratory into the Dept. of Intelligent Systems, E8.

The members of the former lab were for a while known as the Language and Speech (Technologies) Group and cooperated in the project RR(S)J: Computational Understanding of (the Slovene) Language. Many of the students and researchers, however, left to lead different lives, while members of the ex-AI lab became involved in various aspects of processing natural language; the boundary between the 'natural language' and 'artificial intelligence' members of the department thus became rather blurry.

Listed below are former members of and students at the NL Laboratory:

In 2004 the Department split into two new departments: E8, the Dept. of Knowledge Technologies, and E9, the Dept. of Intelligent Systems. HLT activities continue in both departments, although this page only documents the work at the Dept. of Knowledge Technologies.

Page last updated 2006-05-18, et