next up previous contents
Next: Contents

MULTEXT-EAST Multilingual Text Tools and Corpora for Eastern and Central European Languages

Copernicus Joint Project: COP 106

Abstract:

The language industries rely increasingly on the availability of large-scale language resources, appropriate software tools, and standards to make them maximally reusable. However, while the development of resources, tools, and standards is well on its way for EU languages, there have been very few comparable efforts for the languages of Central and Eastern Europe (CEE). MULTEXT-East is intended to fill this gap by developing resources in CEE languages and adapting existing tools and standards to them.

MULTEXT-East is a spin-off of the LRE project MULTEXT , one of the largest EU projects in the domain of language tools and resources. MULTEXT-East will extend the scope of MULTEXT by transferring its expertise, methodologies, and tools to CEE countries. Together, MULTEXT and MULTEXT-East will create a unique network of more than twenty academic research centers and companies, all developing and using common lingware and methodologies, as well as producing the first annotated large-scale multilingual corpus for 12 EU and CEE languages.

MULTEXT-East involves eight partners. It started in May 1995 and will last for two years.

This document is based on the technical annex of MULTEXT-East . It was created with LaTeX2HTML; both the WWW and the DVI versions of the document are available. The MULTEXT-East sites are linked, in the WWW document, to the Site Profiles of ElsNet Nodes and to the Site Profiles of Language Engineering Organisations in Central and Eastern Europe, collected by ElsNet .





Tomaz Erjavec
Mon May 20 13:01:13 MDT 1996