
Tools

 

To keep the project feasible, MULTEXT uses only state-of-the-art methods in tool development, and applies them to produce a set of tools that is freely available, coherent, extensible, and language-independent. The tools are implemented under UNIX. All MULTEXT tools follow an engine-based design in which all language-dependent material is supplied as data. Extending the tools in MULTEXT-East to cover Central and Eastern European (CEE) languages will therefore generally involve only providing the appropriate tables and rules. Some adjustments to the engines are nevertheless expected, given the new range of problems posed by different language families. The tools fall into two general categories:

Corpus annotation tools:

Corpus exploitation tools:

All tools are integrated by means of a common user interface into a general-purpose corpus manipulation system suitable for NLP research.
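
The engine-based design described above, with a language-independent engine and language-dependent material supplied as data, can be illustrated with a minimal sketch. This is not MULTEXT code: the names LanguageData and tokenize, and the abbreviation tables, are hypothetical and chosen only to show how a new language could be covered by supplying a new table rather than changing the engine.

```python
import re
from dataclasses import dataclass, field


@dataclass(frozen=True)
class LanguageData:
    """Hypothetical container for language-dependent resources (data only)."""
    name: str
    abbreviations: frozenset = field(default_factory=frozenset)


# Illustrative tables only; they are assumptions, not MULTEXT resources.
ENGLISH = LanguageData("en", frozenset({"e.g.", "Dr."}))
SLOVENE = LanguageData("sl", frozenset({"npr.", "dr."}))

# Splits a chunk into a word part and any trailing punctuation marks.
_TRAILING_PUNCT = re.compile(r"^(.*?)([.,;:!?]*)$")


def tokenize(text, lang):
    """Language-independent engine.

    Splits on whitespace and detaches trailing punctuation, unless the
    chunk is listed as an abbreviation in the supplied language data.
    """
    tokens = []
    for chunk in text.split():
        if chunk in lang.abbreviations:
            # The data table, not the engine, decides what stays whole.
            tokens.append(chunk)
            continue
        word, punct = _TRAILING_PUNCT.match(chunk).groups()
        if word:
            tokens.append(word)
        tokens.extend(punct)  # one token per punctuation mark
    return tokens


if __name__ == "__main__":
    print(tokenize("Dr. Novak arrived.", ENGLISH))
    print(tokenize("dr. Novak je prišel.", SLOVENE))
```

In this sketch the same tokenize function serves both languages; only the table passed to it differs. Where a language raises problems that a table cannot express, the engine itself would need adjusting, which is the situation the text anticipates.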



Tomaz Erjavec
Mon May 20 13:01:13 MDT 1996