Next: Description of the
Up: MULTEXT-East work and
Previous: MULTEXT-East work and
As regards the types of information to be included in the lexicons and
the ``fine-grainedness'' of distinctions, the MULTEXT approach can
simply be described in the following way:
- in principle, all the distinctions that seemed to complicate the
process of disambiguation by means of an automatic tagger have been
discarded;
- a specification for the encoding of a distinction has been
included if it affects the word-form level, i.e. if a different
graphical word-form is recognized by that feature.
The motivation for this is to provide the information necessary for our
final purpose, i.e. automatic corpus tagging.
Tomaz Erjavec
Wed Oct 16 12:08:36 MDT 1996