\documentclass[a4paper,11pt]{report}
\setlength{\parskip}{1ex}
\setlength{\parindent}{0in}
\setlength{\textwidth}{6.3in}
\setlength{\textheight}{8.8in}
\setlength{\topmargin}{-0.25in}
\setlength{\headheight}{.25in}
\setlength{\headsep}{.25in}
\setlength{\oddsidemargin}{0in}
\setlength{\evensidemargin}{0in}

%\pagestyle{multipage}
\pagestyle{myheadings}

\newcommand{\delID}{Deliverable D1.1 F}

\newcommand{\mte}{{\sc multext}-East}
\newcommand{\multext}{{\sc multext}}
\newcommand{\eagles}{{\sc eagles}}

%\newcommand{\mte}{Multext-East}
%\newcommand{\multext}{Multext}
%\newcommand{\eagles}{Eagles}

\begin{document}
\thispagestyle{empty}
\begin{center}

\vspace{1.5in}

{\Huge\bf Specifications and Notation for Lexicon Encoding}

\vspace{0.5in}

{\Large\bf
COP Project 106 \mte

Work Package WP1 -- Task 1.1

\delID
}


\vspace{0.5in}

{\Large\bf
Final Report

29 August 1997
%\today

%Work in Progress
}

\vspace{1in}

{\large
Workpackage Coordinators:\\
        CNRS - Nancy Ide and Jean Veronis

Task Leader:\\
        LJUBLJANA - Toma\v{z} Erjavec\\
        PISA - Monica Monachini

\vspace{0.5in}

Contributors:\\
Bulgarian: R.Pavlov, L.Dimitrova, L.Sinapova, K.Simov \\
Czech: V.Petkevi\v c\\
Estonian: H.J.Kaalep\\
Hungarian: L.Tihanyi\\
Romanian: D.Tufi\c{s}, A.M.Barbu\\
Slovene: T.Erjavec, P.Holozan
}

\vspace{0.5in}
\end{center}

\renewcommand{\thepage}{\roman{page}}
\markright{COP project 106 \mte{\hfill} \delID
           Lexicon specifications and notation{\hfill}}

\tableofcontents


\chapter{Introduction}
\renewcommand{\thepage}{\arabic{page}}
\setcounter{page}{1}
\markright{COP project 106 \mte{\hfill} \delID --- Introduction{\hfill}}

The purpose of this document is to (i) provide harmonized lexical
specifications for the six languages involved in the \mte\ project ---
Bulgarian, Czech, Estonian, Hungarian, Romanian, Slovene --- and (ii)
formulate the relevant notation to be used in the lexicons contributed
by each language group.

The two proposals for lexicon specifications presented in Deliverable
D1-6-1B of the \multext\ project (Bel, Calzolari and Monachini eds.
1995) and in the \eagles\ document of the Lexicon sub-group on
morphosyntactic annotation (Monachini and Calzolari 1995), on which
the former is based, together constitute the starting point and the
model for the work and results presented here.

The \mte\ partners have evaluated the two proposals from the point of
view of the coverage with respect to their languages, have added the
specifications needed to encode the peculiarities of their languages
and have produced concrete applications of the proposed
specifications.  The work proceeded through a cyclical process of
adjustment and re-application, giving rise to three versions of this
document: the initial IM1 version, coordinated by \textsc{pisa}; a
subsequent, significantly revised and validated version at project
milestone M; and this final version, which incorporates the last
changes to the language-specific tables. The latter two versions were
coordinated by \textsc{ljubljana}.

The result is the common proposal for lexicon specifications of the
Central \& Eastern European languages contained in Chapter
\ref{chp:common}. On the basis of the \eagles\ and \multext\ models,
they are presented as sets of attribute-values --- displayed in
tabular format --- and the notation proposed follows the ``string of
characters in fixed positions'' strategy.

The applications of the proposal to the six languages are given in
Chapter \ref{chp:LSA}. They have been contributed respectively by:
\begin{description}
\item[Bulgarian:] R.Pavlov, L.Dimitrova, L.Sinapova and K.Simov;
\item[Czech:] V.Petkevi\v{c};
\item[Estonian:] H.J.Kaalep;
\item[Hungarian:] L.Tihanyi;
\item[Romanian:] D.Tufi\c{s} and A.M.Barbu;
\item[Slovene:] T.Erjavec and P.Holozan.
\end{description}

The language application parts present the lexicon specifications
category by category and are structured as follows:
\begin{enumerate}
\item[(a)] a first section which describes the features and values
  pertinent to a given category in the form of tables with
  examples from the language in question;
\item[(b)] a second section which gives the combinations of values,
  i.e.\ the ways in which the different values combine to yield
  all the possible lexicon specifications for the items belonging to
  that category.
\end{enumerate}

Pisa, October 1995\\
Ljubljana, August 1997
\newpage

%\section{Introduction}

%The aim of the present document is to formulate harmonized
%specifications and a common notation for the lexicons of the six
%languages involved in the \mte\ Project, i.e.\ Bulgarian, Czech,
%Estonian, Hungarian, Romanian, Slovene. These lexicons have been
%contributed in the form of word-form lists associated to their
%morhosyntactic information by the six partners.

%The work carried out within this Task 1.1 ``Lexicon specification and
%resource building" can, hence, be seen as parallel to that developed
%within the \multext\ Task 1.6 (\multext\ 1993), in which harmonized
%lexicon specifications with the respective notation for the six
%project languages have been formulated and used to encode the lexical
%lists contributed within the Task 5.4.

%This will allow us to obtain lexical material with harmonized information
%encoded in an agreed way. These resources will run under the same tools
%and make it possible to perform comparisons across languages.

%One of the most important goals of \mte\ is to make its
%standards, software and data freely available for research in both
%academia and industry. The consortium adopts a policy of maximum access
%to all project resources. A primary goal is to provide the greatest
%possible opportunity for feedback from the potential community of users.
%Therefore, the project will provide free access to early versions of
%software and data for use within the NLP and MT community.

%In a larger perspective, the \mte\ work fits into the framework
%of the standardization efforts in the academic and industrial
%environment, provided by \eagles\ and \multext.  The Eastern work,
%indeed, serves as a refinement and validation of \multext\ itself. With
%respect to the \eagles\ initiative, it is an example of the large impact
%and dissemination of the proposal within the scientific community and
%constitutes a further contribution to the testing and application of
%the specifications, obviously increasing their coverage from a
%multilingual point of view.


\section{Background work}

The \multext\ D1-6-1B Deliverable (Bel, Calzolari and Monachini eds.
1995) and the \eagles\ proposal of specifications to be encoded in
lexica (Monachini and Calzolari 1995) have constituted the basis of
the results presented in what follows. 

The two documents together collect the outcomes of the main efforts in
Europe towards standardization in large linguistic resources and,
among other fields, in linguistic annotation.  The relationships and
interdependencies between the two actions are reported in the
introduction of the \eagles\ document, where the \multext\ contribution to
the \eagles\ effort is described in detail, and are also commented on in
the introduction of the \multext\ Deliverable.  In summary, the two actions
have followed the same bottom-up approach, which is also adopted here,
and have produced similar sets of specifications. The \eagles\ set aims
at a general description of the languages and is intended to cover a
wide range of purposes, while the \multext\ set has to be considered in
the context of the project itself, whose aim was to produce resources
for the specific application of automatic corpus tagging.  The \multext\
set of lexicon specifications is hence task-oriented, having been
designed with that final purpose in mind.


\section{\mte\ work and approach}

The \mte\ partners analysed and evaluated the
set of \multext\ specifications with respect to their languages; where
the scheme proved insufficient, the \eagles\ inventory was
consulted. Where necessary, new distinctions for
language-specific information were proposed.

The work proceeded cyclically: first, the partners circulated the
results of their evaluation in the form of applications, which were in
turn evaluated by Pisa, and later by Ljubljana.  These
applications constituted the basis for the proposed set of
Central \& Eastern specifications.

The nucleus of common features, already isolated within \multext,
proved to be applicable to the Central \& Eastern languages, but
selecting the distinctions needed to encode the information peculiar
to the \mte\ languages and harmonising the proposals required
much further work.  Some of the features proposed by the partners in
their first applications were dropped because they pertained to a
level that is not purely morphosyntactic (e.g.\ Transitivity); other
distinctions were left out of the set because they were considered too
fine-grained.

The resulting set was again circulated among the partners for new
cycles of revision and re-application, until the specifications were
considered acceptable to all the language groups and stable
enough for the first version (IM1) of this deliverable.

However, there were still significant problems with the IM1
version of the deliverable, in particular:
\begin{itemize}
\item different attributes or values were used to describe the same
  phenomena in different categories or languages (e.g.\ \texttt{1st}
  and \texttt{1} for first person);
\item the three Slavic languages in particular at times described the
  same phenomena in different ways;
\item language-independent aspects of the specifications (e.g.\ the
  `form' of numerals: digit, roman, etc.) were treated differently for
  different languages;
\item the ordering of attributes in some categories was sub-optimal,
  often necessitating long morphosyntactic descriptions;
\item the attributes and values used different `punctuation' (e.g.\
  \texttt{full  art}, \texttt{Modific.Type}, \texttt{Pron-Form},
  \texttt{SubType});
\item the common tables were formatted differently for different categories.
\end{itemize}

It was therefore decided to produce version 2 of the deliverable for
milestone M; this effort was led by the Ljubljana site. The
harmonisation led to better motivated and, on average, more
compact morphosyntactic descriptions for the \mte\ languages. The
formalisation of the tables and descriptions had an added benefit:
a simple Perl program (\texttt{mtems-expand}) was written
which, working directly with the common tables of
morphosyntactic descriptions in this report, can either expand or
validate lexical morphosyntactic descriptions. This program was used
to validate the word-form lexica of the project, thus ensuring that
all the morphosyntactic descriptions in the lexica of the particular
languages are well-formed. Another Perl program, \texttt{mtems-split},
was also written which, again working on the common tables, produces
language-specific tables. These tables were circulated to the
partners, thus ensuring that the language-specific sections do in fact
reflect the common tables.

During the tagging of the \mte\ corpus, it was found that the initial
lexica still contained errors or non-optimal choices; this led to a
revised set of lexica, and these in turn to new language applications
of the morphosyntactic tables, which are incorporated in this Final
Report.


\section{Description of the proposal}

The proposal has been prepared in the usual \multext\ table format, which
displays the specifications (as sets of attribute-values, see below for
further details about the notation), with their respective codes used to
mark them in the lexicons. Two types of features are distinguished:
\begin{itemize}

\item[(i)] the minimal {\em core features}

  These are shared by most of the languages and are highlighted in the
  tables with asterisks (*).  We tried to keep this set common to
  all the \multext\ and \mte\ languages.  In this way, the information
  encoded in the lexical lists of the Central \& Eastern languages and
  of the six original \multext\ languages remains comparable to a
  certain extent.

\item[(ii)] the \mte\ {\em language-specific features}

  These are shared by the \mte\ languages, but differ from the
  \multext\ ones. As already mentioned, formulating this set was a
  delicate task, partly because many language-specific values were
  presented in the applications and sometimes the same (or `similar
  enough') morphosyntactic phenomenon was referred to by two
  different attribute or value names. The recognition and
  harmonization of the semantics of some attributes, values and
  naming conventions hence required much effort.
\end{itemize}

If a feature-value is used by only one \mte\ language (i.e.\ if a
value is {\em language-specific}), it is marked with {\tt l.s.}.
This marking is used only when a subset of the attribute's values is
language-specific; when a whole attribute (i.e.\ all its values) is
language-specific, the mark {\tt l.s.} is not used.

This representation, with the concrete applications which display and
exemplify the attributes and values and provide their internal
constraints and relationships, makes the proposal self-explanatory.
Other groups can easily test the specifications on their language,
simply by following the method of the applications.  The possibility
of incorporating idiosyncratic classes and distinctions after the
common core features makes the proposal relatively adaptable and
flexible without compromising compatibility.


\section{Lexical lists}

The specifications presented here have been used in encoding the
word-form lexica of the project. These lexica contain one entry per
line, where an entry has the following form:

\begin{center}
     Word-form, lemma, morphosyntactic description, (TAG)
\end{center}

This report describes the morphosyntactic descriptions used in the
lexica and the tags developed by the partners. It should be noted that
the TAG field is optional in the lexica, partly because it is easier and
more compact to provide the mapping between the morphosyntactic
information and the tags in separate tables, and partly because of the
preliminary nature of the tagsets developed. In other words, the lexica
with the morphosyntactic descriptions are a much more mature resource
than the tags.
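
The entry format can be illustrated with a small reader. The following
Python sketch assumes, purely for illustration, that the fields of an
entry are separated by tabs; the separator, the names \texttt{Entry} and
\texttt{parse\_entry}, the sample line and the sample tag are assumptions
and are not taken from the project lexica.

```python
from typing import NamedTuple, Optional


class Entry(NamedTuple):
    """One lexicon line: word-form, lemma, description, optional tag."""
    word_form: str
    lemma: str
    msd: str
    tag: Optional[str] = None


def parse_entry(line, separator="\t"):
    """Split one line into its fields (the separator is an assumption)."""
    fields = line.rstrip("\n").split(separator)
    if len(fields) not in (3, 4):
        raise ValueError(f"expected 3 or 4 fields, got {len(fields)}")
    return Entry(*fields)


# Made-up sample line; the TAG field is simply left out.
print(parse_entry("mize\tmiza\tNcfsg"))
```

Keeping the tag as an optional fourth field mirrors the statement that
the TAG field may be omitted and supplied through separate mapping tables.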


\section{Notation}

In \multext, the notation was chosen following current practice
in NLP, where information is represented in attribute-value
formalisms, and following the idea that it should also be
self-explanatory for human readers. At the same time, a
relatively compact encoding was maintained. The proposed notation
format has the following main characteristics:
\begin{itemize}
\item attributes are marked by positions;
\item values are represented by a single character;
\item a special marker reflects the non applicability of a given
  attribute.
\end{itemize}

These characteristics make the proposed lexical notation similar to
the attribute-value pairs used in unification-based formalisms (see the
\multext\ D1-6-1B Deliverable for further details).

The linear strings of characters representing the morphosyntactic
descriptions are constructed following
the philosophy of the Intermediate Format proposed in the \eagles\
Corpus proposal (Leech and Wilson, 1994), i.e.\ of having agreed
symbols in predefined and fixed positions: the positions of a string
of characters are numbered 0, 1, 2, etc. in the following way:
\begin{itemize}
\item[a.] the agreed character at position 0 encodes part-of-speech;
\item[b.] each character at positions 1, 2, \ldots, $n$ encodes the
  value of one attribute (person, gender, number, etc.);
\item[c.] if an attribute does not apply, the corresponding position
  in the string contains a special marker, in our case `\texttt{-}'
  (hyphen).
\end{itemize}

Example: \texttt{Ncms-} (Noun, common, masculine, singular, case not applicable)
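
As an illustration of the positional notation, the following Python
sketch expands a description such as \texttt{Ncms-} into attribute-value
pairs. It is not the project's \texttt{mtems-expand} program: the
function name \texttt{expand\_msd} and the reduced Noun table (first four
positions, with only a few Case values) are assumptions made to keep the
example short. Positions missing at the end of a string are treated here
as not applicable, which is also an assumption.

```python
# Reduced, hand-copied subset of the common Noun table (positions 1-4).
NOUN_ATTRIBUTES = [
    ("Type", {"c": "common", "p": "proper"}),
    ("Gender", {"m": "masculine", "f": "feminine", "n": "neuter"}),
    ("Number", {"s": "singular", "p": "plural", "d": "dual", "t": "count"}),
    ("Case", {"n": "nominative", "g": "genitive",
              "d": "dative", "a": "accusative"}),
]
CATEGORIES = {"N": ("Noun", NOUN_ATTRIBUTES)}
NOT_APPLICABLE = "-"


def expand_msd(msd):
    """Expand a positional description into (attribute, value) pairs."""
    if not msd or msd[0] not in CATEGORIES:
        raise ValueError(f"unknown part-of-speech in {msd!r}")
    name, attributes = CATEGORIES[msd[0]]
    codes = msd[1:]
    if len(codes) > len(attributes):
        raise ValueError(f"too many positions in {msd!r}")
    pairs = [("PoS", name)]
    # Position i of the string is read against attribute i of the category.
    for (attribute, values), code in zip(attributes, codes):
        if code == NOT_APPLICABLE:
            continue  # the hyphen only keeps later positions aligned
        if code not in values:
            raise ValueError(f"invalid {attribute} code {code!r} in {msd!r}")
        pairs.append((attribute, values[code]))
    return pairs


print(expand_msd("Ncms-"))
```

Because the meaning of a code depends on its position and on the
category at position 0, the lookup table is indexed first by category
and then by position.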

This notation adopts the \eagles\ Intermediate Format with a small
revision: the Intermediate Format encodes information by means of
digits, while in \multext\ characters of a mnemonic nature are
preferred.

The marker `\texttt{-}' has a special semantics: it means
`not applicable'. As stated above, its
function is simply to preserve the relationship established between
attributes and values. It is used when an attribute is:
\begin{itemize}
\item[(a)] not relevant \emph{to a particular language}, e.g.\ Gender
  to Estonian;
\item[(b)] not applicable \emph{to a particular combination of
   attribute-values}, i.e.\ although the attribute is used by a
   category in a given language, it does not apply to a particular
   subclass of the category; e.g.\ Person applies to Pronouns, but not
   to the Type demonstrative;
 \item[(c)] not applicable \emph{to a particular lexical item}, i.e.\
   the attribute applies elsewhere in the paradigm but not to the item
   itself (e.g.\ Gender in the paradigm of English Personal Pronouns
   applies only to the 3rd person: {\sl I, you} vs.\ {\sl she, he}).
\end{itemize}


\section{Organization of the language-specific sections}

Following the procedure of the \multext\ Deliverable, the
language-specific sections consist of two distinct parts:
\begin{itemize}
\item a descriptive section, where the features and values relevant to
  the lexicon of the language are displayed in tabular form, with
  examples taken from the language;
\item the combination section, where the way in which the different
  values combine together is shown.
\end{itemize}

Two different strategies for displaying combinations have been
followed:
\begin{itemize}
\item[(i)] all the {\em admitted\/} combinations are provided together
  with an example. This has the disadvantage of producing long lists,
  given the high number of features and values to be combined,
  but the advantage of listing only the legal combinations, and thus
  of capturing the constraints on the application of some features or
  values in the presence of other features or values, or combinations
  of them (see e.g.\ Gender in Personal Pronouns);
\item[(ii)] a mathematical expression describing the combinations is
  provided, which can subsequently be used to generate {\em all\/}
  combinations, including those which are not valid.
\end{itemize}
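
The difference between the two strategies can be sketched in Python.
Strategy (ii) is modelled here as a Cartesian product over the value
codes of each position, and strategy (i) as the subset of that product
which satisfies the constraints. The value inventory is a toy one, and
the constraint (Gender is marked only in the third person) merely
follows the English personal pronoun example given in the Notation
section; the names \texttt{all\_combinations} and \texttt{is\_admitted}
are hypothetical.

```python
from itertools import product

# Toy inventory for personal pronouns (prefix "Pp"); "-" means
# "not applicable". The choice of positions is an assumption.
POSITIONS = [
    ("Person", "123"),
    ("Gender", "mfn-"),
    ("Number", "sp"),
]


def all_combinations(prefix, positions):
    """Strategy (ii): every string in the product of the value sets."""
    value_sets = [values for _, values in positions]
    return [prefix + "".join(codes) for codes in product(*value_sets)]


def is_admitted(msd):
    """Toy constraint: Gender is marked only in the third person."""
    person, gender = msd[2], msd[3]
    return (gender != "-") == (person == "3")


generated = all_combinations("Pp", POSITIONS)
admitted = [msd for msd in generated if is_admitted(msd)]
print(len(generated), "generated,", len(admitted), "admitted")
```

The product is compact to state but over-generates; the explicit list
is longer to write down but contains only legal descriptions.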


\chapter{Morphosyntactic Specifications for Central \& Eastern Lexicons}
\label{chp:common}
\markright{COP project 106 \mte{\hfill} \delID --- Common Tables{\hfill}}

The categories listed below with the relevant attributes and values
are based on the \multext\ and \eagles\ documents.  The specifications
constitute a ``harmonized'' set of features which properly describe
lexical items of the different languages.

In keeping with the general aim of the project, these harmonized
specifications, and the related resources, will contribute to
the standardization of corpus annotation work.  Besides the advantages
described in the Introduction, this common set of features will also
provide common ground for comparing the results of different
annotation tools, a task currently hampered by the coexistence of many
lexical description systems.

The categories and features listed below are therefore the common
reference for the work done by the different groups.


\newpage
\section{Table of categories}

\begin{verbatim}
=============== ====
Part-of-Speech  Code
=============== ====
Noun             N
Verb             V
Adjective        A
Pronoun          P
Determiner       D
Article          T
Adverb           R
Adposition       S
Conjunction      C
Numeral          M
Interjection     I
Residual         X
Abbreviation     Y
Particle         Q
=============== ====
\end{verbatim}

Each category acts as a type, i.e.\ it introduces attributes and their
values. The interpretation of a particular attribute-value at a
particular position in a morphosyntactic description is thus defined
only with respect to a given category.  The attributes and the values
pertinent to the above categories are presented in the next section.

After the section devoted to the tables, another section presents all
the values in alphabetical order together with their
codes.  A further section lists the attributes,
indicating the category or categories in which they are
used and, for some of them, a very concise definition of their
semantics.


\newpage
\section{Tables of attribute-values}

The common \mte\ tables of attribute values are given for all
categories above, and have a rigid structure, which makes them
suitable for automatically verifying the conformance of a particular
morphosyntactic description with the tables, or for expanding a
morphosyntactic description into its more verbose form.

At the beginning of each category the number of attributes introduced
by the category is given. This is followed by a string summarizing the
valid attributes of the category.  The features constituting the
minimal core, in common with \multext\ (and \eagles) are graphically
contained between strings of stars (*).

This information is followed by the table proper. The table has the
following columns:
\begin{enumerate}
\item \textbf{Position} gives the position of the attribute in the string of
  the morphosyntactic description;
\item \textbf{Attribute} gives the name of the attribute;
\item \textbf{Value} gives the name and one-letter code of the
  attribute-value;
\item \textbf{Ro, Sl, Cs, Bg, Et, Hu}, i.e.\ a column for each of the
  \mte\ languages. For easier comparison between them they have been
  grouped by language family: first Romance (Romanian), then Slavic
  (Slovene, Czech, Bulgarian), and finally Finno-Ugric (Estonian,
  Hungarian).
\end{enumerate}

The language columns define (by marking with `\texttt{x}'), in the first line,
whether the category is used by the language, and in subsequent lines,
which attribute-values a particular language uses.
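
The way the language columns support automatic conformance checking can
be sketched as follows. This Python example is not the project's Perl
validator; it hand-transcribes only the first three positions of the
Noun table below, and the names \texttt{NOUN\_TABLE} and
\texttt{check\_noun} are hypothetical. Positions beyond the third are
not checked in this sketch.

```python
ALL = {"ro", "sl", "cs", "bg", "et", "hu"}
GENDERED = {"ro", "sl", "cs", "bg"}

# For each position: attribute name, then value code -> languages using it.
NOUN_TABLE = [
    ("Type", {"c": ALL, "p": ALL}),
    ("Gender", {"m": GENDERED, "f": GENDERED, "n": GENDERED}),
    ("Number", {"s": ALL, "p": ALL, "d": {"sl", "cs"}, "t": {"bg"}}),
]


def check_noun(msd, language):
    """Return a list of problems; an empty list means no problem found."""
    if not msd.startswith("N"):
        return ["not a noun description"]
    problems = []
    for position, (code, (attribute, users_by_code)) in enumerate(
            zip(msd[1:], NOUN_TABLE), start=1):
        if code == "-":
            continue  # attribute not applicable at this position
        users = users_by_code.get(code)
        if users is None:
            problems.append(
                f"position {position}: unknown {attribute} code {code!r}")
        elif language not in users:
            problems.append(
                f"position {position}: {attribute}={code} "
                f"is not used by {language}")
    return problems


print(check_noun("Ncmd", "sl"))  # dual is marked for Slovene
print(check_noun("Ncmd", "ro"))  # dual is not marked for Romanian
```

Recording the set of languages for each value, rather than for each
attribute, matches the tables, where a language may use an attribute
without using all of its values.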


\subsection{Noun (N)}

{\small
\begin{verbatim}
11 Positions

**** **** **** **** **** ---- ---- ---- ---- ---- ----
PoS  Type Gend Numb Case Def  Cltc Anim OwnN OwnP OwdN
**** **** **** **** **** ---- ---- ---- ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           common         c  x   x   x   x   x   x
                 proper         p  x   x   x   x   x   x
- -------------- -------------- -
2 Gender         masculine      m  x   x   x   x
                 feminine       f  x   x   x   x
                 neuter         n  x   x   x   x
- -------------- -------------- -
3 Number         singular       s  x   x   x   x   x   x
                 plural         p  x   x   x   x   x   x
                 dual           d      x   x
          l.s.   count          t              x
- -------------- -------------- -
4 Case           nominative     n      x   x   x   x   x
                 genitive       g      x   x       x   x
                 dative         d      x   x           x
                 accusative     a      x   x           x
                 vocative       v  x       x   x
                 locative       l      x   x
                 instrumental   i      x   x           x
           l.s.  direct         r  x
           l.s.  oblique        o  x
           l.s.  partitive      1                  x
                 illative       x                  x   x
                 inessive       2                  x   x
                 elative        e                  x   x
                 allative       t                  x   x
                 adessive       3                  x   x
                 ablative       b                  x   x
           l.s.  translative    4                  x
                 terminative    9                  x   x
                 essive         w                  x   x
           l.s.  abessive       5                  x
           l.s.  komitative     k                  x
           l.s.  aditive        7                  x
           l.s.  temporalis     m                      x
           l.s.  causalis       c                      x
           l.s.  sublative      s                      x
           l.s.  delative       h                      x
           l.s.  sociative      q                      x
           l.s.  factive        y                      x
           l.s.  superessive    p                      x
           l.s.  distributive   u                      x
* ***************************** *
5 Definiteness   no             n  x           x
                 yes            y  x           x
           l.s.  short_art      s              x
           l.s.  full_art       f              x
- -------------- -------------- -
6 Clitic         no             n  x
                 yes            y  x
- -------------- -------------- -
7 Animate        no             n          x           x
                 yes            y          x           x
- -------------- -------------- -
8 Owner_Number   singular       s                      x
                 plural         p                      x
- -------------- -------------- -
9 Owner_Person   first          1                      x
                 second         2                      x
                 third          3                      x
- -------------- -------------- -
10Owned_Number   singular       s                      x
                 plural         p                      x
=================================
\end{verbatim}
}
\newpage


\subsection{Verb (V)}

{\small
\begin{verbatim}
14 Positions

**** **** **** **** **** **** **** ---- ---- ---- ---- ---- ---- ----
PoS  Type VFrm Tens Pers Numb Gend Voic Neg  Def  Cltc Case Anim Clt2
**** **** **** **** **** **** **** ---- ---- ---- ---- ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           main           m  x   x   x   x   x   x
                 auxiliary      a  x   x   x   x   x   x
                 modal          o  x   x   x       x
                 copula         c  x   x   x    
- -------------- -------------- -
2 VForm          indicative     i  x   x   x   x   x   x
                 subjunctive    s  x
                 imperative     m  x   x   x   x   x   x
                 conditional    c      x   x       x   x
                 infinitive     n  x   x   x       x   x
                 participle     p  x   x   x   x   x
                 gerund         g  x           x   x
                 supine         u      x           x
           l.s.  transgressive  t          x    
           l.s.  quotative      q                  x
- -------------- -------------- -
3 Tense          present        p  x   x   x   x   x   x
                 imperfect      i  x           x   x
                 future         f      x   x
                 past           s  x   x   x   x   x   x
           l.s.  pluperfect     l  x
           l.s.  aorist         a              x
- -------------- -------------- -
4 Person         first          1  x   x   x   x   x   x
                 second         2  x   x   x   x   x   x
                 third          3  x   x   x   x   x   x
- -------------- -------------- -
5 Number         singular       s  x   x   x   x   x   x
                 plural         p  x   x   x   x   x   x
           l.s.  dual           d      x
- -------------- -------------- -
6 Gender         masculine      m  x   x   x   x
                 feminine       f  x   x   x   x
                 neuter         n  x   x   x   x
*********************************
7 Voice          active         a      x   x   x   x
                 passive        p      x   x   x   x
- -------------- -------------- -
8 Negative       no             n      x   x       x
                 yes            y      x   x       x
- -------------- -------------- -
9 Definiteness   no             n              x       x
                 yes            y              x       x
                 short_art      s              x
                 full_art       f              x
           l.s.  1s2s           2                      x
- -------------- -------------- -
10Clitic         no             n  x
                 yes            y  x
- -------------- -------------- -
11Case           nominative     n
                 genitive       g
                 dative         d
                 accusative     a
                 locative       l
                 instrumental   i
                 illative       x                  x
                 inessive       2                  x
                 elative        e                  x
                 translative    4                  x
                 abessive       5                  x
- -------------- -------------- -
12Animate        no             n          x
                 yes            y          x
- -------------- -------------- -
13Clitic_s       no             n          x
                 yes            y          x
=================================
\end{verbatim}
}
\newpage


\subsection{Adjective (A)}

{\small
\begin{verbatim}
13 Positions

**** **** **** **** **** **** ---- ---- ---- ---- ---- ---- ----
PoS  Type Degr Gend Numb Case Def  Cltc Anim Form OwnN OwnP OwdN
**** **** **** **** **** **** ---- ---- ---- ---- ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           qualificative  f  x   x   x           x
                 indefinite     i
                 possessive     s      x   x
           l.s.  ordinal        o      x
- -------------- -------------- -
2 Degree         positive       p  x   x   x       x   x
                 comparative    c  x   x   x       x   x
                 superlative    s  x   x   x       x   x
- -------------- -------------- -
3 Gender         masculine      m  x   x   x   x
                 feminine       f  x   x   x   x
                 neuter         n  x   x   x   x
- -------------- -------------- -
4 Number         singular       s  x   x   x   x   x   x
                 plural         p  x   x   x   x   x   x
                 dual           d      x   x
- -------------- -------------- -
5 Case           nominative     n      x   x       x   x
                 genitive       g      x   x       x   x
                 dative         d      x   x           x
                 accusative     a      x   x           x
                 vocative       v  x       x
                 locative       l      x   x
                 instrumental   i      x   x           x
           l.s.  direct         r  x
           l.s.  oblique        o  x
           l.s.  partitive      1                  x
                 illative       x                  x   x
                 inessive       2                  x   x
                 elative        e                  x   x
                 allative       t                  x   x
                 adessive       3                  x   x
                 ablative       b                  x   x
           l.s.  translative    4                  x
                 terminative    9                  x   x
                 essive         w                  x   x
           l.s.  abessive       5                  x
           l.s.  komitative     k                  x
           l.s.  aditive        7                  x
           l.s.  temporalis     m                      x
           l.s.  causalis       c                      x
           l.s.  sublative      s                      x
           l.s.  delative       h                      x
           l.s.  sociative      q                      x
           l.s.  factive        y                      x
           l.s.  superessive    p                      x
           l.s.  distributive   u                      x
           l.s.  essive_formal  f                      x
* ***************************** *
6 Definiteness   no             n  x           x
                 yes            y  x           x
           l.s.  short_art      s              x
           l.s.  full_art       f              x
- -------------- -------------- -
7 Clitic         no             n  x
                 yes            y  x
- -------------- -------------- -
8 Animate        no             n          x           x
                 yes            y          x           x
- -------------- -------------- -
9 Formation      nominal        n          x
                 compound       c          x
- -------------- -------------- -
10Owner_Number   singular       s                      x
                 plural         p                      x
- -------------- -------------- -
11Owner_Person   first          1                      x
                 second         2                      x
                 third          3                      x
- -------------- -------------- -
12Owned_Number   singular       s                      x
                 plural         p                      x
=================================
\end{verbatim}
}
\newpage


\subsection{Pronoun (P)}

{\small
\begin{verbatim}
17 Positions

**** **** **** **** **** **** **** **** ---- ---- ---- ---- ---- ---- ...
PoS  Type Pers Gend Numb Case OwnN OwnG Cltc RefT SynT Def  Anim Clt2
**** **** **** **** **** **** **** **** ---- ---- ---- ---- ---- ---- ...
 ... ---- ---- ----
     PrFr OwnP OwdN 
 ... ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           personal       p  x   x   x   x   x   x
                 demonstrative  d  x   x   x   x   x   x
                 indefinite     i  x   x   x   x   x   x
                 possessive     s  x   x   x   x   x   x
                 interrogative  q      x   x   x   x   x
                 relative       r      x   x   x   x   x
           l.s.  exclamative    e
                 reflexive      x  x   x   x   x   x   x
                 reciprocal     y                  x   x
           l.s.  int_rel        w  x
                 negative       z  x   x   x   x
           l.s.  emphatic       h  x
           l.s.  determinal     m                  x
                 general        g      x   x   x
- -------------- -------------- -
2 Person         first          1  x   x   x   x   x   x
                 second         2  x   x   x   x   x   x
                 third          3  x   x   x   x   x   x
- -------------- -------------- -
3 Gender         masculine      m  x   x   x   x
                 feminine       f  x   x   x   x
                 neuter         n  x   x   x   x
- -------------- -------------- -
4 Number         singular       s  x   x   x   x   x   x
                 plural         p  x   x   x   x   x   x
                 dual           d      x   x
- -------------- -------------- -
5 Case           nominative  *  n  x1  x   x   x   x   x
                 genitive    *  g  x2  x   x       x   x
                 dative      *  d  x2  x   x   x       x
                 accusative  *  a  x1  x   x   x       x
                 vocative       v  x
                 locative       l      x   x
                 instrumental   i      x   x           x
           l.s.  direct      *  r  xR1
           l.s.  oblique     *  o  xR2
           l.s.  partitive      1                  x
                 illative       x                  x   x
                 inessive       2                  x   x
                 elative        e                  x   x
                 allative       t                  x   x
                 adessive       3                  x   x
                 ablative       b                  x   x
           l.s.  translative    4                  x
                 terminative    9                  x   x
                 essive         w                  x   x
           l.s.  abessive       5                  x
           l.s.  komitative     k                  x
           l.s.  aditive        7                  x
           l.s.  temporalis     m                      x
           l.s.  causalis       c                      x
           l.s.  sublative      s                      x
           l.s.  delative       h                      x
           l.s.  sociative      q                      x
           l.s.  factive        y                      x
           l.s.  superessive    p                      x
           l.s.  distributive   u                      x
           l.s.  essive_formal  f                      x
- -------------- -------------- -
6 Owner_Number   singular       s  x   x   x           x
                 plural         p  x   x   x           x
           l.s.  dual           d      x
- -------------- -------------- -
7 Owner_Gender   masculine      m      x   x
                 feminine       f      x   x
                 neuter         n      x   x
*********************************
8 Clitic         no             n  x   x   x   x
                 yes            y  x   x   x   x
- -------------- -------------- -
9 Referent_Type  personal       p      x   x   x
                 possessive     s      x   x   x
                 attributive    a              x
                 quantitative   q              x
- -------------- -------------- -
10Syntactic_Type nominal        n      x   x
                 adjectival     a      x   x
- -------------- -------------- -
11Definiteness   no             n              x
                 yes            y              x
                 short_art      s              x
                 full_art       f              x
- -------------- -------------- -
12Animate        no             n          x
                 yes            y          x
- -------------- -------------- -
13Clitic_s       yes            y          x
                 no             n          x
- -------------- -------------- -
14Pronoun_Form   strong         s  x
                 weak           w  x
- -------------- -------------- -
15Owner_Person   first          1                      x
                 second         2                      x
                 third          3                      x
- -------------- -------------- -
16Owned_Number   singular       s                      x
                 plural         p                      x
=================================
\end{verbatim}
}

{\bf Notes}

* In the Romanian case system the value 'direct' conflates
'nominative' and 'accusative', while the value 'oblique' conflates
'genitive' and 'dative'.

If these values are to be listed in the same Case system, i.e.\ if
'direct', 'nominative' and 'accusative', or 'oblique', 'genitive' and
'dative', are to appear together, their semantics should be explained
and the relationships between them represented. We suggest following
the strategy adopted in the \eagles\ validation exercise, where
different kinds of relationships between values were formally
represented by adding indexes to the values concerned:

- in this example, 'direct' and 'oblique' stand in a 'replacement'
relation to other values of the system. We mark this by labelling
'direct' and 'oblique' with the index 'R' and adding the numerical
index 1 to 'direct' and 2 to 'oblique'; the same numerical indexes
mark the values that each of them replaces.
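
The replacement relation can be illustrated with a small sketch. The
Python fragment below is not part of the specification: the names
{\tt REPLACES}, {\tt expand\_case} and {\tt compatible} are
hypothetical, and only the one-letter codes are taken from the Case
rows of the table above.

```python
# Illustrative sketch only; names are hypothetical, codes come from the
# Case attribute of the Pronoun table.
# 'direct' (r) replaces nominative (n) and accusative (a);
# 'oblique' (o) replaces genitive (g) and dative (d).
REPLACES = {
    "r": frozenset({"n", "a"}),
    "o": frozenset({"g", "d"}),
}


def expand_case(code):
    """Return the set of Case codes that `code` stands for."""
    return REPLACES.get(code, frozenset({code}))


def compatible(code_a, code_b):
    """True if the two Case codes can describe the same word-form."""
    return bool(expand_case(code_a) & expand_case(code_b))


print(sorted(expand_case("r")))  # ['a', 'n']
print(compatible("o", "d"))      # True
print(compatible("r", "g"))      # False
```

Under this reading, a Romanian 'oblique' form is compatible with a
'dative' in another language, but a 'direct' form is not compatible
with a 'genitive'.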

\newpage


\subsection{Determiner (D)}

{\small
\begin{verbatim}
10 Positions

**** **** **** **** **** **** **** **** ---- ----
PoS  Type Pers Gend Numb Case OwnN OwnG Cltc Mod
**** **** **** **** **** **** **** **** ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   -   -   -   -   -
= ============== ============== =
1 Type           demonstrative  d  x
                 indefinite     i  x
                 possessive     s  x
                 interrogative  q
                 relative       r
                 exclamative    e
                 article        a
           l.s.  int_rel        w  x
           l.s.  negative       z  x
           l.s.  emphatic       h  x
- -------------- -------------- -
2 Person         first          1  x
                 second         2  x
                 third          3  x
- -------------- -------------- -
3 Gender         masculine      m  x
                 feminine       f  x
                 neuter         n  x
- -------------- -------------- -
4 Number         singular       s  x
                 plural         p  x
- -------------- -------------- -
5 Case           direct         r  x
                 oblique        o  x
- -------------- -------------- -
6 Owner_Number   singular       s  x
                 plural         p  x
- -------------- -------------- -
7 Owner_Gender   masculine      m
                 feminine       f
                 neuter         n
*********************************
8 Clitic         no             n  x
                 yes            y  x
- -------------- -------------- -
9 Modific_Type   prenomin       e  x
                 postnomin      o  x
=================================
\end{verbatim}
}
\newpage


\subsection{Article (T)}

{\small
\begin{verbatim}
6 Positions

**** **** **** **** **** ----
PoS  Type Gend Numb Case Cltc
**** **** **** **** **** ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   -   -   -   -   x
= ============== ============== =
1 Type           definite       f  x                   x
                 indefinite     i  x                   x
           l.s.  possessive     s  x
           l.s.  demonstrative  d  x
- -------------- -------------- -
2 Gender         masculine      m  x
                 feminine       f  x
                 neuter         n  x
- -------------- -------------- -
3 Number         singular       s  x
                 plural         p  x
- -------------- -------------- -
4 Case     l.s.  direct         r  x
           l.s.  oblique        o  x
*********************************
5 Clitic         no             n  x
                 yes            y  x
=================================
\end{verbatim}
}
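
As an illustration of how the positions in the header row line up with
the attribute table, the following Python sketch decodes an Article
description. It assumes that a description is written as a string of
one-character codes, one per position, in the order PoS, Type, Gender,
Number, Case, Clitic; the function name and the use of {\tt -} as a
filler for a non-applicable position are assumptions of this sketch.

```python
# Hypothetical decoder for an Article (T) description. Attribute order and
# codes follow the table above; the '-' filler is an assumption.
ARTICLE_POSITIONS = [
    ("Type", {"f": "definite", "i": "indefinite",
              "s": "possessive", "d": "demonstrative"}),
    ("Gender", {"m": "masculine", "f": "feminine", "n": "neuter"}),
    ("Number", {"s": "singular", "p": "plural"}),
    ("Case", {"r": "direct", "o": "oblique"}),
    ("Clitic", {"n": "no", "y": "yes"}),
]


def decode_article(tag):
    """Map a positional string such as 'Tfmsrn' to attribute-value pairs."""
    if not tag or tag[0] != "T":
        raise ValueError("not an Article description: %r" % tag)
    codes = tag[1:]
    if len(codes) > len(ARTICLE_POSITIONS):
        raise ValueError("too many positions in %r" % tag)
    result = {}
    for (attribute, values), code in zip(ARTICLE_POSITIONS, codes):
        if code == "-":          # position not applicable (assumed filler)
            continue
        if code not in values:
            raise ValueError("unknown %s code %r" % (attribute, code))
        result[attribute] = values[code]
    return result


print(decode_article("Tfmsrn"))
```

The example string is invented for the sketch; it decodes to a
definite, masculine, singular, direct, non-clitic article.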
\newpage


\subsection{Adverb (R)}

{\small
\begin{verbatim}
6 Positions

**** **** **** ---- ---- ----
PoS  Type Degr Cltc Numb Pers
**** **** **** ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x  
= ============== ============== =
1 Type           general        g  x   x   x   x       x
                 particle       p  x                   x
           l.s.  causal         o                      x
           l.s.  negative       z  x
           l.s.  adjectival     a              x
           l.s.  verbal         v                      x
                 modifier       m  x                   x
           l.s.  int_rel        w  x
           l.s.  interrogative  q                      x
- -------------- -------------- -
2 Degree         positive       p  x   x   x
                 comparative    c  x   x   x
                 superlative    s  x   x   x
*********************************
3 Clitic         no             n  x                   x
                 yes            y  x                   x
- -------------- -------------- -
4 Number         singular       s                      x
                 plural         p                      x
- -------------- -------------- -
5 Person         first          1                      x
                 second         2                      x
                 third          3                      x
=================================
\end{verbatim}
}
\newpage


\subsection{Adposition (S)}

{\small
\begin{verbatim}
5 Positions

**** **** **** ---- ----
PoS  Type Form Case Cltc
**** **** **** ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           preposition    p  x   x   x   x   x
                 postposition   t                  x   x
- -------------- -------------- -
2 Formation      simple         s  x   x   x
                 compound       c  x   x   x
*********************************
3 Case           nominative     n
  (req.by prep.) genitive       g  x   x   x
                 dative         d  x   x   x
                 accusative     a  x   x   x
                 locative       l      x   x
                 instrumental   i      x   x
- -------------- -------------- -
4 Clitic         no             n  x
                 yes            y  x
=================================
\end{verbatim}
}
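
The language columns of the table can also be read as an applicability
relation between values and languages. The Python sketch below encodes
the Adposition table in that way; the data structure and the function
names are hypothetical, and the sets simply transcribe the 'x' marks
above.

```python
# Sketch: the Adposition (S) table as a mapping from (attribute, code)
# to the set of languages marked 'x'. Names are hypothetical.
ADPOSITION = {
    ("Type", "p"): {"RO", "SL", "CS", "BG", "ET"},   # preposition
    ("Type", "t"): {"ET", "HU"},                     # postposition
    ("Formation", "s"): {"RO", "SL", "CS"},          # simple
    ("Formation", "c"): {"RO", "SL", "CS"},          # compound
    ("Case", "n"): set(),                            # nominative: no 'x'
    ("Case", "g"): {"RO", "SL", "CS"},
    ("Case", "d"): {"RO", "SL", "CS"},
    ("Case", "a"): {"RO", "SL", "CS"},
    ("Case", "l"): {"SL", "CS"},
    ("Case", "i"): {"SL", "CS"},
    ("Clitic", "n"): {"RO"},
    ("Clitic", "y"): {"RO"},
}


def applicable(attribute, code, lang):
    """True if the table marks this attribute-value pair for `lang`."""
    return lang in ADPOSITION.get((attribute, code), set())


def values_for(lang):
    """All (attribute, code) pairs marked for `lang`, sorted."""
    return sorted(key for key, langs in ADPOSITION.items() if lang in langs)


print(applicable("Type", "t", "HU"))  # True
print(applicable("Case", "l", "RO"))  # False
print(values_for("HU"))               # [('Type', 't')]
```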
\newpage


\subsection{Conjunction (C)}

{\small
\begin{verbatim}
8 Positions

**** **** ---- ---- ---- ---- ---- ----
PoS  Type Form CTyp SubT Cltc Numb Pers
**** **** ---- ---- ---- ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           coordinating   c  x   x   x   x   x   x
                 subordinating  s  x   x   x   x   x   x
*********************************
2 Formation      simple         s  x   x       x       x
                 compound       c  x   x       x       x
- -------------- -------------- -
3 Coord_Type     simple         s  x
                 repetit        r  x
                 correlat       c  x
                 sentence       p                      x
                 words          w                      x
- -------------- -------------- -
4 Sub_Type       negative       z  x
                 positive       p  x
- -------------- -------------- -
5 Clitic         no             n  x
                 yes            y  x
- -------------- -------------- -
6 Number         singular       s          x
                 plural         p          x
- -------------- -------------- -
7 Person         first          1          x
                 second         2          x
                 third          3          x
=================================
\end{verbatim}
}
\newpage


\subsection{Numeral (M)}

{\small
\begin{verbatim}
13 Positions

**** **** **** **** **** ---- ---- ---- ---- ---- ---- ---- ----
PoS  Type Gend Numb Case Frm  Def  Cltc Clas Anim OwnN OwnP OwdN
**** **** **** **** **** ---- ---- ---- ---- ---- ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           cardinal       c  x   x   x   x   x   x
                 ordinal        o  x   x   x   x   x   x
                 fractal        f  x                   x
                 multiple       m  x   x   x
                 collect        l  x                   x
                 special        s      x   x
- -------------- -------------- -
2 Gender         masculine      m  x   x   x   x
                 feminine       f  x   x   x   x
                 neuter         n  x   x   x   x
- -------------- -------------- -
3 Number         singular       s  x   x   x   x   x   x
                 plural         p  x   x   x   x   x   x
                 dual           d      x   x
- -------------- -------------- -
4 Case           nominative     n      x   x       x   x
                 genitive       g      x   x       x   x
                 dative         d      x   x           x
                 accusative     a      x   x           x
                 locative       l      x   x
                 instrumental   i      x   x           x
           l.s.  direct         r  x
           l.s.  oblique        o  x
           l.s.  partitive      1                  x
                 illative       x                  x   x
                 inessive       2                  x   x
                 elative        e                  x   x
                 allative       t                  x   x
                 adessive       3                  x   x
                 ablative       b                  x   x
           l.s.  translative    4                  x
                 terminative    9                  x   x
                 essive         w                  x   x
           l.s.  abessive       5                  x
           l.s.  komitative     k                  x
           l.s.  aditive        7                  x
           l.s.  temporalis     m                      x
           l.s.  causalis       c                      x
           l.s.  sublative      s                      x
           l.s.  delative       h                      x
           l.s.  sociative      q                      x
           l.s.  factive        y                      x
           l.s.  superessive    p                      x
           l.s.  distributive   u                      x
           l.s.  essive_formal  f                      x
           l.s.  multiplicative 6                      x
*********************************
5 Form           digit          d  x   x   x   x   x   x
                 roman          r  x   x   x   x   x   x
                 letter         l  x   x   x   x   x   x
           l.s.  both           b  x
           l.s.  m-form         m              x
           l.s.  approx         a              x
- -------------- -------------- -
6 Definiteness   no             n  x           x
                 yes            y  x           x
                 short_art      s              x
                 full_art       f              x
- -------------- -------------- -
7 Clitic         no             n  x            
                 yes            y  x             
- -------------- -------------- -
8 Class          definite1      1          x
                 definite2      2          x
                 definite34     3          x
                 definite       f          x
                 demonstrative  d          x
                 indefinite     i          x
                 interrogative  q          x
                 relative       r          x
- -------------- -------------- -
9 Animate        no             n          x
                 yes            y          x
- -------------- -------------- -
10Owner_Number   singular       s                      x
                 plural         p                      x
- -------------- -------------- -
11Owner_Person   first          1                      x
                 second         2                      x
                 third          3                      x
- -------------- -------------- -
12Owned_Number   singular       s                      x
                 plural         p                      x
=================================
\end{verbatim}
}
\newpage


\subsection{Interjection (I)}

{\small
\begin{verbatim}
3 Positions

**** ---- ----
PoS  Type Form
**** ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Type           mood           m                      x
                 other          o                      x
- -------------- -------------- -
2 Formation      simple         s              x
                 compound       c              x
=================================
\end{verbatim}
}

\newpage
\subsection{Residual (X)}

{\small
\begin{verbatim}
1 Position

****
PoS 
****

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =

=================================
\end{verbatim}
}
\newpage


\subsection{Abbreviation (Y)}

{\small
\begin{verbatim}
6 Positions

**** ---- ---- ---- ---- ----
PoS  SynT Gend Numb Case Def
**** ---- ---- ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   x   x
= ============== ============== =
1 Syntactic_Type nominal        n  x               x
                 verbal         v  x               x
                 adjectival     a  x               x
                 adverbial      r  x               x
- -------------- -------------- -
2 Gender         masculine      m  x
                 feminine       f  x
                 neuter         n  x
- -------------- -------------- -
3 Number         singular       s  x               x
                 plural         p  x               x
- -------------- -------------- -
4 Case     l.s.  nominative     n                  x
           l.s.  genitive       g                  x
           l.s.  direct         r  x
           l.s.  oblique        o  x
           l.s.  partitive      1                  x
           l.s.  illative       x                  x
           l.s.  inessive       2                  x
           l.s.  elative        e                  x
           l.s.  allative       t                  x
           l.s.  adessive       3                  x
           l.s.  ablative       b                  x
           l.s.  translative    4                  x
           l.s.  terminative    9                  x
           l.s.  essive         w                  x
           l.s.  abessive       5                  x
           l.s.  komitative     k                  x
           l.s.  aditive        7                  x
- -------------- -------------- -
5 Definiteness   yes            y  x
                 no             n  x
=================================
\end{verbatim}
}
\newpage


\subsection{Particle (Q)}

{\small
\begin{verbatim}
4 Positions

**** ---- ---- ----
PoS  Type Form Cltc
**** ---- ---- ----

= ============== ============== =  RO  SL  CS  BG  ET  HU
P ATT            VAL            C  x   x   x   x   -   -
= ============== ============== =
1 Type           negative       z  x           x
                 infinitive     n  x
                 subjunctive    s  x
                 aspect         a  x
                 future         f  x
                 general        g              x
                 comparative    c              x
                 verbal         v              x
                 interrogative  q              x
                 modal          o              x
- -------------- -------------- -
2 Formation      simple         s              x
                 compound       c              x
- -------------- -------------- -
3 Clitic         no             n  x
                 yes            y  x
=================================
\end{verbatim}
}
\newpage


\section{List of values with respective codes}

The values presented in the tables are listed below in alphabetical
order: the first column gives the name of the value, the second column
its code, and the third the attributes for which the value is
appropriate.

{\scriptsize
\begin{verbatim}
1s2s           2    Definiteness
abessive       5    Case
ablative       b    Case
accusative     a    Case
active         a    Voice
adessive       3    Case
aditive        7    Case
adjectival     a    Syntactic_Type, Type
allative       t    Case
aorist         a    Tense
approx         a    Form
article        a    Type
aspect         a    Type
attributive    a    Referent_Type
auxiliary      a    Type
both           b    Form
cardinal       c    Type
causal         o    Type
causalis       c    Case
collect        l    Type
common         c    Type
comparative    c    Degree, Type
compound       c    Formation
conditional    c    VForm
coordinating   c    Type
copula         c    Type
correlat       c    Coord_Type
count          t    Number
dative         d    Case
definite       f    Class, Type
definite1      1    Class
definite2      2    Class
definite34     3    Class
delative       h    Case
demonstrative  d    Class, Type
determinal     m    Type
digit          d    Form
direct         r    Case
distributive   u    Case
dual           d    Number, Owner_Number
elative        e    Case
emphatic       h    Type
essive         w    Case
essive_formal  f    Case
exclamative    e    Type
factive        y    Case
feminine       f    Gender, Owner_Gender
first          1    Owner_Person, Person
fractal        f    Type
full_art       f    Definiteness
future         f    Tense, Type
general        g    Type
genitive       g    Case
gerund         g    VForm
illative       x    Case
imperative     m    VForm
imperfect      i    Tense
indefinite     i    Class, Type
indicative     i    VForm
inessive       2    Case
infinitive     n    Type, VForm
instrumental   i    Case
int_rel        w    Type
interrogative  q    Class, Type
komitative     k    Case
letter         l    Form
locative       l    Case
main           m    Type
masculine      m    Gender, Owner_Gender
modal          o    Type
modifier       m    Type
mood           m    Type
multiple       m    Type
multiplicative 6    Case
negative       z    Sub_Type, Type
neuter         n    Gender, Owner_Gender
no             n    Animate, Clitic, Clitic_s, Definiteness, Negative
nominal        n    Formation, Syntactic_Type
nominative     n    Case
oblique        o    Case
ordinal        o    Type
other          o    Type
participle     p    VForm
particle       p    Type
partitive      1    Case
passive        p    Voice
past           s    Tense
personal       p    Referent_Type, Type
pluperfect     l    Tense
plural         p    Number, Owned_Number, Owner_Number
positive       p    Degree, Sub_Type
possessive     s    Referent_Type, Type
postnomin      o    Modific_Type
postposition   t    Type
prenomin       e    Modific_Type
preposition    p    Type
present        p    Tense
proper         p    Type
qualificative  f    Type
quantitative   q    Referent_Type
quotative      q    VForm
reciprocal     y    Type
reflexive      x    Type
relative       r    Class, Type
repetit        r    Coord_Type
roman          r    Form
second         2    Owner_Person, Person
sentence       p    Coord_Type
short_art      s    Definiteness
simple         s    Coord_Type, Formation
singular       s    Number, Owned_Number, Owner_Number
sociative      q    Case
special        s    Type
strong         s    Pronoun_Form
subjunctive    s    Type, VForm
sublative      s    Case
subordinating  s    Type
superessive    p    Case
superlative    s    Degree
supine         u    VForm
temporalis     m    Case
terminative    9    Case
third          3    Owner_Person, Person
transgressive  t    VForm
translative    4    Case
verbal         v    Type
vocative       v    Case
weak           w    Pronoun_Form
words          w    Coord_Type
yes            y    Animate, Clitic, Clitic_s, Definiteness, Negative
\end{verbatim}
}
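
Because the same code letter is reused across attributes, a code can
only be interpreted together with its attribute. The following Python
sketch makes this explicit on a five-row excerpt of the list above; the
function name and the index layout are assumptions made for the
illustration.

```python
# Sketch: index an excerpt of the value list by (attribute, code).
# The rows are copied from the list above; the parsing scheme is hypothetical.
ROWS = """\
short_art      s    Definiteness
singular       s    Number, Owned_Number, Owner_Number
strong         s    Pronoun_Form
sublative      s    Case
superlative    s    Degree
"""


def parse_rows(text):
    """Build a {(attribute, code): value} index from value-list rows."""
    index = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        # value name, one-character code, comma-separated attributes
        value, code, attributes = line.split(None, 2)
        for attribute in attributes.split(","):
            index[(attribute.strip(), code)] = value
    return index


index = parse_rows(ROWS)
print(index[("Case", "s")])          # sublative
print(index[("Owner_Number", "s")])  # singular
print(index[("Degree", "s")])        # superlative
```

Note that for the attribute Type even the pair of attribute and code is
not unique in the list (e.g.\ the code 's' stands for 'possessive',
'special', 'subjunctive' and 'subordinating'), so a complete index would
also need the category as part of the key.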
\newpage


\section{List of attributes}

In this section, all the attributes presented in the tables are listed
in alphabetical order, together with the categories to which they are
relevant. Attributes which are not self-explanatory are then briefly
described.

\vspace{2ex}

\begin{center}
\begin{tabular}{|l l|}
\hline
{\bf Attribute} & {\bf Relevant to Category} \\
\hline
Animate       & Adj Noun Num Pron Verb\\
Case          & Abbr Adj Adpos Art Det Noun Num Pron Verb\\
Class         & Num\\
Clitic        & Adj Adpos Adv Art Conj Det Noun Num Part Pron Verb\\
Clitic\_s     & Pron\\
Coord\_Type   & Conj\\
Definiteness  & Abbr Adj Noun Num Pron Verb\\
Degree        & Adj Adv \\
Form          & Num\\
Formation     & Adj Adpos Conj Interj Part\\
Gender        & Abbr Adj Art Det Noun Num Pron Verb\\
Modific\_Type & Det\\
Negative      & Verb\\
Number        & Abbr Adj Adv Art Conj Det Noun Num Pron Verb\\
Owned\_Number & Adj Noun Num Pron\\
Owner\_Gender & Det Pron\\
Owner\_Number & Adj Det Noun Num Pron\\
Owner\_Person & Adj Noun Num Pron\\
Person        & Adv Conj Det Pron Verb\\
Pronoun\_Form & Pron\\
Referent\_Type& Pron\\
Sub\_Type     & Conj\\
Syntactic\_Type& Abbr Pron\\
Tense         & Verb\\
Type          & Adj Adpos Adv Art Conj Det Interj Noun Num Part Pron Verb\\
VForm         & Verb\\
Voice         & Verb\\
\hline
\end{tabular}
\end{center}

\begin{description}
\item[Class:] distinguishes subtypes of Numerals in Czech which
  have distinct syntactic distributions: e.g.\ subclasses for 1, 2,
  3\&4, etc.\ are distinguished.
\item[Clitic\_s:] the 'yes' value of the Clitic\_s attribute denotes 
  Czech pronouns having the clitic morpheme 's' appended as a suffix.
\item[Definiteness:] corresponds to the definite and indefinite
  article in English, which is expressed in Bulgarian by suffixes;
  these differ according to the gender and number of a word. For
  singular masculine there are two forms: full article and short
  article (full is used when a sing.masc. form is the syntactic
  subject of the clause, otherwise short article is used). The
  distinction full vs.  short is not made for feminine, neuter and
  plural forms.  Definiteness is also used in Romanian.
\item[Form:] used to distinguish different forms of numerals (Roman,
  digit, 'letter') and in Bulgarian also for the semantic distinction
  between numerals that refer to male persons (but not children) or
  groups of male+female.
\item[Formation:] refers to the graphical components: simple, i.e.\ 
  consisting of one word; compound, i.e.\ consisting of more than one
  word.
\item[Modific\_Type:] refers to the prenominal or postnominal
  positions of Determiners which distinguish different forms in
  Romanian.
\item[Negative:] the value 'yes' encodes negative verbal word-forms in
  Czech, Slovene and Estonian.
\item[Owned\_Number:] in the Hungarian system, different word-forms
  are distinguished for nominals on the basis of the so-called 'anaphoric
  possessive' number, i.e.\ the number of the thing(s) possessed by the
  nominal in question.
\item[Owner\_Gender:] used to encode the Gender of the possessor in
  Pronouns and (in Romanian) Determiners.
\item[Owner\_Number:] used to specify the possessor number in
  Pronouns, as well as (in Romanian) in Determiners, and (in
  Hungarian) in Adjectives and Nouns.
\item[Owner\_Person:] used to specify the possessor person in
  Hungarian in Adjectives and Nouns.
\item[Pronoun\_Form:] used to encode weak and strong pronouns in
  Romanian.
\item[Referent\_Type:] used to distinguish reflexive personal from
  reflexive possessive pronouns in the Slavic languages. In Bulgarian,
  it also describes a subdivision on the basis of semantic features
  which have effect on the morpho-syntactic paradigm, e.g.\ 
  quantitative: the pronoun refers to quantity, etc.
\item[Sub\_Type:] used in Romanian to distinguish negative from positive
  Conjunctions.
\item[Syntactic\_Type:] used to distinguish the nominal and adjectival
  function of Pronouns in Slovene and Czech.  Also used in
  Abbreviations to signal the Part of Speech of the abbreviation;
  currently used only by Romanian and Estonian.
\end{description}
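
The rule given above for Bulgarian Definiteness can be restated as a
short Python sketch. The codes (n, y, s, f) are those of the
Definiteness attribute; the function name and its arguments are
hypothetical, and the choice of the plain 'yes' value for definite
forms other than singular masculine is an assumption of the sketch.

```python
# Sketch of the Bulgarian Definiteness rule described above.
# Codes: n = no, y = yes, s = short_art, f = full_art.
def definiteness_code(definite, gender, number, is_subject):
    """Pick a Definiteness code for a Bulgarian word-form (illustrative)."""
    if not definite:
        return "n"
    if gender == "m" and number == "s":
        # singular masculine: full article for the syntactic subject
        # of the clause, short article otherwise
        return "f" if is_subject else "s"
    # Assumption: full vs. short is not distinguished for feminine,
    # neuter and plural forms, so they take the plain 'yes' value.
    return "y"


print(definiteness_code(True, "m", "s", True))    # f
print(definiteness_code(True, "m", "s", False))   # s
print(definiteness_code(True, "f", "s", True))    # y
print(definiteness_code(False, "m", "p", False))  # n
```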


\chapter{Language Specific Applications}
\label{chp:LSA}

\input{mteD11F-bg.tex} \newpage
\input{mteD11F-cs.tex} \newpage
\input{mteD11F-et.tex} \newpage
\input{mteD11F-hu.tex} \newpage
\input{mteD11F-ro.tex} \newpage
\input{mteD11F-sl.tex}


\chapter{References}

Bel N., Calzolari N. and Monachini M. eds. (1995): ``Common
Specifications and Notation for Lexicon Encoding and Preliminary
Proposal for the Tagsets'', MULTEXT Deliverable D1.6.1B, Pisa.

Leech G. and Wilson A. (1994): ``Morphosyntactic Annotation'',
EAGLES Interim Report, Pisa.

Monachini M. and Calzolari N. (1995): ``Synopsis and Comparison of
Morphosyntactic Phenomena Encoded in Lexicons and in Corpora and
Application to European Languages'', EAGLES document
EAG-LSG-T4.6/CSG-T3.2, Pisa.

MULTEXT (1993): ``MULTEXT Technical Annex'', Aix-en-Provence.

MULTEXT WP1.6 Report A2 (1994): ``Common Specifications and Notation
for Lexicon Encoding'', MULTEXT Report Milestone A2.

%Teufel S. (1995): ``EAGLES Validation: Tagset Mapping'', Draft Report,
%Stuttgart.

\end{document}
