next up previous contents
Next: Application to Slovene Up: Language Specific Applications Previous: Compounding

Application to Romanian

COP project 106 MULTEXT-East Deliverable D1.1 M --- Romanian

The application to Romanian has been elaborated by D.Tufis and A.M.Barbu.

The Romanian diacritic characters used have been encoded in the following way:

1. Noun (N)

In Romanian the following attribute-value pairs are applicable to Nouns:


1.1 Type

Attribute           value                Ro. example
Type                common  (c)          carte
                    proper  (p)          Ion


1.2 Gender

Attribute           value                Ro. example
Gender              masculine  (m)       ba(iatul
                    feminine   (f)       casa
                    neuter     (n)       fir (m.sg.),fire (f.pl.)

In Romanian the declension of a neuter noun always follows in singular
a masculine paradigm and in plural a feminine one. Specific
implementations could take advantage of this rule and by organizing
the paradigmatic space in partial paradigms (masc-sing, masc-pl,
fem-sing, fem-pl) to get rid of neuter value for the gender attribute.


1.3 Number

Attribute           value                Ro. example
Number              singular  (s)        fata(
                    plural    (p)        fete


1.4 Case

There are five functional cases in Romanian (nominative, genitive,
dative, accusative and vocative) but they are expressed by at most
three syncretic forms for each number: 1. nominative-accusative;
2. genitive-dative; 3. vocative. The ambiguity of a syncretic case
form may be solved at the syntactic level. Taking into account this
syncretism we considered the following attribute-value combinations.

Attribute           value                Ro. example
Case                direct(d)            omul
                    oblique (o)          omului
                    vocative (v)         omule

At the word level, the distinction between the three syncretic forms
is possible only for the definite nouns. Masculine indefinite nouns
are not differentiated by case (except, sometimes for vocative. The
feminine indefinite nouns have different forms in singular for
nominative-accusative, genitive-dative and vocative, but the
genitive-dative singular forms are also indistinguishable from the
plural forms:

fata(      - nominative-accusative, vocative singular, indefinite
fato       - vocative singular, indefinite
fete       - genitive-dative singular, nominative-accusative, 
             genitive-dative, vocative plural, indefinite
ba(iat     - nominative-accusative, genitive-dative, vocative 
             singular, indefinite
ba(iete    - vocative singular, indefinite
ba(iet,i   - nominative-accusative, genitive-dative, vocative,
             plural, indefinite


1.5 Definiteness

In Romanian, nouns can be marked for definiteness with the enclitic
definite article.

Attribute           value                Ro. example
Definiteness        yes (y)              omul
                    no (n)               om


1.6 Clitic

See discussion in 2.10

Attribute           value                Ro. example
Clitic              no (n)               sora
                    yes (y)              soru-mea


Combinations

Here, and in all the sections to follow, the trailing hyphens are deleted.

=========================
Tag          Example              
=========================
Ncmsrn       frate
Ncmson       frate
Ncmsvn       frate
Ncmsry       fratele
Ncmsoy       fratelui
Ncmprn       frat,i
Ncmpon       frat,i
Ncmpvn       frat,i
Ncmpry       frat,ii
Ncmpoy       frat,ilor
Ncmpvy       frat,ilor
Ncfsrn       sora(
Ncfsvn       soro
Ncfson       surori
Ncfprn       surori
Ncfpon       surori
Ncfpvn       surori
Ncfsoyy      sora((-sii)
Ncfpry       surorile
Ncfpoy       surorilor
Ncfpvy       surorilor
Ncfsryy      sora((-sa)
Ncmsrn       creion
Ncmson       creion
Ncmsryy      creionu-(i)
Ncmsoy       creionului
Ncfprn       creioane
Ncfpon       creioane
Ncfpry       creioanele
Ncfpoy       creioanelor
Npfsr        Ioana
Npfso        Ioanei
Npmsrn       Bucures,ti
Npmsry       Bucures,tiul



2. Verb (V)

The following attribute-value pairs are applicable to verbs in Romanian:


2.1 Type

Attribute           value                Ro. example
Type                main (m)             a vedea
                    auxiliary (a)        a avea, a fi, a voi
                    modal (o)            a putea, a trebui
                    copulative (c)       a fi, a deveni


2.2 VForm

Traditionally, Romanian linguistics distinguishes between predicative
and non-predicative moods. This distinction may be easily mapped into
finite/non-finite dichotomy: indicative, subjunctive and imperative
are finite; infinitive, participle and gerund are non-finite (only
synthetic (non-compound) moods were mentioned; we use the opposition
synthetic-analytic to distinguish between concatenative (synthetic)
and compound (analytic) morpho-lexical phenomena).

Attribute           value                Ro. example
VForm               indicative(i)        vine
                    subjunctive (s)      vina(
                    imperative (m)       vino
                    infinitive (n)       veni
                    participle (p)       venit
                    gerund (g)           venind

As only synthetic forms were considered, the values conditional and
presumptive for the VForm attribute were left out in Romanian. Another
value for VForm which was left out is the Supine. It appears mostly
with a preposition, except for a few intransitive verbs when they are
subordinated to the impersonal verb a trebui (must). Only the
preposition allows for differentiating a supine from a
participle-masculine-singular.


2.3 Tense

The synthetic values for the attribute Tense, as listed in the table
below, apply to the indicative mood. The value p (present) is used
also for subjunctive and infinitive. All the other moods either have
no tense or have compound form tenses.

Attribute           value                Ro. example
Tense               present (p)          va(d
                    imperfect (i)        vedeam
                    past (s)             va(zui
                    pluperfect (l)       va(zusem


2.4 Person

Attribute           value                Ro. example
Person              first (1)            va(d
                    second (2)           vezi
                    third (3)            vede

The following features are pertinent to those moods which permit an
adjectival use, i.e. participle and gerund. However, the adjectival
use of gerund is extremely rare (o ma>na( tremurnda(
- a shaking hand) and therefore gender and number apply mainly for the
participle.


2.5 Number

Attribute           value                Ro. example
Number              singular (s)         ba(tut
                    plural (p)           ba(tut,i


2.6 Gender

Attribute           value                Ro. example
Gender              masculine(m)         ba(tut
                    feminine(f)          ba(tuta(
                    neuter(n)            ba(tut-ba(tute


2.7, 2.8, 2.9 Voice, Negation, Definiteness

As these feature values are either realized as compound forms or
irrelevant for Romanian, in our word-level encoding they were omitted.


2.10 Clitic 

The cliticization phenomenon in Romanian is not restricted to
verb-pronoun relationship, but may also be observed with the (main)
verb and the auxiliary, the noun or adjective with pronoun, with noun
or adjective with copula, pronoun with auxiliary, preposition with
(indefinite) article, numeral or (indefinite) pronoun, negative adverb
with verb, auxiliary or pronoun, and some others (mainly created
through the contracted forms of the verb "a fi"-to be). We restrict
ourselves to considering only the graphically marked clicitizations.
In such cases, the two, three or (sometimes) four constituents of a
cliticized word-form are always separated by a hyphen. Omitting the
hyphen in such cases is an unacceptable error in written Romanian.
The examples below illustrate the specific types of cliticization we
will consider:

da(-mi-l !                (you give +to me+it/him = give it/him to me)
la(satu-ne-ai             (left + us+ you have = you have left us)
sparge-t,i-s-ar lampa     (break + your + s-part-conj + has lamp = 
                           I wish your lamp would break)
sora(-mii                 (sister + my = to my sister)
fat,a-i                   (his/her face | the face is)
ros,u-i                   (his/her red | the red is )
m-au                      (me + they have )
i>ntr-o gaura(            (in + a hole)
i>ntr-o ora(              (in + one hour)
i>ntr-unele               (into + some of)
n-aud                     (not  + I hear = I do not hear)
n-am                      (not + I/we have = I have not )
nu-mi                     (not + to me )

The order in which different constituents of a cliticized form appear
is governed by precise morphological rules. For instance, the
auxiliaries always appear in the last position. The main verbs, except
those beginning either with the letter "a" or the letter i> and
the contracted forms (-s and -i) of the verb "a fi"- to be, always
appear in the first position, nouns and adjectives always precede the
cliticized pronouns, the negative adverbial particles nu- and n- only
appear in the first position, and so on.  However, in order to reduce
spurious ambiguity in morpho-lexical encoding, we considered the
attribute CLITIC as relevant only in those cases where the
clicitization phenomenon resulted in graphemic modification of the
cliticized term (as it is the epenthetic u in the gerund form shown in
the table below).

Attribute           value                Ro. example
Clitic              no (n)               am ridicat
                    yes (y)              ridica>ndu-l


Combinations
========================
Tag           Example
========================
Vmii1s        abandonam
Vmii2s        abandonai
Vmii3s        abandona
Vmii1p        abandonam
Vmii2p        abandonat,i
Vmii3p        abandonau
Vmis1s        abandonai
Vmis2s        abandonas,i
Vmis3s        abandona(
Vmis1p        abandonara(m
Vmis2p        abandonara(t,i
Vmis3p        abandona(
Vmil1s        abandonasem
Vmil2s        abandonases,i
Vmil3s        abandonase
Vmil1p        abandonasera(m
Vmil2p        abandonasera(t,i
Vmil3p        abandonasera(
Vmip1s        abandonez
Vmsp1s        abandonez
Vmip2s        abandonezi
Vmsp2s        abandonezi
Vmip3s        abandoneaza(
Vmip3p        abandoneaza(
Vmsp3s        abandoneze
Vmsp3p        abandoneze
Vmsp1p        abandona(m
Vmsp2p        abandonat,i
Vmm-2s        abandoneaza(
Vmm-2p        abandonat,i
Vmnp          abandona
Vmp--sm       abandonat
Vmp--sm---y   abandonatu
Vmp--sf       abandonata(
Vmp--pf       abandonate
Vmp--pm       abandonat,i
Vmg           abandona>nd
Vmg-------y   abandona>ndu
Va--1s        as,
Voip          trebuie
Vcip1s        sunt
Vcip3p        sunt
Vcip1s----y   -s
Vcip3p----y   -s



3. Adjective (A)


3.1 Type

Although it is not common practice in Romanian linguistics, one could
make the distinction between qualificative and determinative
adjectives. However, the attribute-value pairs proposed in this
section are appropriate for the qualificative adjectives.

Attribute           value                Ro. example
Type                qualificative (f)    frumos


3.2 Degree

The default value is positive; adjectives have also comparative and
superlative degrees, but in most of the cases they are expressed by
means of analytical forms (e.g. comp. mai bun (better), superl. cel
mai bun (the best)). A few adjectives have intrinsic etymological
comparative or superlative meanings (e.g. comparatives: anterior,
major; superlatives: optim, maxim, extrem etc.). The prefixes super-,
extra-, ultra- etc., are quite productive in forming the
quasi-analytic1 superlatives.  Adjectives are characterized by gender,
number and case.

Attribute           value                Ro. example
Degree              positive (p)         frumos
                    comparative (c)      ulterior
                    superlative (s)      extrem


3.3 Gender 

Attribute           value                Ro. example
Gender              masculine (m)        bun
                    feminine (f)         buna(
                    neuter (n)           sg.bun/pl.bune


3.4 Number

Attribute           value                Ro. example
Number              singular (s)         bun
                    plural (p)           buni


3.5 Case

The adjectives present the same case syncretism as the nouns, except
for few adjectives that have an additional special form for
Genitive-Dative cases in plural (e.g. G.D.pl.  multor).

Attribute           value                Ro. example
Case                direct(r)            bunul
                    oblique (o)          bunului
                    vocative (v)         bunule


3.6 Definiteness

In noun-adjective construction, the definite article may attach
enclitically to either adjectives or modified nouns (never to both of
them). If present, the definite article attaches to the right of the
first word in the sequence.

Bunul  om  (The kind man)
Omul  bun. (The kind man)

Attribute           value                Ro. example
Definiteness        yes (y)              bunul
                    no (n)               bun


3.7 Clitic 

See discussion on clitics in 2.10



Combinations
====================        
Tag            Example        
====================        
Afpmsrn        bun        
Afpmson        bun        
Afpmsvn        bun        
Afpmprn        buni        
Afpmpon        buni        
Afpmpvn        buni        
Afpmsry        bunul        
Afpmsoy        bunului        
Afpmpry        bunii        
Afpmpoy        bunilor        
Afpfsrn        buna(        
Afpfsvn        buna(        
Afpfson        bune
Afpfprn        bune
Afpfpon        bune
Afpfpvn        bune
Afpfsry        buna
Afpfsoy        bunei
Afpfpry        bunele
Afpfpoy        bunelor
Afcmsrn        ulterior
Afcmson        ulterior
Afsmsrn        extrem
Afp            gri



4. Pronoun (P)

4.1 Type

In Romanian it is worth differentiating the negative pronoun from
other indefinite pronouns: a negative pronoun cannot be an argument
for a verb unless the verb itself is negated too (e.g. Nu am
va(zut pe nimeni / *Am va(zut pe nimeni).

Attribute           value                Ro. example
Type                demonstrative(d)     acesta
                    indefinite (i)       oricine
                    possesive(s)         (al) meu
                    int_rel (w)          ce
                    personal(p)          eu
                    reflexive(x)         se
                    negative (z)         nimeni
                    emphatic (h)         i>nsumi


4.2 Person

Attribute           value                Ro. example
Person              first (1)            eu
                    second (2)           tu
                    third (3)            el


4.3 Gender

Attribute           value                Ro. example
Gender              masculine (m)        el
                    feminine (f)         ea
                    neuter (n)           sg.acesta/pl.acestea


4.4 Number

Attribute           value                Ro. example
Number              singular (s)         eu
                    plural (p)           noi


4.5 Case

For the second person of the personal pronoun in both singular and
plural there is the vocative case too.The direct and oblique values
are needed for the syncretic causal forms of the other pronouns than
the personal ones.

Attribute           value                Ro. example
Case                nominative (n)       el
                    genitive (g)         (al) lui
                    dative (d)           lui
                    accusative (a)       (pe) el
                    vocative (v)         tu, voi!
                    direct(r)            acesta
                    oblique (o)          acestuia


4.6 Owner_Number

This attribute is meaningful for the Possessive pronouns and refers to
the grammatical number of the possessor.

Attribute           value                Ro. example
Owner_Number        singular (s)         meu
                    plural (p)           nostru

The gramatical number of the possesed object(s) is expressed by the
attribute Number (described above).


4.7 Owner_Gender

This attribute is irrelevant for Romanian.


4.8 Clitic 

See discussion on clitics in 2.10.


4.9, 4.10, 4.11, 4.12, 4.13 Referent_Type, Syntactic_Type,
Definiteness, Animate, Clitic_s

These attributes are irrelevant for Romanian.


4.14 Pronoun_Form

For Romanian we need an attribute (called Pronoun_Form) to make the
distinction between strong and weak forms of the same pronoun. All the
weak forms can be adjoined to the adjacent words both proclitically or
enclitically. In such cases the junction is always graphically marked
by a hyphen between the pronoun and the neighboring word. The hyphen
also marks possible elisions from either pronoun or the adjacent word.
Although in traditional grammar books the demonstrative, int_rel and
indefinite pronouns are not characterised by person, in our
dictionaries they are recorded (for reasons beyond morpho-lexical
encoding) as 3rd person (the same as nouns). However, for the
automatic tagging this value has been marked as irrelevant.

Attribute           value                Ro. example
Pronoun_Form        strong (s)           lui
                    weak (w)             i>i, i-


Combinations
====================================
Tag                    Example
====================================
Pp1msn--------s        eu
Pp1msd--------w        mi
Pp1msd--------s        mie
Pp1msd--y-----s        mi-
Pd-msr                 acesta
Pd-mso                 acestuia
Pi-mpr                 tot,i
Ps1fsrs                mea
Pw-mso                 ca(rui
Pn-msr                 nimeni
Ph1msr                 i>nsumi
Ph1fsr                 i>nsa(mi
Px3msa--------s        sine
Px3msa--------w        se
Px3msa--y-----w        s-



5. Determiner (D)


5.1 Type

The need for a negative value of the determiners' Type attribute is
argued on the same lines as in the section on pronoun' s Type.  In
Romanian the negative determiner is expressed by the unit nici +
indefinite article (e.g. nici un, nici o).  In Romanian, there are
specific forms for the so-called emphatic determiner, which may
accompany both a noun and a personal pronoun: fata
i>nsa(s,i (the girl herself), also ea
i>nsa(s,i (she herself).

Attribute           value                Ro. example
Type                demonstrative (d)    acest
                    indefinite (i)       orice
                    possessive (s)       meu
                    int_rel (w)          ce
                    negative (z)         nici un
                    emphatic (h)         i>nsus,i


5.2 Person

This attribute is meaningful for the Possessive determiners and refers
to the grammatical person of the possessor.

Attribute           value                Ro. example
Person              first (1)            meu
                    second (2)           ta(u
                    third (3)            sa(u


5.3 Gender

Attribute           value                Ro. example
Gender              masculine (m)        meu
                    feminine (f)         mea
                    neuter (n)           sg.meu/pl.mele


5.4 Number 

Attribute           value                Ro. example
Number              singular (s)         meu
                    plural (p)           mei


5.5 Case

Attribute           value                Ro. example
Case                direct(d)            aceasta
                    oblique (o)          acestei


5.6 Owner_Number

This attribute is meaningful for the Possessive determiners and refers
to the grammatical number of the possessor.

Attribute           value                Ro. example
Owner_Number       singular (s)          meu
                   plural (p)            nostru


5.7 Owner_Gender

This attribute is irrelevant for Romanian.


5.8 Clitic

See discussion on clitics in 2.10.

Attribute           value                Ro. example
Clitic              no (n)               mama mea  
                    yes (y)              maica(- mea


5.9 Modific_Type 

As mentioned in the corresponding section on Pronoun, the Modific_Type
attribute is relevant for some determiners too. The prenominal
determiner always precedes the noun (e.g.acest ba(iat - this
boy), whereas the postnominal determiner appears only after the noun
(e.g. ba(iatul acesta - this boy).

Attribute           value                Ro. example
Modific_Type        prenominal (e)       acest
                    postnominal (o)      acesta


Combinations
=======================
Tag           Example
=======================
Dd-mso---e    acestui
Dd-mso---o    acestuia
Di-mpr        tot,i
Ds1fsrs       mea
Dw-msr        care
Dw-mso        ca(rui
Dz-msr        nici_un
Dh1msr        i>nsumi
Dh1fsr        i>nsa(mi

Although in traditional grammar books the demonstrative, indefinite
and int_rel determiners are not characterised by person, in our
dictionaries they are recorded (for reasons beyond morpho-lexical
encoding) as 3rd person (the same as nouns). However, for the
automatic tagging this value has been marked as irrelevant.



6. Article (T)


6.1 Type

Although it presents only a few items, the article in Romanian has
four types, unlike in most of the European languages. Beside the two
recommended types: definite and indefinite which have the generally
known semantic value, Romanian uses two additional types of articles,
which are semantically subordinated to the definite article but which
have special forms and meanings:

- the possessive article (also called genitival article) is an element
  in the structure of the possessive pronoun, of the ordinal numeral
  (e.g. al meu (mine) and al treilea (the third)), and of the indefinite
  genitive forms of the nouns (e.g. capitol al ca(rt,ii
  (chapter of the book)).

- the demonstrative article links a definite noun to its determinants,
  links a numeral or an adjective to a noun, and it is a constituent
  part of the relative superlative (e.g.  fata cea mare (the elder
  girl), cel lenes, (the lazy), respectively prietenul cel mai bun
  (the best friend)).

Notice that the definite article has only enclitic forms, except for
one proclitical form (lui + proper noun: lui Ion).

Attribute           value                Ro. example
Type                definite (f)         lui
                    indefinite (i)       un
                    possessive (s)       al
                    demonstrative (d)    cel


6.2 Gender

Attribute           value                Ro. example
Gender              masculine (m)        un
                    feminine (f)         o
                    neuter (n)           sg.cel/pl.cele


6.3 Number

Attribute           value                Ro. example
Number              singular (s)         un
                    plural (p)           nis,te



6.4 Case

Attribute           value                Ro. example
Case                direct(r)            cel
                    oblique (o)          celui


6.5 Clitic

The inflected forms of the foreign-origin words (mainly nouns) not
fully assimilated, are usually written with a hyphen between the
base-form and the inflectional ending.  In our encoding, we classified
these endings (which are supposed to be split by the segmenter) as
clitic articles (clitic attribute is always "y") which can be either
definite (type=f, "-istul") or indefinite (type=i, "ist") and are
characterised by gender (gender=m, "ist"; gender=f, "ista("),
number (number=s, "ist"; number=p, "is,ti") and case (case=r,
"istul"; case=o, "istului").


Combinations
=====================
Tag          Example
=====================
Tfmso        lui
Tffso        lui
Timsr        un
Tsmpr        ai
Tdfso        celei
Timsry       -ist
Timsoy       -ist
Tfmsry       -istul
Tfmsoy       -istului



7. Adverb (R)


7.1 Type

The distinction proposed here considers the principal syntactic
properties of the adverbs.  For Romanian, the general type includes
most of the pronominal adverbs (demonstrative: aici (here),
indefinite: oriunde (anywhere)). As argued before for pronouns and
determiners, a distinct negative value is needed for adverbs as well
(nica(ieri - nowhere, niciodata( - never). The particle
type covers those adverbs which can dislocate verbal compound forms
(ex. Ea a tot ca>ntat -- She has ever sung) or mark degrees
(ex. circa (about), foarte (very), prea (too)). Such adverbs are cam,
mai, prea, s,i, tot, foarte etc.  A useful distinction in
Romanian considers the adverbs which can have predicative role, that
is they can govern a subordinate sentence (ex. Fires,te
ca( o s,tiu -- Certainly I know it). Here (for uniformity
within a multilingual environment), they are squeezed into the
modifier class.  No formal distinction is made between the
interrogative adverbs and the relative ones.

Attribute           value                Ro. example
Type                general (g)          bine, acolo
                    particle (p)         mai, cam
                    negative (z)         nica(ieri
                    modifier (m)         fires,te, poate
                    int_rel(w)           cum


7.2 Degree

In Romanian, the comparative and superlative of adverbs is formed
analytically with mai (put,in), cel mai (put,in), foarte:
ex. mai repede (faster), cel mai devreme (at the
earliest). Nonetheless there are some adverbs with comparative or
superlative meaning (ex. optim, ulterior,. definitiv). These adverbs
can be used for expressing the absolute superlative of other adverbs
or adjectives: ex. extrem de bine, formidabil de frumos.

Attribute           value                Ro. example
Degree              positive (p)         bine
                    comparative (c)      ulterior
                    superlative (s)      extrem


7.3 Clitic

See discussion on clitics in 2.10.


Combinations
=====================
Tag        Example
=====================
Rgp        repede
Rgs        extraordinar
Rgc        ulterior
Rp         mai
Rz         nica(ieri
Rm         probabil
Rw         cum



8. Adposition (S)


8.1 Type

It is the preposition type which is only pertinent to Romanian,
although some intercalating adpositions may be seen as a sort of
circumposition, for instance i>ntre...s,i... 
(between..and...).

Attribute           value                Ro. example
Type                preposition (p)      la, pe, i>n


8.2 Formation 

In Romanian there is a distinct class of compound prepositions. Each
of them forms a formal and semantic unit, although graphically they
stay unfused, e.g. de la, pe la, de pe, etc.

Attribute           value                Ro. example
Formation           simple (s)           la, pe, i>n
                    compound (c)         de la


8.3 Case

This attribute marks the subcategorisation properties of the adpositions.

Attribute           value                Ro. example
Case                genitive (g)         i>naintea
                    dative (d)           datorita(
                    accusative (a)       la


8.4 Clitic 

See discussion on clitics in 2.10.


Combinations
======================
Tag         Example
======================
Spsa        i>n
Spsay       i>ntr-
Spsd        datorita(
Spca        de_la



9. Conjunction (C)


9.1 Type

The distinction between coordinating and subordinating conjunctions is
pertinent to Romanian as well.

Attribute           value                Ro. example
Type                coordinating (c)     s,i, dar
                    subordinating (s)    ca(, daca(


9.2 Formation 

Likewise prepositions, we can distinguish  two kinds of conjunctions in
Romanian:

- simple conjunctions: e.g. s,i,dar,des,i  etc.

- conjunctions formed periphrastically, with some word/phrase combined
  by a conjunction: din moment ce, fa(ra( sa(,
  fat,a( de cum etc.
         
Attribute           value                Ro. example
Formation           simple (s)           deoarece
                    compound (c)         de_vreme_ce


9.3 Coord_Type

In Romanian, there are three kinds of conjunctions depending on their
usage: as such or together with other conjunctions or adverbs:

- simple, between conjuncts: Ion s,i Maria (John and Mary);

- repetitive, before each conjunct: ori  Ion ori  Maria ori...
  (either John or Mary or...)

- correlative, before a conjoined phrase, it requires specific
  coordinators between conjuncts: ata>t mama ca>t s,i
  tata (both mother and father).
         
With respect to the place of the conjunctions, most of them stay
before the conjunct, except for: as,adar, deci (so), dar,
i>nsa( (but), daca( (if), which also appear with
expressive value inside the conjoined sentence.

Attribute           value                Ro. example
Coord-Type          simple (s)           s,i,deoarece
                    repetit (r)          fie...fie...
                    correlat (c)         ati>t...ca>t s,i


9.3 Sub_Type

In Romanian, each conjunction requires another mood, so that the
diversity may be controlled by subcategorisation rules.

Attribute           value                Ro. example
Sub-Type            negative (z)         nici
                    positive (p)         dar

This attribute distinguishes among the positive and negative
conjunctions, providing means to control verbal double negation, (as
in case of the negative pronouns, determiners and adverbs): nici NU am
venit, nimeni NU vorbes,te, nici_un tren N-a trecut,
nica(ieri N-am va(zut


9.5 Clitic

See discussion on clitics in 2.10.

Attribute           value                Ro. example
Clitic              no (n)               ca( as,a
                    yes (y)              c-as,a


Combinations
======================
Tag          Example
======================
Ccsps        s,i
Ccrps        fie...fie
Csrzs        nici...nici
Csspc        de_vreme_ce



10. Numeral (M)


10.1 Type

Traditional Romanian grammars usually distinguish seven numeral types,
where five of them have specific forms and the other two are obtained
by composition.

The first group is made up by the following numeral types: cardinal
(trei-three), ordinal (al treilea-the third), fractional (treime-one
third), multiple (i>ntreit-trine), collective
(ama>ndoi-both).

The second group contains the numeral types which are composed by
means of other parts of speech: distributive (ca>te trei-...each
three...), adverbial (de trei ori-thrice) and again the collective
numeral which also has compound forms (tot,i trei-all three).
Nonetheless, as the numerals of the second group have a weak syntactic
cohesion, namely each composition element may be regarded as an
element of the sentence, with its own grammatical function, these last
numeral types are irrelevant for the morphosyntactic annotation.

Attribute           value                Ro. example
Type                cardinal (c)         trei
                    ordinal (o)          (al) treilea
                    fractal (f)          treime
                    multiple (m)         i>ntreit
                    collect(l)           tustrei

In Romanian (as in many other languages) several numerals have noun
behaviour (some grammarians classify such numerals as nouns) with
gender and declension of their own, which they preserve even in the
composition of the superior order numerals; these are, for instance,
suta( (hundred), mie (thousand), milion (million) and miliard
(billion). In a sentence most numerals may fulfill the function of
other parts of speech like noun, determiner or adverb.


10.2 Gender

Attribute           value                Ro. example
Gender              masculine (m)        doi, primul
                    feminine  (f)        doua(, prima
                    neuter (n)           (un) milion, (doua() milioane


10.3 Number

Attribute           value                Ro. example
Number              singular (s)         primul
                    plural (p)           primii


10.4 Case

Attribute           value                Ro. example
Case                direct(r)            primul
                    oblique (o)          primului


10.5 Form

Attribute           value                Ro. example
Form                digit (d)            1690
                    letter (l)           unsprezece
                    both (b)             5 mii
                    roman (r)            XIV


10.6 Definiteness 

By virtue of their noun or adjective value, some numerals may take the
enclitic article (prim/primul - first/the first). Consequently for the
Romanian, definiteness attribute helps distinguish the enclitic forms
from the other forms.

Attribute           value                Ro. example
Definiteness        yes (y)              primul
                    no (n)               prim


10.7 Clitic

See discussion on clitics in 2.10.


Combinations
=======================
Tag           Example
=======================
Mcmprl        doi
Mcmpol        doi
Momsrl        doilea
Momsol        doilea
Mlmpr         ama>ndoi
Momsrlyy      primu-i
Mffpoly       treimilor


11. Interjections (I)

In Romanian there are no relevant  subcategories of interjections.

====================
Tag        Example
====================
I        oh,ah,au
====================



12. Residual (X)

No attributes are defined for this category.

==========================
Tag        Example
==========================
X        show, a+b, retro-
==========================



13. Abbreviation (Y)

The Syntactic_Type attribute is useful for specifying the grammatical
category of an abbreviation. Although the values for this attribute
could range over the part of speech categories in the language, in
Romanian most of the abbreviations falls into noun class.


13.1 Syntactic_Type

Attribute           value                Ro. example
Syntactic_Type      nominal(n)           d-na (doamna)
                    verbal (v)           v. (vezi)
                    adjectival (a)       ant. (anterior)
                    adverbial(r)         f. (foarte)


13.2 Gender

Attribute           value                Ro. example
Gender              masculine (m)        d-lui
                    feminine (f)         d-na
                    neuter(n)            apt.


13.3 Number

Attribute           value                Ro. example
Number              singular (s)         d-na
                    plural (p)           d-nele


13.4 Case

Attribute           value                Ro. example
Case                direct (r)           d-na
                    oblique (o)          d-nei


13.5 Definiteness

Attribute           value                Ro. example
Definiteness        yes (y)              d-nele
                    no (n)               d-ne


Combinations
======================
Tag           Example
======================
Ynmsry        d-ul
Ynfsoy        d-nei
Ynnsry        apt.



14. Particle (Q)


14.1 Type

Attribute           value                Ro. example
Type                negation(z)          nu,n-
                    infinitive (n)       a
                    subjunctive (s)      sa(
                    aspect               fi
                    future               o


14.2 Formation

This attribute is not irrelevant for Romanian.



14.3 Clitic

See  discussion on clitics in 2.10.

Attribute           value                Ro. example
Clitic              yes (y)              n-am
                    no (n)               nu am


Combinations
=================
Tag       Example
=================
Qz        nu
Qz-y      n-
Qn        a
Qs        sa(
Qa        fi
Qf        o



Tomaz Erjavec
Wed Oct 16 12:08:36 MDT 1996