next up previous contents
Next: Application to Slovene Up: Language Specific Applications Previous: Application to English

Subsections

Application to Romanian

Multext-East / Concede Morphosyntactic specifications V.2-- Romanian

Authors:
D.Tufiș and A.M.Barbu.

 

Noun (N)

In Romanian the following attribute-value pairs are applicable to Nouns:


1.1 Type

Attribute           value                Ro. example
Type                common  (c)          carte
                    proper  (p)          Ion


1.2 Gender

Attribute           value                Ro. example
Gender              masculine  (m)       băiatul
                    feminine   (f)       casa
                    neuter     (n)       fir (m.sg.),fire (f.pl.)

In Romanian the declension of a neuter noun always follows in singular
a masculine paradigm and in plural a feminine one. Specific
implementations could take advantage of this rule and by organizing
the paradigmatic space in partial paradigms (masc-sing, masc-pl,
fem-sing, fem-pl) to get rid of neuter value for the gender attribute.


1.3 Number

Attribute           value                Ro. example
Number              singular  (s)        fată
                    plural    (p)        fete


1.4 Case

There are five functional cases in Romanian (nominative, genitive,
dative, accusative and vocative) but they are expressed by at most
three syncretic forms for each number: 1. nominative-accusative;
2. genitive-dative; 3. vocative. The ambiguity of a syncretic case
form may be solved at the syntactic level. Taking into account this
syncretism we considered the following attribute-value combinations.

Attribute           value                Ro. example
Case                direct(d)            omul
                    oblique (o)          omului
                    vocative (v)         omule

At the word level, the distinction between the three syncretic forms
is possible only for the definite nouns. Masculine indefinite nouns
are not differentiated by case (except, sometimes for vocative. The
feminine indefinite nouns have different forms in singular for
nominative-accusative, genitive-dative and vocative, but the
genitive-dative singular forms are also indistinguishable from the
plural forms:

fată       - nominative-accusative, vocative singular, indefinite
fato       - vocative singular, indefinite
fete       - genitive-dative singular, nominative-accusative, 
             genitive-dative, vocative plural, indefinite
băiat      - nominative-accusative, genitive-dative, vocative 
             singular, indefinite
băiete     - vocative singular, indefinite
băieți     - nominative-accusative, genitive-dative, vocative,
             plural, indefinite


1.5 Definiteness

In Romanian, nouns can be marked for definiteness with the enclitic
definite article.

Attribute           value                Ro. example
Definiteness        yes (y)              omul
                    no (n)               om


1.6 Clitic

See discussion in 2.10

Attribute           value                Ro. example
Clitic              no (n)               sora
                    yes (y)              soru-mea


Combinations

Here, and in all the sections to follow, the trailing hyphens are deleted.

=========================
Tag          Example              
=========================
Ncmsrn       frate
Ncmson       frate
Ncmsvn       frate
Ncmsry       fratele
Ncmsoy       fratelui
Ncmprn       frați
Ncmpon       frați
Ncmpvn       frați
Ncmpry       frații
Ncmpoy       fraților
Ncmpvy       fraților
Ncfsrn       soră
Ncfsvn       soro
Ncfson       surori
Ncfprn       surori
Ncfpon       surori
Ncfpvn       surori
Ncfsoyy      soră(-sii)
Ncfpry       surorile
Ncfpoy       surorilor
Ncfpvy       surorilor
Ncfsryy      soră(-sa)
Ncmsrn       creion
Ncmson       creion
Ncmsryy      creionu-(i)
Ncmsoy       creionului
Ncfprn       creioane
Ncfpon       creioane
Ncfpry       creioanele
Ncfpoy       creioanelor
Npfsr        Ioana
Npfso        Ioanei
Npmsrn       București
Npmsry       Bucureștiul

 

Verb (V)

The following attribute-value pairs are applicable to verbs in Romanian:


2.1 Type

Attribute           value                Ro. example
Type                main (m)             a vedea
                    auxiliary (a)        a avea, a fi, a voi
                    modal (o)            a putea, a trebui
                    copulative (c)       a fi, a deveni


2.2 VForm

Traditionally, Romanian linguistics distinguishes between predicative
and non-predicative moods. This distinction may be easily mapped into
finite/non-finite dichotomy: indicative, subjunctive and imperative
are finite; infinitive, participle and gerund are non-finite (only
synthetic (non-compound) moods were mentioned; we use the opposition
synthetic-analytic to distinguish between concatenative (synthetic)
and compound (analytic) morpho-lexical phenomena).

Attribute           value                Ro. example
VForm               indicative(i)        vine
                    subjunctive (s)      vină
                    imperative (m)       vino
                    infinitive (n)       veni
                    participle (p)       venit
                    gerund (g)           venind

As only synthetic forms were considered, the values conditional and
presumptive for the VForm attribute were left out in Romanian. Another
value for VForm which was left out is the Supine. It appears mostly
with a preposition, except for a few intransitive verbs when they are
subordinated to the impersonal verb a trebui (must). Only the
preposition allows for differentiating a supine from a
participle-masculine-singular.


2.3 Tense

The synthetic values for the attribute Tense, as listed in the table
below, apply to the indicative mood. The value p (present) is used
also for subjunctive and infinitive. All the other moods either have
no tense or have compound form tenses.

Attribute           value                Ro. example
Tense               present (p)          văd
                    imperfect (i)        vedeam
                    past (s)             văzui
                    pluperfect (l)       văzusem


2.4 Person

Attribute           value                Ro. example
Person              first (1)            văd
                    second (2)           vezi
                    third (3)            vede

The following features are pertinent to those moods which permit an
adjectival use, i.e. participle and gerund. However, the adjectival
use of gerund is extremely rare (o mână tremurndă
- a shaking hand) and therefore gender and number apply mainly for the
participle.


2.5 Number

Attribute           value                Ro. example
Number              singular (s)         bătut
                    plural (p)           bătuți


2.6 Gender

Attribute           value                Ro. example
Gender              masculine(m)         bătut
                    feminine(f)          bătută
                    neuter(n)            bătut-bătute


2.7, 2.8, 2.9 Voice, Negation, Definiteness

As these feature values are either realized as compound forms or
are irrelevant for Romanian they were omitted in our word-level encoding.


2.10 Clitic 

The cliticization phenomenon in Romanian is not restricted to
verb-pronoun relationship, but may also be observed with the (main)
verb and the auxiliary, the noun or adjective with pronoun, with noun
or adjective with copula, pronoun with auxiliary, preposition with
(indefinite) article, numeral or (indefinite) pronoun, negative adverb
with verb, auxiliary or pronoun, and some others (mainly created
through the contracted forms of the verb "a fi"-to be). We restrict
ourselves to considering only the graphically marked clicitizations.
In such cases, the two, three or (sometimes) four constituents of a
cliticized word-form are always separated by a hyphen. Omitting the
hyphen in such cases is an unacceptable error in written Romanian.
The examples below illustrate the specific types of cliticization we
will consider:

dă-mi-l !                 (you give +to me+it/him = give it/him to me)
lăsatu-ne-ai              (left + us+ you have = you have left us)
sparge-ți-s-ar lampa      (break + your + s-part-conj + has lamp = 
                           I wish your lamp would break)
soră-mii                  (sister + my = to my sister)
fața-i                    (his/her face | the face is)
roșu-i                    (his/her red | the red is )
m-au                      (me + they have )
într-o gaură              (in + a hole)
într-o oră                (in + one hour)
într-unele                (into + some of)
n-aud                     (not  + I hear = I do not hear)
n-am                      (not + I/we have = I have not )
nu-mi                     (not + to me )

The order in which different constituents of a cliticized form appear
is governed by precise morphological rules. For instance, the
auxiliaries always appear in the last position. The main verbs, except
those beginning either with the letter "a" or the letter î and
the contracted forms (-s and -i) of the verb "a fi"- to be, always
appear in the first position, nouns and adjectives always precede the
cliticized pronouns, the negative adverbial particles nu- and n- only
appear in the first position, and so on.  However, in order to reduce
spurious ambiguity in morpho-lexical encoding, we considered the
attribute CLITIC as relevant only in those cases where the
clicitization phenomenon resulted in graphemic modification of the
cliticized term (as it is the epenthetic u in the gerund form shown in
the table below).

Attribute           value                Ro. example
Clitic              no (n)               am ridicat
                    yes (y)              ridicându-l


Combinations
========================
Tag           Example
========================
Vmii1s        abandonam
Vmii2s        abandonai
Vmii3s        abandona
Vmii1p        abandonam
Vmii2p        abandonați
Vmii3p        abandonau
Vmis1s        abandonai
Vmis2s        abandonași
Vmis3s        abandonă
Vmis1p        abandonarăm
Vmis2p        abandonarăți
Vmis3p        abandonă
Vmil1s        abandonasem
Vmil2s        abandonaseși
Vmil3s        abandonase
Vmil1p        abandonaserăm
Vmil2p        abandonaserăți
Vmil3p        abandonaseră
Vmip1s        abandonez
Vmsp1s        abandonez
Vmip2s        abandonezi
Vmsp2s        abandonezi
Vmip3s        abandonează
Vmip3p        abandonează
Vmsp3s        abandoneze
Vmsp3p        abandoneze
Vmsp1p        abandonăm
Vmsp2p        abandonați
Vmm-2s        abandonează
Vmm-2p        abandonați
Vmnp          abandona
Vmp--sm       abandonat
Vmp--sm---y   abandonatu
Vmp--sf       abandonată
Vmp--pf       abandonate
Vmp--pm       abandonați
Vmg           abandonând
Vmg-------y   abandonându
Va--1s        as,
Voip          trebuie
Vcip1s        sunt
Vcip3p        sunt
Vcip1s----y   -s
Vcip3p----y   -s

 

Adjective (A)

3.1 Type

Although it is not common practice in Romanian linguistics, one could
make the distinction between qualificative and determinative
adjectives. However, the attribute-value pairs proposed in this
section are appropriate for the qualificative adjectives.

Attribute           value                Ro. example
Type                qualificative (f)    frumos


3.2 Degree

The default value is positive; adjectives have also comparative and
superlative degrees, but in most of the cases they are expressed by
means of analytical forms (e.g. comp. mai bun (better), superl. cel
mai bun (the best)). A few adjectives have intrinsic etymological
comparative or superlative meanings (e.g. comparatives: anterior,
major; superlatives: optim, maxim, extrem etc.). The prefixes super-,
extra-, ultra- etc., are quite productive in forming the
quasi-analytic1 superlatives.  Adjectives are characterized by gender,
number and case.

Attribute           value                Ro. example
Degree              positive (p)         frumos
                    comparative (c)      ulterior
                    superlative (s)      extrem


3.3 Gender 

Attribute           value                Ro. example
Gender              masculine (m)        bun
                    feminine (f)         bună
                    neuter (n)           sg.bun/pl.bune


3.4 Number

Attribute           value                Ro. example
Number              singular (s)         bun
                    plural (p)           buni


3.5 Case

The adjectives present the same case syncretism as the nouns, except
for few adjectives that have an additional special form for
Genitive-Dative cases in plural (e.g. G.D.pl.  multor).

Attribute           value                Ro. example
Case                direct(r)            bunul
                    oblique (o)          bunului
                    vocative (v)         bunule


3.6 Definiteness

In noun-adjective construction, the definite article may attach
enclitically to either adjectives or modified nouns (never to both of
them). If present, the definite article attaches to the right of the
first word in the sequence.

Bunul  om  (The kind man)
Omul  bun. (The kind man)

Attribute           value                Ro. example
Definiteness        yes (y)              bunul
                    no (n)               bun


3.7 Clitic 

See discussion on clitics in 2.10



Combinations
====================        
Tag            Example        
====================        
Afpmsrn        bun        
Afpmson        bun        
Afpmsvn        bun        
Afpmprn        buni        
Afpmpon        buni        
Afpmpvn        buni        
Afpmsry        bunul        
Afpmsoy        bunului        
Afpmpry        bunii        
Afpmpoy        bunilor        
Afpfsrn        bună        
Afpfsvn        bună        
Afpfson        bune
Afpfprn        bune
Afpfpon        bune
Afpfpvn        bune
Afpfsry        buna
Afpfsoy        bunei
Afpfpry        bunele
Afpfpoy        bunelor
Afcmsrn        ulterior
Afcmson        ulterior
Afsmsrn        extrem
Afp            gri

 

Pronoun (P)

4.1 Type

In Romanian it is worth differentiating the negative pronoun from
other indefinite pronouns: a negative pronoun cannot be an argument
for a verb unless the verb itself is negated too (e.g. Nu am
văzut pe nimeni / *Am văzut pe nimeni).

Attribute           value                Ro. example
Type                demonstrative(d)     acesta
                    indefinite (i)       oricine
                    possesive(s)         (al) meu
                    int_rel (w)          ce
                    personal(p)          eu
                    reflexive(x)         se
                    negative (z)         nimeni
                    emphatic (h)         însumi


4.2 Person

Attribute           value                Ro. example
Person              first (1)            eu
                    second (2)           tu
                    third (3)            el


4.3 Gender

Attribute           value                Ro. example
Gender              masculine (m)        el
                    feminine (f)         ea
                    neuter (n)           sg.acesta/pl.acestea


4.4 Number

Attribute           value                Ro. example
Number              singular (s)         eu
                    plural (p)           noi


4.5 Case

For the second person of the personal pronoun in both singular and
plural there is the vocative case too.The direct and oblique values
are needed for the syncretic causal forms of the other pronouns than
the personal ones.

Attribute           value                Ro. example
Case                nominative (n)       el
                    genitive (g)         (al) lui
                    dative (d)           lui
                    accusative (a)       (pe) el
                    vocative (v)         tu, voi!
                    direct(r)            acesta
                    oblique (o)          acestuia


4.6 Owner_Number

This attribute is meaningful for the Possessive pronouns and refers to
the grammatical number of the possessor.

Attribute           value                Ro. example
Owner_Number        singular (s)         meu
                    plural (p)           nostru

The gramatical number of the possesed object(s) is expressed by the
attribute Number (described above).


4.7 Owner_Gender

This attribute is irrelevant for Romanian.


4.8 Clitic 

See discussion on clitics in 2.10.


4.9, 4.10, 4.11, 4.12, 4.13 Referent_Type, Syntactic_Type,
Definiteness, Animate, Clitic_s

These attributes are irrelevant for Romanian.


4.14 Pronoun_Form

For Romanian we need an attribute (called Pronoun_Form) to make the
distinction between strong and weak forms of the same pronoun. All the
weak forms can be adjoined to the adjacent words both proclitically or
enclitically. In such cases the junction is always graphically marked
by a hyphen between the pronoun and the neighboring word. The hyphen
also marks possible elisions from either pronoun or the adjacent word.
Although in traditional grammar books the demonstrative, int_rel and
indefinite pronouns are not characterised by person, in our
dictionaries they are recorded (for reasons beyond morpho-lexical
encoding) as 3rd person (the same as nouns). However, for the
automatic tagging this value has been marked as irrelevant.

Attribute           value                Ro. example
Pronoun_Form        strong (s)           lui
                    weak (w)             îi, i-


Combinations
====================================
Tag                    Example
====================================
Pp1msn--------s        eu
Pp1msd--------w        mi
Pp1msd--------s        mie
Pp1msd--y-----s        mi-
Pd-msr                 acesta
Pd-mso                 acestuia
Pi-mpr                 toți
Ps1fsrs                mea
Pw-mso                 cărui
Pn-msr                 nimeni
Ph1msr                 însumi
Ph1fsr                 însămi
Px3msa--------s        sine
Px3msa--------w        se
Px3msa--y-----w        s-

 

Determiner (D)

5.1 Type

The need for a negative value of the determiners' Type attribute is
argued on the same lines as in the section on pronoun' s Type.  In
Romanian the negative determiner is expressed by the unit nici +
indefinite article (e.g. nici un, nici o).  In Romanian, there are
specific forms for the so-called emphatic determiner, which may
accompany both a noun and a personal pronoun: fata
însăși (the girl herself), also ea
însăși (she herself).

Attribute           value                Ro. example
Type                demonstrative (d)    acest
                    indefinite (i)       orice
                    possessive (s)       meu
                    int_rel (w)          ce
                    negative (z)         nici un
                    emphatic (h)         însuși


5.2 Person

This attribute is meaningful for the Possessive determiners and refers
to the grammatical person of the possessor.

Attribute           value                Ro. example
Person              first (1)            meu
                    second (2)           tău
                    third (3)            său


5.3 Gender

Attribute           value                Ro. example
Gender              masculine (m)        meu
                    feminine (f)         mea
                    neuter (n)           sg.meu/pl.mele


5.4 Number 

Attribute           value                Ro. example
Number              singular (s)         meu
                    plural (p)           mei


5.5 Case

Attribute           value                Ro. example
Case                direct(d)            aceasta
                    oblique (o)          acestei


5.6 Owner_Number

This attribute is meaningful for the Possessive determiners and refers
to the grammatical number of the possessor.

Attribute           value                Ro. example
Owner_Number       singular (s)          meu
                   plural (p)            nostru


5.7 Owner_Gender

This attribute is irrelevant for Romanian.


5.8 Clitic

See discussion on clitics in 2.10.

Attribute           value                Ro. example
Clitic              no (n)               mama mea  
                    yes (y)              maică- mea


5.9 Modific_Type 

As mentioned in the corresponding section on Pronoun, the Modific_Type
attribute is relevant for some determiners too. The prenominal
determiner always precedes the noun (e.g.acest băiat - this
boy), whereas the postnominal determiner appears only after the noun
(e.g. băiatul acesta - this boy).

Attribute           value                Ro. example
Modific_Type        prenominal (e)       acest
                    postnominal (o)      acesta


Combinations
=======================
Tag           Example
=======================
Dd-mso---e    acestui
Dd-mso---o    acestuia
Di-mpr        toți
Ds1fsrs       mea
Dw-msr        care
Dw-mso        cărui
Dz-msr        nici_un
Dh1msr        însumi
Dh1fsr        însămi

Although in traditional grammar books the demonstrative, indefinite
and int_rel determiners are not characterised by person, in our
dictionaries they are recorded (for reasons beyond morpho-lexical
encoding) as 3rd person (the same as nouns). However, for the
automatic tagging this value has been marked as irrelevant.

 

Article (T)

6.1 Type

Although it presents only a few items, the article in Romanian has
four types, unlike in most of the European languages. Beside the two
recommended types: definite and indefinite which have the generally
known semantic value, Romanian uses two additional types of articles,
which are semantically subordinated to the definite article but which
have special forms and meanings:

- the possessive article (also called genitival article) is an element
  in the structure of the possessive pronoun, of the ordinal numeral
  (e.g. al meu (mine) and al treilea (the third)), and of the indefinite
  genitive forms of the nouns (e.g. capitol al cărții
  (chapter of the book)).

- the demonstrative article links a definite noun to its determinants,
  links a numeral or an adjective to a noun, and it is a constituent
  part of the relative superlative (e.g.  fata cea mare (the elder
  girl), cel lenes, (the lazy), respectively prietenul cel mai bun
  (the best friend)).

Notice that the definite article has only enclitic forms, except for
one proclitical form (lui + proper noun: lui Ion).

Attribute           value                Ro. example
Type                definite (f)         lui
                    indefinite (i)       un
                    possessive (s)       al
                    demonstrative (d)    cel


6.2 Gender

Attribute           value                Ro. example
Gender              masculine (m)        un
                    feminine (f)         o
                    neuter (n)           sg.cel/pl.cele


6.3 Number

Attribute           value                Ro. example
Number              singular (s)         un
                    plural (p)           niște



6.4 Case

Attribute           value                Ro. example
Case                direct(r)            cel
                    oblique (o)          celui


6.5 Clitic

The inflected forms of the foreign-origin words (mainly nouns) not
fully assimilated, are usually written with a hyphen between the
base-form and the inflectional ending.  In our encoding, we classified
these endings (which are supposed to be split by the segmenter) as
clitic articles (clitic attribute is always "y") which can be either
definite (type=f, "-istul") or indefinite (type=i, "ist") and are
characterised by gender (gender=m, "ist"; gender=f, "istă"),
number (number=s, "ist"; number=p, "iști") and case (case=r,
"istul"; case=o, "istului").


Combinations
=====================
Tag          Example
=====================
Tfmso        lui
Tffso        lui
Timsr        un
Tsmpr        ai
Tdfso        celei
Timsry       -ist
Timsoy       -ist
Tfmsry       -istul
Tfmsoy       -istului

 

Adverb (R)

7.1 Type

The distinction proposed here considers the principal syntactic
properties of the adverbs.  For Romanian, the general type includes
most of the pronominal adverbs (demonstrative: aici (here),
indefinite: oriunde (anywhere)). As argued before for pronouns and
determiners, a distinct negative value is needed for adverbs as well
(nicăieri - nowhere, niciodată - never). The particle
type covers those adverbs which can dislocate verbal compound forms
(ex. Ea a tot cântat -- She has ever sung) or mark degrees
(ex. circa (about), foarte (very), prea (too)). Such adverbs are cam,
mai, prea, și, tot, foarte etc.  A useful distinction in
Romanian considers the adverbs which can have predicative role, that
is they can govern a subordinate sentence (ex. Firește
că o știu -- Certainly I know it). Here (for uniformity
within a multilingual environment), they are squeezed into the
modifier class.  No formal distinction is made between the
interrogative adverbs and the relative ones.

Attribute           value                Ro. example
Type                general (g)          bine, acolo
                    particle (p)         mai, cam
                    negative (z)         nicăieri
                    modifier (m)         firește, poate
                    int_rel (w)          cum
                    portmanteau (c)      ca

The "portmanteau" type of adverb was introduced to cover some few
words which can be both adverbs and conjunctions (with adverbial
reading more frequent). This was necessary for tagging purposes.

7.2 Degree

In Romanian, the comparative and superlative of adverbs is formed
analytically with mai (puțin), cel mai (puțin), foarte:
ex. mai repede (faster), cel mai devreme (at the
earliest). Nonetheless there are some adverbs with comparative or
superlative meaning (ex. optim, ulterior,. definitiv). These adverbs
can be used for expressing the absolute superlative of other adverbs
or adjectives: ex. extrem de bine, formidabil de frumos.

Attribute           value                Ro. example
Degree              positive (p)         bine
                    comparative (c)      ulterior
                    superlative (s)      extrem


7.3 Clitic

See discussion on clitics in 2.10.


Combinations
=====================
Tag        Example
=====================
Rgp        repede
Rgs        extraordinar
Rgc        ulterior
Rp         mai
Rz         nicăieri
Rm         probabil
Rw         cum

 

Adposition (S)

8.1 Type

It is the preposition type which is only pertinent to Romanian,
although some intercalating adpositions may be seen as a sort of
circumposition, for instance între...și... 
(between..and...).

Attribute           value                Ro. example
Type                preposition (p)      la, pe, în


8.2 Formation 

In Romanian there is a distinct class of compound prepositions. Each
of them forms a formal and semantic unit, although graphically they
stay unfused, e.g. de la, pe la, de pe, etc.

Attribute           value                Ro. example
Formation           simple (s)           la, pe, în
                    compound (c)         de la


8.3 Case

This attribute marks the subcategorisation properties of the adpositions.

Attribute           value                Ro. example
Case                genitive (g)         înaintea
                    dative (d)           datorită
                    accusative (a)       la


8.4 Clitic 

See discussion on clitics in 2.10.


Combinations
======================
Tag         Example
======================
Spsa        în
Spsay       într-
Spsd        datorită
Spca        de_la

 

Conjunction (C)

9.1 Type

The distinction between coordinating and subordinating conjunctions is
pertinent to Romanian as well.

Attribute           value                Ro. example
Type                coordinating (c)     dar
                    subordinating (s)    că, dacă
                    portmanteau (r)        și

The "portmanteau" type of conjunction applies only to the word "și"
which can be both a coordonating conjunction and an adverb. The
distinctionamong these interpretations is rather tricky for the
average native speaker and was a constant source of noise in automatic
tagging. Therefore, for the sake of automatic processing we defined this
"portmanteau" type value.

9.2 Formation 

Likewise prepositions, we can distinguish  two kinds of conjunctions in
Romanian:

- simple conjunctions: e.g. și,dar,deși  etc.

- conjunctions formed periphrastically, with some word/phrase combined
  by a conjunction: din moment ce, fără să,
  fat,ă de cum etc.
         
Attribute           value                Ro. example
Formation           simple (s)           deoarece
                    compound (c)         de_vreme_ce


9.3 Coord_Type

In Romanian, there are three kinds of conjunctions depending on their
usage: as such or together with other conjunctions or adverbs:

- simple, between conjuncts: Ion ori Maria (John or Mary);

- repetitive, before each conjunct: fie  Ion fie  Maria fie...
  (either John or Mary or...)

- correlative, before a conjoined phrase, it requires specific
  coordinators between conjuncts: atât mama cât și
  tata (both mother and father).
         
With respect to the place of the conjunctions, most of them stay
before the conjunct, except for: așadar, deci (so), dar,
însă (but), dacă (if), which also appear with
expressive value inside the conjoined sentence.

Attribute           value                Ro. example
Coord-Type          simple (s)           dar,deoarece
                    repetit (r)          fie...fie...
                    correlat (c)         atît...cât și


9.4 Sub_Type

In Romanian, each conjunction requires another mood, so that the
diversity may be controlled by subcategorisation rules.

Attribute           value                Ro. example
Sub-Type            negative (z)         nici
                    positive (p)         dar

This attribute distinguishes among the positive and negative
conjunctions, providing means to control verbal double negation, (as
in case of the negative pronouns, determiners and adverbs): nici NU am
venit, nimeni NU vorbește, nici_un tren N-a trecut,
nicăieri N-am văzut


9.5 Clitic

See discussion on clitics in 2.10.

Attribute           value                Ro. example
Clitic              no (n)               că așa
                    yes (y)              c-așa


Combinations
======================
Tag          Example
======================
Crssp        și
Ccssp        dar
Ccrsp        fie...fie
Csrsz        nici...nici
Csscp        de_vreme_ce

 

Numeral (M)

10.1 Type

Traditional Romanian grammars usually distinguish seven numeral types,
where five of them have specific forms and the other two are obtained
by composition.

The first group is made up by the following numeral types: cardinal
(trei-three), ordinal (al treilea-the third), fractional (treime-one
third), multiple (întreit-trine), collective
(amândoi-both).

The second group contains the numeral types which are composed by
means of other parts of speech: distributive (câte trei-...each
three...), adverbial (de trei ori-thrice) and again the collective
numeral which also has compound forms (toți trei-all three).
Nonetheless, as the numerals of the second group have a weak syntactic
cohesion, namely each composition element may be regarded as an
element of the sentence, with its own grammatical function, these last
numeral types are irrelevant for the morphosyntactic annotation.

Attribute           value                Ro. example
Type                cardinal (c)         trei
                    ordinal (o)          (al) treilea
                    fractal (f)          treime
                    multiple (m)         întreit
                    collect(l)           tustrei

In Romanian (as in many other languages) several numerals have noun
behaviour (some grammarians classify such numerals as nouns) with
gender and declension of their own, which they preserve even in the
composition of the superior order numerals; these are, for instance,
sută (hundred), mie (thousand), milion (million) and miliard
(billion). In a sentence most numerals may fulfill the function of
other parts of speech like noun, determiner or adverb.


10.2 Gender

Attribute           value                Ro. example
Gender              masculine (m)        doi, primul
                    feminine  (f)        două, prima
                    neuter (n)           (un) milion, (două) milioane


10.3 Number

Attribute           value                Ro. example
Number              singular (s)         primul
                    plural (p)           primii


10.4 Case

Attribute           value                Ro. example
Case                direct(r)            primul
                    oblique (o)          primului


10.5 Form

Attribute           value                Ro. example
Form                digit (d)            1690
                    letter (l)           unsprezece
                    both (b)             5 mii
                    roman (r)            XIV


10.6 Definiteness 

By virtue of their noun or adjective value, some numerals may take the
enclitic article (prim/primul - first/the first). Consequently for the
Romanian, definiteness attribute helps distinguish the enclitic forms
from the other forms.

Attribute           value                Ro. example
Definiteness        yes (y)              primul
                    no (n)               prim


10.7 Clitic

See discussion on clitics in 2.10.


Combinations
=======================
Tag           Example
=======================
Mcmprl        doi
Mcmpol        doi
Momsrl        doilea
Momsol        doilea
Mlmpr         amândoi
Momsrlyy      primu-i
Mffpoly       treimilor

 

Interjections (I)

In Romanian there are no relevant  subcategories of interjections.

====================
Tag        Example
====================
I        oh,ah,au
====================

 

Residual (X)

No attributes are defined for this category.

==========================
Tag        Example
==========================
X        show, a+b, retro-
==========================

 

Abbreviation (Y)

The Syntactic_Type attribute is useful for specifying the grammatical
category of an abbreviation. Although the values for this attribute
could range over the part of speech categories in the language, in
Romanian most of the abbreviations falls into noun class.


13.1 Syntactic_Type

Attribute           value                Ro. example
Syntactic_Type      nominal(n)           d-na (doamna)
                    verbal (v)           v. (vezi)
                    adjectival (a)       ant. (anterior)
                    adverbial(r)         f. (foarte)
                    pronominal(p)        d-ei (dumneaei)


13.2 Gender

Attribute           value                Ro. example
Gender              masculine (m)        d-lui
                    feminine (f)         d-na
                    neuter(n)            apt.


13.3 Number

Attribute           value                Ro. example
Number              singular (s)         d-na
                    plural (p)           d-nele


13.4 Case

Attribute           value                Ro. example
Case                direct (r)           d-na
                    oblique (o)          d-nei


13.5 Definiteness

Attribute           value                Ro. example
Definiteness        yes (y)              d-nele
                    no (n)               d-ne


Combinations
======================
Tag           Example
======================
Ynmsry        d-ul
Ynfsoy        d-nei
Ynnsry        apt.

 

Particle (Q)

14.1 Type

Attribute           value                Ro. example
Type                negation(z)          nu,n-
                    infinitive (n)       a
                    subjunctive (s)      să
                    aspect               fi
                    future               o


14.2 Formation

This attribute is not relevant for Romanian.



14.3 Clitic

See  discussion on clitics in 2.10.

Attribute           value                Ro. example
Clitic              yes (y)              n-am
                    no (n)               nu am


Combinations
=================
Tag       Example
=================
Qz        nu
Qz-y      n-
Qn        a
Qs        să
Qa        fi
Qf        o

 


Tomaz Erjavec
4/1/2001