Multext-East Speech Corpus

This page gives the speech corpus of the Multext-East project. It contains the 40 EUROM 'blocks', cca. 5 sentences each, in the seven languages of the project. Passages in spoken form are provided for Estonian, Hungarian, Romanian and Slovene. The digitised speech blocks were recorded in studio conditions, by professional (male) speakers and are encoded here in 16kHz, 16 bit WAV format. For Slovene, an additional speaker, S5 is provided.

CES/Speech files in HTML/WAV


Speech files only

Estonian

O0 O1 O2 O3 O4 O5 O6 O7 O8 O9
P0 P1 P2 P3 P4 P5 P6 P7 P8 P9
Q0 Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9
R0 R1 R2 R3 R4 R5 R6 R7 R8 R9

Hungarian

O0 O1 O2 O3 O4 O5 O6 O7 O8 O9
P0 P1 P2 P3 P4 P5 P6 P7 P8 P9
Q0 Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9
R0 R1 R2 R3 R4 R5 R6 R7 R8 R9

Romanian

O0 O1 O2 O3 O4 O5 O6 O7 O8 O9
P0 P1 P2 P3 P4 P5 P6 P7 P8 P9
Q0 Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9
R0 R1 R2 R3 R4 R5 R6 R7 R8 R9

Slovene

O0 O1 O2 O3 O4 O5 O6 O7 O8 O9
P0 P1 P2 P3 P4 P5 P6 P7 P8 P9
Q0 Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9
R0 R1 R2 R3 R4 R5 R6 R7 R8 R9

Slovene S5

O0 O1 O2 O3 O4 O5 O6 O7 O8 O9
P0 P1 P2 P3 P4 P5 P6 P7 P8 P9
Q0 Q1 P2 P3 P4 P5 P6 P7 P8 P9
R0 R1 R2 R3 R4 R5 R6 R7 R8 R9

[home]


Last updated 28-Dec-1997 by et