Zbornik 5. slovenske in 1. mednarodne konference JEZIKOVNE TEHNOLOGIJE 2006

Proceedings of the 5th Slovenian and 1st International Language Technologies Conference 2006

IS-LTC'06

9. - 10. oktober 2006 / October 9th - 10th 2006
Jožef Stefan Institute, Ljubljana, Slovenia

Uredila / Edited by
Tomaž Erjavec, Jerneja Žganec Gros

ISBN-10 961-6303-83-X
ISBN-13 978-961-6303-83


Kazalo / Table of Contents

All the papers contain Slovene and English abstracts.


VABLJENI PRISPEVKI / INVITED CONTRIBUTIONS

Steven Krauwer:
Strengthening the Smaller Languages in Europe
Nick Campbell:
Speech Synthesis and Discourse Information

PRISPEVKI / CONTRIBUTIONS

Korbinian Riedhammer, Tino Haderlein, Maria Schuster, Frank Rosanowski, Elmar Nöth:
Automatic Evaluation of Tracheoesophageal Telephone Speech
Andras Banhalmi, Denes Paczolay, Laszlo Toth, Andras Kocsor:
First Results of a Hungarian Medical Dictation Project
Margus Treumuth, Tanel Alumäe, Einar Meister:
A Natural Language Interface to a Theater Information Database
Andreas Maier, Elmar Nöth, Emeka Nkenke, Maria Schuster:
Automatic Assessment of Children's Speech with Cleft Lip and Palate
Martin Karafiat, Frantisek Grezl, Petr Schwarz, Lukas Burget, Jan Cernocky:
Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition
Nikša Jakovljević, Dragiša Mišković, Milan Sečujski, Darko Pekar:
Vocal Tract Normalization Based on Formant Positions
Jerneja Žganec Gros, Varja Cvetko-Orešnik, Primož Jakopin:
SI-PRON: a Comprehensive Pronunciation Lexicon for Slovenian
Darinka Verdonik:
Pragmatically annotated corpora in speech-to-speech translation
Atanas Chanev:
Studying the Learning Curves of a Statistical Dependency Parser for Four Languages
Simon Krek, Adam Kilgarriff:
Slovene Word Sketches
Matthias Richter, Uwe Quasthoff, Erla Hallsteinsdóttir, Chris Biemann:
Exploiting the Leipzig Corpora Collection
Michael Pucher, Yan Huang, Ozgur Cetin:
Optimization of Latent Semantic Analysis based Language Model Interpolation for Meeting Recognition
Isabel Segura Bedmar, Jose L. Martinez-Fernandez, Paloma Martinez:
Including deeper semantic information in the Lexical Markup Framework: a proposal
Mehdi M. Kashani, Fred Popowich:
Pronoun Generation for Text Summarization and Question Answering
Simon Dobrišek, Boštjan Vesnicer, Jerneja Žganec Gros, France Mihelič:
Uporaba kanoničnega govornega akustičnega modela za prilagajanje prostora govornih akustičnih značilk
(Adaptation of Acoustic Feature Space Using Canonical Acoustic Model)
Špela Arhar, Miro Romih:
Klepec: programirani sogovornik za slovenščino
(Klepec: a Slovene chatbot program)
Andrej Žgank, Tomaž Rotovnik, Mirjam Sepesy Maučec, Zdravko Kačič:
Osnovna zgradba razpoznavalnika slovenskega tekočega govora UMB Broadcast News
(Basic Structure of the UMB Slovenian Broadcast News Transcription System)
Melita Hajdinjak, France Mihelič:
Rezultati vrednotenja dveh sistemov Čarovnik iz Oza
(Results from the Evaluation of two Wizard-of-Oz Systems)
Melita Hajdinjak, France Mihelič:
Vrednotenje govornih vmesnikov z ogrodjem PARADISE
(Speech-interface evaluation using the PARADISE framework)
Andrej Žgank, Tomaž Rotovnik, Matej Grašič, Marko Kos, Damjan Vlaj, Zdravko Kačič:
Slovenska govorna in tekstovna baza parlamentarnih razprav za avtomatsko razpoznavanje govora
(Slovenian parliamentary debates speech and text database for automatic speech recognition)
Jasna Belc, Miran Željko:
Načelo večjezičnosti ali večjezični korpus iz manjše množice dvojezičnih
(The Principle of Multilingualism, or a Multilingual Corpus from a Smaller Set of Bilingual Ones)
Jana Zemljarič Miklavčič:
Korpus govorjene slovenščine
(Spoken Corpus of Slovene)
Agnes Pisanski Peterlin:
Iskanje pragmatičnih enot v neoznačenem korpusu: primer kažipotov
(Search for Pragmatic Units in an Untagged Corpus: The Signpost Case)
Mojca Stritar:
Oblikovanje korpusa usvajanja slovenščine kot tujega jezika
(Slovene learner corpus design)
Aneta Ivanovska, Katerina Zdravkova, Tomaž Erjavec, Sašo Džeroski:
Learning rules for morphological analysis and synthesis of Macedonian nouns, adjectives and verbs
Peter Holozan:
Dodatne dvoumnosti zaradi popustljivosti analizatorja pri analizi slovenskih stavkov
(Additional ambiguities caused by the Slovenian analyser's permissiveness)
Mihael Arčan, Špela Vintar:
Avtomatično prepoznavanje lastnih imen
(Named Entity Recognition)
Katarina Puc, Tomaž Erjavec:
Uporaba korpusa pri urejanju spletnega terminološkega slovarja
(Using a corpus for editing an on-line terminological dictionary)
Tomaž Erjavec, Nina Ledinek:
Slovenska odvisnostna drevesnica: prvi rezultati
(Slovene Dependency Treebank: first results)
Tomaž Erjavec, Bence Sárossy:
Oblikoslovno označevanje slovenskega jezika: primer korpusa SVEZ-IJS
(Word-class Syntactic Tagging of Slovene: the case of the SVEZ-IJS corpus)
Ralf Engel:
SPIN: A Semantic Parser for Spoken Dialog Systems
Jori Mur:
Increasing the coverage of answer extraction by applying anaphora resolution
Antoine Doucet, Helena Ahonen-Myka:
Fast extraction of discontiguous sequences in text: a new approach based on maximal frequent sequences
Cvetana Krstev, Duško Vitas:
Finite State Transducers for Recognition and Generation of Compound Words
Sanja Seljan:
The Role of the Lexicon in Lexical Functional Grammar - Example of Croatian
Luboš Popelínský, Jan Blaťák:
Mining actions from reports on flood
Daniel Sonntag:
Towards Combining Finite State, Ontologies, and Data Driven Approaches to Dialogue Management for Multimodal Question Answering
Darja Fišer, Špela Vintar, Ljupčo Todorovski:
Towards clustering-based word sense discrimination
Mirjam Sepesy Maučec, Janez Brest, Zdravko Kačič:
Slovenian to English Machine Translation using Corpora of Different Sizes and Morpho-syntactic Information
Milan Sečujski, Vlado Delić:
A Software Tool for Semi-Automatic Part-of-Speech Tagging and Sentence Accentuation in Serbian Language
Jerneja Žganec Gros, Vlado Delić, Darko Pekar, Milan Sečujski, Aleš Mihelič:
The iTEMA e-mail reader
Jerneja Žganec Gros, Stanislav Gruden, France Mihelič, Tomaž Erjavec, Špela Vintar, Peter Holozan, Aleš Mihelič, Simon Dobrišek, Janez Žibert, Nataša Logar, Tomo Korošec:
The VoiceTRAN Speech Translation Demonstrator
Anton Batliner, Stefan Steidl, Björn Schuller, Dino Seppi, Kornel Laskowski, Thurid Vogt, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loic Kessous, Vered Aharonson:
Combining Efforts for Improving Automatic Classification of Emotional User States
Anton Batliner, Felix Burkhardt, Markus van Ballegooy, Elmar Nöth:
A Taxonomy of Applications that Utilize Emotional Awareness
Sanda Martinčić-Ipšić, Ivo Ipšić:
Context-Dependent Acoustic Modelling of Croatian Speech
Vlado Delić, Milan Sečujski, Darko Pekar, Nikša Jakovljević, Dragiša Mišković:
A Review of AlfaNum Speech Technologies for Serbian, Croatian and Macedonian
Milan Rusko, Marian Trnka, Sakhia Darjaa:
Slovak TTS - From Rule Based To Unit Selection
Wesley Mattheyses, Lukas Latacz, Yuk On Kong, Werner Verhelst:
A Flemish Voice for the Nextens Text-To-Speech System
Jan Macek, Julie Carson-Berndsen:
Articulatory Manner Features Recognition with Linear and Polynomial Kernels
Grega Milharčič, Janez Žibert, France Mihelič:
Statistical Language Modeling of SiBN Broadcast News Text Corpus

KAZALO AVTORJEV / AUTHOR INDEX

Aharonson Vered Combining Efforts for Improving Automatic Classification of Emotional User States
Ahonen-Myka Helena Fast extraction of discontiguous sequences in text: a new approach based on maximal frequent sequences
Alumäe Tanel A Natural Language Interface to a Theater Information Database
Amir Noam Combining Efforts for Improving Automatic Classification of Emotional User States
Arčan Mihael Avtomatično prepoznavanje lastnih imen
Arhar Špela Klepec: programirani sogovornik za slovenščino
Ballegooy Markus van A Taxonomy of Applications that Utilize Emotional Awareness
Banhalmi Andras First Results of a Hungarian Medical Dictation Project
Batliner Anton Combining Efforts for Improving Automatic Classification of Emotional User States
A Taxonomy of Applications that Utilize Emotional Awareness
Belc Jasna Načelo večjezičnosti ali večjezični korpus iz manjše množice dvojezičnih
Biemann Chris Exploiting the Leipzig Corpora Collection
Blaťák Jan Mining actions from reports on flood
Brest Janez Slovenian to English Machine Translation using Corpora of Different Sizes and Morpho-syntactic Information
Burget Lukas Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition
Burkhardt Felix A Taxonomy of Applications that Utilize Emotional Awareness
Campbell Nick Speech Synthesis and Discourse Information
Carson-Berndsen Julie Articulatory Manner Features Recognition with Linear and Polynomial Kernels
Cernocky Jan Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition
Cetin Ozgur Optimization of Latent Semantic Analysis based Language Model Interpolation for Meeting Recognition
Chanev Atanas Studying the Learning Curves of a Statistical Dependency Parser for Four Languages
Cvetko-Orešnik Varja SI-PRON: a Comprehensive Pronunciation Lexicon for Slovenian
Delić Vlado A Software Tool for Semi-Automatic Part-of-Speech Tagging and Sentence Accentuation in Serbian Language
The iTEMA e-mail reader
A Review of AlfaNum Speech Technologies for Serbian, Croatian and Macedonian
Devillers Laurence Combining Efforts for Improving Automatic Classification of Emotional User States
Dobrišek Simon Uporaba kanoničnega govornega akustičnega modela za prilagajanje prostora govornih akustičnih značilk
The VoiceTRAN Speech Translation Demonstrator
Doucet Antoine Fast extraction of discontiguous sequences in text: a new approach based on maximal frequent sequences
Džeroski Sašo Learning rules for morphological analysis and synthesis of Macedonian nouns, adjectives and verbs
Engel Ralf SPIN: A Semantic Parser for Spoken Dialog Systems
Erjavec Tomaž Learning rules for morphological analysis and synthesis of Macedonian nouns, adjectives and verbs
Uporaba korpusa pri urejanju spletnega terminološkega slovarja
Slovenska odvisnostna drevesnica: prvi rezultati
Oblikoslovno označevanje slovenskega jezika: primer korpusa SVEZ-IJS
The VoiceTRAN Speech Translation Demonstrator
Fišer Darja Towards clustering-based word sense discrimination
Grašič Matej Slovenska govorna in tekstovna baza parlamentarnih razprav za avtomatsko razpoznavanje govora
Grezl Frantisek Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition
Gruden Stanislav The VoiceTRAN Speech Translation Demonstrator
Haderlein Tino Automatic Evaluation of Tracheoesophageal Telephone Speech
Hajdinjak Melita Rezultati vrednotenja dveh sistemov Čarovnik iz Oza
Vrednotenje govornih vmesnikov z ogrodjem PARADISE
Hallsteinsdóttir Erla Exploiting the Leipzig Corpora Collection
Holozan Peter Dodatne dvoumnosti zaradi popustljivosti analizatorja pri analizi slovenskih stavkov
The VoiceTRAN Speech Translation Demonstrator
Huang Yan Optimization of Latent Semantic Analysis based Language Model Interpolation for Meeting Recognition
Ipšić Ivo Context-Dependent Acoustic Modelling of Croatian Speech
Ivanovska Aneta Learning rules for morphological analysis and synthesis of Macedonian nouns, adjectives and verbs
Jakopin Primož SI-PRON: a Comprehensive Pronunciation Lexicon for Slovenian
Jakovljević Nikša Vocal Tract Normalization Based on Formant Positions
A Review of AlfaNum Speech Technologies for Serbian, Croatian and Macedonian
Kačič Zdravko Osnovna zgradba razpoznavalnika slovenskega tekočega govora UMB Broadcast News
Slovenska govorna in tekstovna baza parlamentarnih razprav za avtomatsko razpoznavanje govora
Slovenian to English Machine Translation using Corpora of Different Sizes and Morpho-syntactic Information
Karafiat Martin Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition
Kashani Mehdi M. Pronoun Generation for Text Summarization and Question Answering
Kessous Loic Combining Efforts for Improving Automatic Classification of Emotional User States
Kilgarriff Adam Slovene Word Sketches
Kocsor Andras First Results of a Hungarian Medical Dictation Project
Kong Yuk On A Flemish Voice for the Nextens Text-To-Speech System
Korošec Tomo The VoiceTRAN Speech Translation Demonstrator
Kos Marko Slovenska govorna in tekstovna baza parlamentarnih razprav za avtomatsko razpoznavanje govora
Krauwer Steven Strengthening the Smaller Languages in Europe
Krek Simon Slovene Word Sketches
Krstev Cvetana Finite State Transducers for Recognition and Generation of Compound Words
Laskowski Kornel Combining Efforts for Improving Automatic Classification of Emotional User States
Latacz Lukas A Flemish Voice for the Nextens Text-To-Speech System
Ledinek Nina Slovenska odvisnostna drevesnica: prvi rezultati
Logar Nataša The VoiceTRAN Speech Translation Demonstrator
Macek Jan Articulatory Manner Features Recognition with Linear and Polynomial Kernels
Maier Andreas Automatic Assessment of Children's Speech with Cleft Lip and Palate
Marian Trnka Slovak TTS - From Rule Based To Unit Selection
Martinčić-Ipšić Sanda Context-Dependent Acoustic Modelling of Croatian Speech
Martinez Paloma Including deeper semantic information in the Lexical Markup Framework: a proposal
Martinez-Fernandez Jose L. Including deeper semantic information in the Lexical Markup Framework: a proposal
Mattheyses Wesley A Flemish Voice for the Nextens Text-To-Speech System
Meister Einar A Natural Language Interface to a Theater Information Database
Mihelič Aleš The iTEMA e-mail reader
The VoiceTRAN Speech Translation Demonstrator
Mihelič France Uporaba kanoničnega govornega akustičnega modela za prilagajanje prostora govornih akustičnih značilk
Rezultati vrednotenja dveh sistemov Čarovnik iz Oza
Vrednotenje govornih vmesnikov z ogrodjem PARADISE
The VoiceTRAN Speech Translation Demonstrator
Statistical Language Modeling of SiBN Broadcast News Text Corpus
Milan Rusko Slovak TTS - From Rule Based To Unit Selection
Milharčič Grega Statistical Language Modeling of SiBN Broadcast News Text Corpus
Mišković Dragiša Vocal Tract Normalization Based on Formant Positions
A Review of AlfaNum Speech Technologies for Serbian, Croatian and Macedonian
Mur Jori Increasing the coverage of answer extraction by applying anaphora resolution
Nkenke Emeka Automatic Assessment of Children's Speech with Cleft Lip and Palate
Nöth Elmar Automatic Evaluation of Tracheoesophageal Telephone Speech
Automatic Assessment of Children's Speech with Cleft Lip and Palate
A Taxonomy of Applications that Utilize Emotional Awareness
Paczolay Denes First Results of a Hungarian Medical Dictation Project
Pekar Darko Vocal Tract Normalization Based on Formant Positions
The iTEMA e-mail reader
A Review of AlfaNum Speech Technologies for Serbian, Croatian and Macedonian
Pisanski Peterlin Agnes Iskanje pragmatičnih enot v neoznačenem korpusu: primer kažipotov
Popelínský Luboš Mining actions from reports on flood
Popowich Fred Pronoun Generation for Text Summarization and Question Answering
Puc Katarina Uporaba korpusa pri urejanju spletnega terminološkega slovarja
Pucher Michael Optimization of Latent Semantic Analysis based Language Model Interpolation for Meeting Recognition
Quasthoff Uwe Exploiting the Leipzig Corpora Collection
Richter Matthias Exploiting the Leipzig Corpora Collection
Riedhammer Korbinian Automatic Evaluation of Tracheoesophageal Telephone Speech
Romih Miro Klepec: programirani sogovornik za slovenščino
Rosanowski Frank Automatic Evaluation of Tracheoesophageal Telephone Speech
Rotovnik Tomaž Osnovna zgradba razpoznavalnika slovenskega tekočega govora UMB Broadcast News
Slovenska govorna in tekstovna baza parlamentarnih razprav za avtomatsko razpoznavanje govora
Sakhia Darjaa Slovak TTS - From Rule Based To Unit Selection
Sárossy Bence Oblikoslovno označevanje slovenskega jezika: primer korpusa SVEZ-IJS
Schuller Björn Combining Efforts for Improving Automatic Classification of Emotional User States
Schuster Maria Automatic Evaluation of Tracheoesophageal Telephone Speech
Automatic Assessment of Children's Speech with Cleft Lip and Palate
Schwarz Petr Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition
Sečujski Milan Vocal Tract Normalization Based on Formant Positions
A Software Tool for Semi-Automatic Part-of-Speech Tagging and Sentence Accentuation in Serbian Language
The iTEMA e-mail reader
A Review of AlfaNum Speech Technologies for Serbian, Croatian and Macedonian
Segura Bedmar Isabel Including deeper semantic information in the Lexical Markup Framework: a proposal
Seljan Sanja The Role of the Lexicon in Lexical Functional Grammar - Example of Croatian
Sepesy Maučec Mirjam Osnovna zgradba razpoznavalnika slovenskega tekočega govora UMB Broadcast News
Slovenian to English Machine Translation using Corpora of Different Sizes and Morpho-syntactic Information
Seppi Dino Combining Efforts for Improving Automatic Classification of Emotional User States
Sonntag Daniel Towards Combining Finite State, Ontologies, and Data Driven Approaches to Dialogue Management for Multimodal Question Answering
Steidl Stefan Combining Efforts for Improving Automatic Classification of Emotional User States
Stritar Mojca Oblikovanje korpusa usvajanja slovenščine kot tujega jezika
Todorovski Ljupčo Towards clustering-based word sense discrimination
Toth Laszlo First Results of a Hungarian Medical Dictation Project
Treumuth Margus A Natural Language Interface to a Theater Information Database
Verdonik Darinka Pragmatically annotated corpora in speech-to-speech translation
Verhelst Werner A Flemish Voice for the Nextens Text-To-Speech System
Vesnicer Boštjan Uporaba kanoničnega govornega akustičnega modela za prilagajanje prostora govornih akustičnih značilk
Vidrascu Laurence Combining Efforts for Improving Automatic Classification of Emotional User States
Vintar Špela Avtomatično prepoznavanje lastnih imen
Towards clustering-based word sense discrimination
The VoiceTRAN Speech Translation Demonstrator
Vitas Duško Finite State Transducers for Recognition and Generation of Compound Words
Vlaj Damjan Slovenska govorna in tekstovna baza parlamentarnih razprav za avtomatsko razpoznavanje govora
Vogt Thurid Combining Efforts for Improving Automatic Classification of Emotional User States
Zdravkova Katerina Learning rules for morphological analysis and synthesis of Macedonian nouns, adjectives and verbs
Zemljarič Miklavčič Jana Korpus govorjene slovenščine
Željko Miran Načelo večjezičnosti ali večjezični korpus iz manjše množice dvojezičnih
Žganec Gros Jerneja SI-PRON: a Comprehensive Pronunciation Lexicon for Slovenian
Uporaba kanoničnega govornega akustičnega modela za prilagajanje prostora govornih akustičnih značilk
The iTEMA e-mail reader
The VoiceTRAN Speech Translation Demonstrator
Žgank Andrej Osnovna zgradba razpoznavalnika slovenskega tekočega govora UMB Broadcast News
Slovenska govorna in tekstovna baza parlamentarnih razprav za avtomatsko razpoznavanje govora
Žibert Janez The VoiceTRAN Speech Translation Demonstrator
Statistical Language Modeling of SiBN Broadcast News Text Corpus


Page last updated 2006-10-14, et