Erjavec, Tomaž

Erjavec, Tomaž
Senior Researcher

Department of Knowledge Technologies
Jožef Stefan Institute

Slovene research infrastructure CLARIN.SI
JSI Natural Language Server nl.ijs.si


  • Project MEZZANINE: Development of speech resources and technologies for Slovene
  • Project KAS: Slovene scientific texts: resources and description
  • Project Janes: Resources, Tools and Methods for the Research of Nonstandard Internet Slovene
  • Project IMP: Digital library, corpus and lexicon of historical Slovene


Research Interests

  • Text corpora and other language resources
  • Digital humanities
  • Standardisation of text encoding
  • Language technologies for Slovene
  • Machine learning methods for natural language
  • Computational morphology


Functions

Program committee member

2022
  • VarDial 2022: Ninth Workshop on NLP for Similar Languages, Varieties and Dialects
    Gyeongju, October 16, 2022.
  • CLARIN 2022: CLARIN Annual Conference 2022 (chair or programme committee)
    Prague, 10 - 12 October 2022.
  • DiPaDA 2022: Digital Parliamentary Data in Action
    Uppsala, March 15, 2022.
  • ParlaCLARIN III: Workshop on Creating, Enriching and Using Parliamentary Corpora
    Marseille, June 22, 2022.
2021
  • CLARIN 2021: CLARIN Annual Conference 2021
    September 27 - 29, 2021.
  • VarDial 2021: Eighth Workshop on NLP for Similar Languages, Varieties and Dialects
    April 19, 2021.
2020
  • VarDial 2020: Seventh Workshop on NLP for Similar Languages, Varieties and Dialects
    Barcelona, December 13, 2020.
  • CLARIN 2020: CLARIN Annual Conference 2020
    October 5 - 7, 2020.
  • LREC 2020: 12th Language Resources and Evaluation Conference
    Marseille, May 11-16 2020.
    • CMLC 8: Challenges in the management of large corpora, 2019
      May 16 2020.
    • ParlaCLARIN II: Workshop on creating, using and linking parliamentary corpora with other types of political discourse
      May 12 2020.
2019
  • CLARIN 2019: CLARIN Annual Conference 2019
    Leipzig, September 30 - October 2, 2019.
  • CMC-Corpora 2019: 7th Conference on Computer-Mediated Communication (CMC) and Social Media Corpora
    Cergy-Pontoise, September 9-10, 2019.
  • BD 2019: Biographical Data in a Digital World
    Varna, September 5-6, 2019.
  • 2nd MomenT Workshop: Multilingualism at the intersection of Knowledge Bases and Machine Translation
    Dublin, August, 19th, 2019.
  • GWC 2019: The 10th Global WordNet Conference
    Wroclaw, July 23-27, 2019.
  • BSNLP 2019: The 7th Workshop on Balto-Slavic Natural Language Processing
    Florence, August 2nd, 2019.
  • CMCL 2019: 7th Workshop on the Challenges in the Management of Large Corpora
    Cardiff, July 22nd, 2019.
  • VarDial 2019: Sixth Workshop on NLP for Similar Languages, Varieties and Dialects
    Minneapolis, June 7th, 2019.
2018
  • CLARIN 2018: CLARIN Annual Conference 2018
    Pisa, October 8-10, 2018.
  • Baltic HLT 2018: 8th Biennial Symposium "Human Language Technologies - the Baltic Perspective"
    Tartu, September 27-29, 2018.
  • JT-DH 2018: Language Technologies & Digital Humanities 2018
    Ljubljana, September 20-21, 2018.
  • SlaviCorp 2018: Slavic Corpus Linguistics Conference
    Prague, September 24–26, 2018.
  • CMC and Social Media Corpora 2018: 6th Conference on Computer-Mediated Communication (CMC) and Social Media Corpora
    Antwerp, September 17-18 2018.
  • VarDial 2018: Fifth Workshop on NLP for Similar Languages, Varieties and Dialects
    Santa Fe, around August 24–26, 2018.
  • EURALEX 2018: 18th Euralex International Congress: Lexicography in global contexts
    Ljubljana, July 17-21, 2018.
  • ACL 2018: 56th Annual Meeting of the Association for Computational Linguistics
    Melbourne, July 15-20, 2018.
  • LREC 2018: 11th Language Resources and Evaluation Conference
    Miyazaki, May 7-12, 2018.
    • MOMENT 2018: Multilingualism at the intersection of Knowledge Bases and Machine Translation
    • CMLC 2018: 6th Workshop on the Challenges in the Management of Large Corpora
  • TLT 2018: 16th International Workshop on Treebanks and Linguistic Theories
    Prague, January 23-24, 2018.
  • GWC 2018: The 9th Global WordNet Conference
    Singapore, January 8–12, 2018.
2017
  • IJCNLP 2017: The 8th International Joint Conference on Natural Language Processing
    Taipei, November 27 - December 1, 2017.
  • CMC 2017: Conference on CMC and Social Media Corpora for the Humanities
    Bolzano, October 3-4, 2017.
  • CMLC 2017: Challenges in the Management of Large Corpora + Big Data and Natural Language Processing
    Birmingham, July 24, 2017.
  • CLARIN 2017: 6th CLARIN Annual Conference
    September 18–20, 2017, Budapest, Hungary.
  • ACL 2017 co-located events:
    • BSNLP 2017: The 6th Workshop on Balto-Slavic Natural Language Processing
      Valencia, April 4th, 2017. (organiser)
    • LAW 2017: The 11th Linguistic Annotation Workshop
      Valencia, April 3rd, 2017.
    • VarDial 2017: Fourth Workshop on NLP for Similar Languages, Varieties and Dialects
      Valencia, April 3rd, 2017.
  • TLT 2017: 15th International Workshop on Treebanks and Linguistic Theories
    Bloomington, IN, January 20-21, 2017.
2016
  • VarDial 2016: Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 3)
    December 11-16 2016, Osaka, Japan
  • Obdobja 2016: Toporišičeva obdobja
    November 10-12 2016, Ljubljana
  • CLARIN 2016: 5th CLARIN Annual Conference
    October 26–28, 2016, Aix-en-Provence, France.
  • ACL 2016: Annual Meeting of the Association for Computational Linguistics
    August 7-12 2016, Berlin (reviewer for System Demonstration track).
  • LAW X 2016: The 10th Linguistic Annotation Workshop
    August 11 2016, Berlin.
  • LREC 2016: 10th Language Resources and Evaluation Conference
    May 23-28 2016, Portorož (member of local committee).
  • CMLC-4: 4th Workshop on the Challenges in the Management of Large Corpora
    May 28th 2016, Portorož.
2015
  • CoLTA 2015: International Conference on Corpus Linguistics and Technology Advancement
    December 16–18, 2015, Hong Kong.
  • TLT 2015: 14th International Workshop on Treebanks and Linguistic Theories
    December 11–12 2015, Warsaw, Poland.
  • Slovko 2015: 8th International Conference NLP, Corpus Linguistics, Lexicography
    October 21–23, 2015, Bratislava.
  • CLARIN 2015: 4th CLARIN Annual Conference
    October 15–17, 2015, Wroclaw, Poland.
  • BSNLP 2015: The 5th Biennial Workshop on Balto-Slavic Natural Language Processing
    September 10–11, 2015, Hissar, Bulgaria.
  • CMLC-3: 3rd Workshop on Challenges in the Management of Large Corpora
    July 20, 2015, Lancaster.
  • LAW 2015: The 9th Linguistic Annotation Workshop
    June 5, 2015, co-located with NAACL in Denver, Colorado, USA.
  • NetWordS 2015: NetWordS Final Conference: "Word Knowledge and Word Usage: Representations and processes in the Mental Lexicon"
    March 30th – April 1st, 2015, Pisa.
  • ConSOLE XXIII: 23rd Conference of the Student Organization of Linguistics in Europe
    January 7–9, 2015, Paris.
    (Invited speaker)
2014
  • TLT 2014: The 13th International Workshop on Treebanks and Linguistic Theories
    December 12–13, 2014, Tübingen
  • EMNLP 2014: Conference on Empirical Methods in Natural Language Processing
    October 25–29, 2014, Doha, Qatar
    (Chair for track "Phonology, morphology and segmentation")
    • LT4CloseLang 2014: Language Technology for Closely Related Languages and Language Variants
  • CoLing Workshops:
    • VarDial 2014: Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects
      August 23rd, 2014
    • LAW VIII: The 8th Linguistic Annotation Workshop
      August 23–24, 2014, Dublin
  • LREC 2014: 9th Language Resources and Evaluation Conference
    May 26–31, 2014, Reykjavik
    • LDL 2014: 3rd Workshop on Linked Data in Linguistics: Multilingual Knowledge Resources and Natural Language Processing
    • CMLC-2: Challenges in the management of large corpora
  • DATeCH 2014: Digital Access to Textual Cultural Heritage
    May 19–20, 2014, Madrid
  • MWE 2014: 10th Workshop on Multiword Expressions, EACL 2014
    April 26–27, 2014, Gothenburg
  • CICLing 2014: 15th International Conference on Intelligent Text Processing and Computational Linguistics
    April 6–12, 2014, Kathmandu
2013
  • IJCNLP 2013: 6th International Joint Conference on Natural Language Processing
    October 14–18, 2013, Nagoya, Japan
  • ACL 2013: 51st Annual Meeting of the Association for Computational Linguistics
    August 4-9, 2013, Sofia, Bulgaria
    (co-chair of track "NLP for the languages of Central and Eastern Europe and the Balkans")
  • BSNLP 2013: The 4th Biennial International Workshop on Balto-Slavic Natural Language Processing
    August 8, 2013, Sofia, Bulgaria
  • LAW VII & ID: The 7th Linguistic Annotation Workshop & Interoperability with Discourse
    August 8–9, 2013, Sofia, Bulgaria
  • ITI 2013: 35th International Conference on Information Technology Interfaces
    June 24-27, 2013, Cavtat / Dubrovnik
  • LP & IIS 2013: International Conference "Language Processing and Intelligent Information Systems"
    17–18 June 2013, Warsaw
  • NoDaLiDa 2013: The 19th Nordic Conference on Computational Linguistics
    May 22–24, 2013, Oslo
2012
  • COLING 2012: 24th International Conference on Computational Linguistics
    December 8–15 2012 September, Mumbai
  • FASSBL-8: Formal Approaches to South Slavic and Balkan Languages
    Sept. 19–21 2012, Dubrovnik
  • CLoBL 2012: Workshop on Computational Linguistics and Natural Language Processing of Balkan Languages
    at the 5th Balkan Conference in Informatics
    Sept. 16–20, 2012, Novi Sad
  • LAW 2012: The 6th Linguistic Annotation Workshop
    at the ACL 2012
    July 8–14, 2012, Jeju
  • ITI 2012: 34th International Conference on Information Technology Interfaces
    June 25–28, 2012, Cavtat / Dubrovnik
  • LREC 2012: Eight international conference on Language Resources and Evaluation
    23–25 May 2012, Istambul
  • EACL 2012: 13th Conference of the European Chapter of the Association for computational Linguistics
    23–27 April 2012, Avignon
2011
  • EPIA 2011: 15th Portuguese Conference on Artificial Intelligence
    October 10–13, 2011 – Lisbon
  • BSNLP 2011: Third International Workshop on Balto-Slavonic Natural Language Processing 2011
    September 5, 2011 – Plzeň
  • WoLeR 2011: ESSLLI Workshop on Lexical Resources
    August 1–5, 2011 – Ljubljana, Slovenia
  • ACL-HLT 2011: The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
    Portland, Oregon, USA, June 19–24, 2011
  • LAW 2011: 5th Linguistic Annotation Workshop
    Portland, Oregon, USA, June 23–24, 2011
2010
  • TEI 2010: The 2010 Conference and Members’ Meeting of the Text Encoding Initiative Consortium
    November 8–14 2010, Zadar, Croatia
  • ACM CIKM 2010: The 19th ACM International Conference on Information and Knowledge Management
    October 26–30, 2010 Toronto, Canada
  • EMNLP 2010: Conference on Empirical Methods in Natural Language Processing
    October 9–11, 2010 — MIT, Massachusetts, USA.
  • ACL 2010: 48th Annual Meeting of the Association for Computational Linguistics
    July 11–16, 2010, Uppsala, Sweden
  • LAW IV: The Fourth Linguistic Annotation Workshop
    July 15–16, 2010, Uppsala, Sweden
  • LREC 2010: Seventh international conference on Language Resources and Evaluation
    19-21 May 2010, La Valleta, Malta
  • LREC Workshop: Exploitation of Multilingual Resources and Tools for Central and (South-) Eastern European Languages
    23rd May 2010, La Valleta, Malta

Member of Societies



Contact info

URL http://nl.ijs.si/et/
E-mail tomaz.erjavec@ijs.si
S-mail Jožef Stefan Institute
Jamova cesta 39
SI-1000 Ljubljana
Slovenia
Fax +386 1 477 33 15
Phone +386 1 477 35 07 (direct line)
+386 1 477 31 75 (dept. secretary)
Location
[visitors info, photo, map]
Room S-35 (directions)

Valid HTML5!
Page last updated 2023-02-09 by et.