IWCLUL 2020: Sixth International Workshop on Computational Linguistics of Uralic

The Sixth International Workshop on Computational Linguistics for Uralic Languages was organised by ACL SIGUR and the Department of Finno-Ugrian Studies at the University of Vienna. It was held on 10-11 January 2020 in Vienna, Austria.

Proceedings

More information to follow.

Programme

10 January (Friday): Workshop
9:00Registration
10:00Opening
10:15Timofey Arkhangelskiy: Digital communication corpora for Volga-Kama Uralic languages - what can we learn from them (invited talk)
11:00Coffee break
11:30Rogier Blokland, Niko Partanen and Michael Rießler: A pseudonymization method for language documentation corpora
12:00Nick Howell, Maria Bibaeva and Francis M. Tyers: Effort-value payoff in lemmatisation for Uralic languages
12:30Jack Rueter, Niko Partanen and Larisa Ponomareva: On the questions in developing computational infrastructure for Komi-Permyak
13:00Lunch
14:00Khalid Alnajjar, Mika Hämäläinen and Jack Rueter: On Editing Dictionaries for Uralic Languages in an Online Environment
14:30Nils Hjortnaes, Niko Partanen, Michael Rießler and Francis M. Tyers: Towards a Speech Recognizer for Komi, an Endangered and Low-Resource Uralic Language
15:00Coffee break
15:30László Fejes: Hunting for antiharmonic stems in Erzya
16:00Gábor Bobály, Csilla Horváth and Veronika Vincze: apPILcation: an Android-based Tool for Learning Mansi
16:30Closing
17:00SIGUR business meeting
20:00Dinner at Pizzeria Scarabocchio (Florianigasse 3, 1080 Wien)

  

11 January (Saturday): Tutorials
10:00Timofey Arkhangelskiy: Corpora of Uralic Volga-Kama Languages
11:30Coffee break
11:45Niko Partanen: Optical Character Recognition
13:15Lunch
14:30Mika Hämäläinen, Khalid Alnajjar: Uralic NLP package
16:00Closing

Venue

Abteilung Finno-Ugristik
Universität Wien
Campus AAKH, Hof 7-2
Spitalgasse 2-4
1090 Wien
AUSTRIA

Geographical coordinates: 48.217450, 16.353900

Conference location on Open Street Map

Registration

To register for the workshop please fill out the registration form. Please note that there is a 10 € participation fee (to be paid in cash at arrival) to cover running costs of the conference.

Call for papers

The purpose of the conference series International Workshop on Computational Linguistics for Uralic Languages is to bring together researchers working on computational approaches to working with these languages. We accept long and short papers as well as tutorial proposals working on the following languages: Finnish, Hungarian, Estonian, Võro, the Sámi languages, Komi (Zyrian, Permyak), Mordvin (Erzya, Moksha), Mari (Hill, Meadow), Udmurt, Nenets (Tundra, Forest), Enets, Nganasan, Selkup, Mansi, Khanty, Veps, Karelian (Olonets), Karelian, Ingrian (Izhorian), Votic, Livonian, Ludic, and other related languages. All Uralic languages exhibit rich morphological structure, which makes processing them challenging for state-of-the-art computational linguistic approaches, the majority also suffer from a lack of resources and many are endangered. Research papers should be original, substantial and unpublished research, that can describe work-in-progress systems, frameworks, standards and evaluation schemes. Demos and tutorials will present systems and standards towards the goal of interoperability and unification of different projects, applications and research groups Appropriate topics include (but are not limited to):

     

  • Parsers, analysers and processing pipelines of Uralic languages
  • Lexical databases, electronic dictionaries
  • Finished end-user applications aimed at Uralic languages, such as spelling or grammar checkers, machine translation or speech processing
  • Evaluation methods and gold standards, tagged corpora, treebanks
  • Reports on language-independent or unsupervised methods as applied to Uralic languages
  • Surveys and review articles on subjects related to computational linguistics for one or more Uralic languages
  • Any work that aims at combining efforts and reducing duplication of work
  • How to elicit activity from the language community, agitation campaigns, games with a purpose

To maximise the possibility of reproducibility, replication and reuse, we particularly encourage submissions which present free/open-source language resources and make use of free/open-source software. One of the aims of this gathering is to avoid unnecessary duplicated work in field of Uralistics by establishing connections and interoperability standards between researchers and research groups working at different sites. We have also identified a serious lack of gold standards and evaluation metrics for all Uralic languages including those with national support, any work towards better resources in these fields will be greatly appreciated.

Important dates

     

  • August 21, 2019: Call for papers announced
  • November 18 November 25, 2019: Paper submission deadline
  • December 6, 2019: Paper notification
  • December 21 December 31, 2019: Camera-ready deadline
  • December 31, 2019: Fill in the registration form
  • January 10-11, 2020: Workshop held in Vienna

Submission of papers

Language of submission: Submissions should be made in English or Russian with an obligatory abstract in at least one of the Uralic Language(s).

Submission format:

     

  • Papers should be up to 8 pages in length excluding references.
  • LaTeX submissions are preferred, but submissions using the Microsoft Word template are possible as well. The first draft has to be submitted as a PDF
  • Our templates are based on the ACL templates; author instructions can be found on the ACL page. The LaTeX templates can be found here, the MS Word template here.
  • Citations in LaTeX submissions should be managed with bibtex.
  • Linguistic glosses should follow Leipzig glossing rules; LaTeX submissions should and use the expex package (make sure to update expex regularly as it is developed actively).
  • The preferred LaTeX version is XeLaTeX; you should use UTF-8 encoded Unicode in your sources rather than TeX encoded characters where possible.
  •  

Submissions can be made here using the EasyChair conference management system.

Publication venue: TBA

Conflicts of interest: The reviewing process will be anonymous (double-blind peer review).

Double submission: To maximise the impact of work in the field of computational linguistics for the Uralic languages we are open to the possibility of double submission, or submission of work which has been partially published elsewhere. Any double submission should however be reported to the programme committee at the time of submission. In the advent of double acceptance the authors should choose in which venue to publish.

Programme committee

  • Tommi A. Pirinen, University of Hamburg
  • Francis M. Tyers, Indiana University Bloomington
  • Jeremy Bradley, University of Vienna
  • Eszter Simon, Research Institute for Linguistics, Hungarian Academy of Sciences
  • Kadri Muischnek, University of Tartu
  • Michael Rießler, University of Eastern Finland
  • Filip Ginter, University of Turku
  • Timofey Arkhangelskiy, University of Hamburg / Alexander von Humboldt Foundation
  • Veronika Vincze, Research Group on Articial Intelligence, Hungarian Academy of Sciences
  • Roman Yangarber, University of Helsinki
  • Miikka Silfverberg, University of Helsinki

Local organisers

Contact

Organisers can be reached via google group: iwclul@googlegroups.com. Local organisers should be contacted directly: jeremy.moss.bradley@univie.ac.at