Sixth International Workshop on Computational Linguistics of Uralic Languages

IWCLUL 2020: Sixth International Workshop on Computational Linguistics of Uralic

The Sixth International Workshop on Computational Linguistics for Uralic Languages was organised by ACL SIGUR and the Department of Finno-Ugrian Studies at the University of Vienna. It was held on 10-11 January 2020 in Vienna, Austria.

Proceedings

More information to follow.

Programme

10 January (Friday): Workshop
9:00	Registration
10:00	Opening
10:15	Timofey Arkhangelskiy: Digital communication corpora for Volga-Kama Uralic languages - what can we learn from them (invited talk)
11:00	Coffee break
11:30	Rogier Blokland, Niko Partanen and Michael Rießler: A pseudonymization method for language documentation corpora
12:00	Nick Howell, Maria Bibaeva and Francis M. Tyers: Effort-value payoff in lemmatisation for Uralic languages
12:30	Jack Rueter, Niko Partanen and Larisa Ponomareva: On the questions in developing computational infrastructure for Komi-Permyak
13:00	Lunch
14:00	Khalid Alnajjar, Mika Hämäläinen and Jack Rueter: On Editing Dictionaries for Uralic Languages in an Online Environment
14:30	Nils Hjortnaes, Niko Partanen, Michael Rießler and Francis M. Tyers: Towards a Speech Recognizer for Komi, an Endangered and Low-Resource Uralic Language
15:00	Coffee break
15:30	László Fejes: Hunting for antiharmonic stems in Erzya
16:00	Gábor Bobály, Csilla Horváth and Veronika Vincze: apPILcation: an Android-based Tool for Learning Mansi
16:30	Closing
17:00	SIGUR business meeting
20:00	Dinner at Pizzeria Scarabocchio (Florianigasse 3, 1080 Wien)

11 January (Saturday): Tutorials
10:00	Timofey Arkhangelskiy: Corpora of Uralic Volga-Kama Languages
11:30	Coffee break
11:45	Niko Partanen: Optical Character Recognition
13:15	Lunch
14:30	Mika Hämäläinen, Khalid Alnajjar: Uralic NLP package
16:00	Closing

Venue

Abteilung Finno-Ugristik
Universität Wien
Campus AAKH, Hof 7-2
Spitalgasse 2-4
1090 Wien
AUSTRIA

Geographical coordinates: 48.217450, 16.353900

Conference location on Open Street Map

Registration

To register for the workshop please fill out the registration form. Please note that there is a 10 € participation fee (to be paid in cash at arrival) to cover running costs of the conference.

Call for papers

The purpose of the conference series International Workshop on Computational Linguistics for Uralic Languages is to bring together researchers working on computational approaches to working with these languages. We accept long and short papers as well as tutorial proposals working on the following languages: Finnish, Hungarian, Estonian, Võro, the Sámi languages, Komi (Zyrian, Permyak), Mordvin (Erzya, Moksha), Mari (Hill, Meadow), Udmurt, Nenets (Tundra, Forest), Enets, Nganasan, Selkup, Mansi, Khanty, Veps, Karelian (Olonets), Karelian, Ingrian (Izhorian), Votic, Livonian, Ludic, and other related languages. All Uralic languages exhibit rich morphological structure, which makes processing them challenging for state-of-the-art computational linguistic approaches, the majority also suffer from a lack of resources and many are endangered. Research papers should be original, substantial and unpublished research, that can describe work-in-progress systems, frameworks, standards and evaluation schemes. Demos and tutorials will present systems and standards towards the goal of interoperability and unification of different projects, applications and research groups Appropriate topics include (but are not limited to):

Parsers, analysers and processing pipelines of Uralic languages
Lexical databases, electronic dictionaries
Finished end-user applications aimed at Uralic languages, such as spelling or grammar checkers, machine translation or speech processing
Evaluation methods and gold standards, tagged corpora, treebanks
Reports on language-independent or unsupervised methods as applied to Uralic languages
Surveys and review articles on subjects related to computational linguistics for one or more Uralic languages
Any work that aims at combining efforts and reducing duplication of work
How to elicit activity from the language community, agitation campaigns, games with a purpose

To maximise the possibility of reproducibility, replication and reuse, we particularly encourage submissions which present free/open-source language resources and make use of free/open-source software. One of the aims of this gathering is to avoid unnecessary duplicated work in field of Uralistics by establishing connections and interoperability standards between researchers and research groups working at different sites. We have also identified a serious lack of gold standards and evaluation metrics for all Uralic languages including those with national support, any work towards better resources in these fields will be greatly appreciated.

Important dates

August 21, 2019: Call for papers announced
November 18 November 25, 2019: Paper submission deadline
December 6, 2019: Paper notification
December 21 December 31, 2019: Camera-ready deadline
December 31, 2019: Fill in the registration form
January 10-11, 2020: Workshop held in Vienna

Submission of papers

Language of submission: Submissions should be made in English or Russian with an obligatory abstract in at least one of the Uralic Language(s).

Submission format:

Papers should be up to 8 pages in length excluding references.
LaTeX submissions are preferred, but submissions using the Microsoft Word template are possible as well. The first draft has to be submitted as a PDF
Our templates are based on the ACL templates; author instructions can be found on the ACL page. The LaTeX templates can be found here, the MS Word template here.
Citations in LaTeX submissions should be managed with bibtex.
Linguistic glosses should follow Leipzig glossing rules; LaTeX submissions should and use the expex package (make sure to update expex regularly as it is developed actively).
The preferred LaTeX version is XeLaTeX; you should use UTF-8 encoded Unicode in your sources rather than TeX encoded characters where possible.

Submissions can be made here using the EasyChair conference management system.

Publication venue: TBA

Conflicts of interest: The reviewing process will be anonymous (double-blind peer review).

Double submission: To maximise the impact of work in the field of computational linguistics for the Uralic languages we are open to the possibility of double submission, or submission of work which has been partially published elsewhere. Any double submission should however be reported to the programme committee at the time of submission. In the advent of double acceptance the authors should choose in which venue to publish.

Programme committee

Tommi A. Pirinen, University of Hamburg
Francis M. Tyers, Indiana University Bloomington
Jeremy Bradley, University of Vienna
Eszter Simon, Research Institute for Linguistics, Hungarian Academy of Sciences
Kadri Muischnek, University of Tartu
Michael Rießler, University of Eastern Finland
Filip Ginter, University of Turku
Timofey Arkhangelskiy, University of Hamburg / Alexander von Humboldt Foundation
Veronika Vincze, Research Group on Articial Intelligence, Hungarian Academy of Sciences
Roman Yangarber, University of Helsinki
Miikka Silfverberg, University of Helsinki

Local organisers

Jeremy Bradley (jeremy.moss.bradley@univie.ac.at)

Contact

Organisers can be reached via google group: iwclul@googlegroups.com. Local organisers should be contacted directly: jeremy.moss.bradley@univie.ac.at