2017-12-06 12:00 to 2017-12-08 15:30

Workshop on Interoperability of Second Language Resources and Tools

Venue: University of Gothenburg, Sweden.

Address: Lennart Torstenssongata 8, Gothenburg (exact room and map coming later)

Dates: December 6-8, 2017 (from 12.00 on December 6 until 15.40 on December 8)

Thank you all for a wonderful workshop!

Read the blog post about the workshop.

Enjoy some workshop pictures!

Koenraad De Smedt outlines CLARIN needs for L2 resources

Audience, day 2

Discussion table on Metadata interoperability

Discussion table on Error annotation questions

Final coffee break, with a Christmas cake!


The workshop aims at building a network of researchers, developers and users dealing with second language resources (e.g. corpora) and tools (e.g. error annotation tools) within the CLARIN context. We focus on the need for interoperability of L2 resources (written second language learner corpora) and associated tools, since comparable corpora and annotation will, ideally, enable comparative research across languages and national borders. First, however, we need to identify the interested parties, relevant projects, corpora and tools. We therefore welcome presentations and demos by workshop participants that cover existing resources, tools and approaches. Please fill in the title and abstract of the proposed presentation/demo here.

During the two and a half days we will discuss critical issues affecting quality and interoperability, such as L2 corpus composition, metadata, error coding and exploratory tools, and will attempt to establish common ground to support ongoing and future projects that serve researchers throughout CLARIN.

On the last day, we will provide time (from 15.40 onwards) for internal meetings and discussions.



Book of abstracts

December 6, 12:15-17:30. Room K332, Lennart Torstenssongata 6.

30 min per presentation (in general)

Time Program point
12.15 - 13.10 Lunch at Restaurant Näckrosen in the main university building, Renströmsgatan 6. [Map from Korsvägen] [Description from Korsvägen]
13.10 - 13.15 Elena Volodina Opening Slides
13.15 - 13.30 Koenraad De Smedt Introduction: Goals and expected outcomes
13.30 - 15.00 Session 1. Existing corpora / Corpora under construction
  Mats Wirén SweLL - an upcoming infrastructure for Swedish as a Second Language Abstract Slides
  Jūratė Ruzaitė Development of a Lithuanian learner corpus: A work in progress report Abstract Slides
  Ari Huhta Towards a learner corpus of English and Swedish in Finland  Abstract Slides
15.00 - 15.30 Coffee break
15.30 - 17.30 Session 1, cont. Existing corpora / Corpora under construction
  Nives Mikelic Preradovic CroLTeC - Croatian Learner Text Corpus Abstract Slides
  Egon Stemle Learner Corpus Infrastructure (LCI) @ Eurac Research Abstract Slides
  Elżbieta Kaczmarska Towards a learner corpus of Czech for Polish speaking students Abstract Slides
  Inga Znotiņa Learner corpus of the second Baltic language: annotation and data comparability Abstract Slides
18.00 - ... Dinner at Quality Panorama Hotel


December 7, 09:00-17:00. Pedagogen, building A, Room AK 2134/2135.

[Map Panorama-Pedagogen/walk] [Map Panorama-Pedagogen/public transport] [Map of Pedagogen campus]  [Map inside the building A]

On Thursday, December 7, we will be in another building, Pedagogen, which is part of the University of Gothenburg and located in the centre of town. Walking time is about 30 minutes. You can use Google Maps to find public transport routes, which could shorten the trip by a few minutes. We will be in the main building, A, in the basement, rooms AK2134 and AK2135.

30 min per presentation (in general)

Time Program point
09.00 - 10.30 Session 2. Metadata and ethics
Keynote talk Sylviane Granger Towards standardization of metadata for L2 corpora (1 hour) Abstract Slides
  Silje Ragnhildstveit Metadata in ASK Abstract Slides_Silje Slides_Paul
10.30 - 11.00 Coffee break
11.00 - 11.30 Session 2, cont. Metadata and ethics
  Elena Volodina Legal issues in learner essay collection Abstract Slides
11.30 - 12.30 Session 3. Error annotation
  Iria del Rio Error annotation in the COPLE2 corpus Abstract Slides
  Julia Prentice Error taxonomy and other considerations in the SweLL project Abstract Slides
12.30 - 14.00 Lunch (long lunch: take the chance for a walk in the city centre!)
14.00 - 15.30 Session 4. Tools and software
  Maarten Janssen TEITOK - using an XML based framework for learner corpora Abstract
  Dan Rosén The SweLL normalization editor for learner texts Abstract Slides More...
  Nadezda Okinina Transc&Anno Abstract Slides
15.30 - 16.00 Coffee break
16.00 - 17.00 Session 4, cont. Tools and software
  Adriane Boyd MERLIN: Lessons Learned Abstract Slides
  Andreas Nolda Towards a German–Hungarian learner corpus: Standards and software for annotating learner data in ‘Dulko’ Abstract Slides
19.15 - ... Dinner at Berzelius bar. Address: Södra Vägen 20, 412 54 Göteborg


December 8, 09:00-16:00. Room T307, Olof Wijksgatan 6.

30 min per presentation (in general)

Time Program point
09.00 - 10.00 Session 5. Happy user
  Alexandr Rosen Trying make a learner corpus user happy: from annotation to search tools Abstract Slides
  Paul Meurer, Silje Ragnhildstveit, Kari Tenfjord The importance of a user-friendly interface Abstract
10.00 - 10.30 Coffee break
10.30 - 11.00 Session 5 cont. Happy user
  Therese Lindström Tiedemann Case study: Studying the Swedish passive Abstract Slides
11.00 - 12.00 Session 6. Developments on top of L2 corpora
  Simon Smith SkELL for Chinese: a corpus-based Chinese learning platform Abstract
  David Alfter SweLLex + - productive L2 learner vocabulary and more Abstract Slides
12.00 - 13.00 Lunch
13.00 - 14.30 Nives Mikelic Preradovic World café (discussion session)
14.30 - 15.00 Coffee break
15.00 - 15.30 World café (discussion session) Summary
15.30 - 15.40 Maarten Janssen Listing L2 resources (a practical issue)
15.40 - 15.50 Closing
18.00 - ... Dinner at Moon Thai Kitchen. Address: Kristinelundsgatan 9



Belgium : Sylviane Granger

Croatia : Nives Mikelic Preradovic

Czech Republic : Alexandr Rosen

Estonia : Jane Klavan

Finland : Ari Huhta, Therese Lindström Tiedemann

Germany : Adriane Boyd, Lorenz Eliane

Hungary : Andreas Nolda

Italy : Egon Stemle, Nadezda Okinina

Latvia : Inga Znotiņa, Ilze Auziņa

Lithuania : Jūratė Ruzaitė

Norway : Silje Ragnhildstveit, Paul Meurer, Kari Tenfjord, Koenraad De Smedt

Poland : Elżbieta Kaczmarska

Portugal : Maarten Janssen, Iria Del Río

Sweden : Mats Wirén, Vi Thanh Son, Elena Volodina, Dan Rosén, David Alfter, Julia Prentice

United Kingdom : Simon Smith


Registration & funding

2017-10-26, 12:20. All 25 CLARIN-EU-funded places are now filled. You are welcome to register, but you will have to find financing from your own sources.

The funding can cover travel expenses, accommodation and food for a maximum of three days (6, 7, 8 December) for a maximum of 25 people; local participants from Sweden are not included in that count. We will provide the funding on a first-come, first-served basis*. To register, please fill in the form.

If you are not among the first 25 registered participants but still wish to attend, please contact your national CLARIN coordinator about the possibility of national funding.


* If there are many applicants, we may consider limiting CLARIN-EU-financed participation to two representatives per country (to be decided).



Elena Volodina, University of Gothenburg, Sweden (Swe-CLARIN) 

Kari Tenfjord, University of Bergen, Norway (CLARINO)

Therese Lindström Tiedemann, University of Helsinki, Finland (Fin-CLARIN)

Nives Mikelić Preradović, University of Zagreb, Croatia

Maarten Janssen, University of Coimbra, Portugal

Silje Ragnhildstveit, Western Norway University of Applied Sciences, Norway (CLARINO)


Contact information

For questions, please contact elena . volodina @ svenska . gu . se

For reimbursement questions, please contact svenska . exp @ svenska . gu . se (Bodil Wikhall)