LT-based e-HSS in Sweden – taking stock and looking ahead
Time and venue
The workshop will be organized in conjunction with SLTC 2014, the Fifth Swedish Language Technology Conference, in Uppsala on the morning of the 13th of November 2014. Room: Gunnar Johansson, 14:K120 For registration, please visit SLTC 2014 registration.
Schedule
09.00-09.10 : Welcome and introduction 09.10-10.10 : Four paper presentations á 15 min (10 min presentation + 5 min questions) 10.10-10.40 : Coffee break + poster session 10.40-11.10 : Poster session continues. 11.10-11.25 : Paper presentation 15 min. 11.25-11.55 : Invited speaker: Krister Lindén - FIN-CLARIN 11.55-12.00 : Closing
Paper session 1 (9.10-10.10):
- Eva Pettersson, Beata Megyesi and Joakim Nivre: Verb Phrase Extraction in a Historical Context
- Lars Ahrenberg: Towards a Research Infrastructure for Translation Studies
- Maria Ågren and Jonas Lindström: ‘Peering into darkness’: the uses and usefulness of language technology to the Gender & Work project
- Mats Malm: Making all Swedish out-of copyright literary books available for mining: a digitization project in development
Paper session 2 (11.10-11.25):
- Jens Edlund, David House and Joakim Gustafson: Spoken language, speech technology & HSS in the 21st century
Background
CLARIN (Common Language Resources and Technology Infrastructure; www.clarin.eu) was established by the European Commission as an ERIC (European Research Infrastructure Consortium) in 2012. In 2013, Vetenskapsrådet (the Swedish Research Council) approved a proposal for Swedish membership in CLARIN, including a Swedish national CLARIN organization. The process of setting up SWE-CLARIN is ongoing, and it is likely that Sweden will be able to join the CLARIN ERIC during 2014.
CLARIN aims at making language-based material available as primary research data to the humanities and social sciences (HSS) research communities with the help of the sophisticated language and speech processing tools and language resources (LRs) that have been developed over many years through research in language technology (LT), and taking advantage of the fact that increasing amounts of text and speech material – including historical material – are available in digital form, thus allowing for the utilization of unprecedented volumes of text and speech data in HSS research. The expectation is that this LT-based e-HSS paradigm will lead to completely new kinds of research as well as to new ways of addressing old research questions.
Purpose and audience
The purpose of this first national Swedish CLARIN workshop is to take stock of ongoing activities falling within the remit of SWE-CLARIN, as well as to look ahead, by formulating needs to be filled and research avenues to be explored over the upcoming years.
The workshop is intended as a forum where LT and HSS researchers can meet to exchange experiences and questions, both those who are already pursuing work in LT-based e-HSS and those who are interested in what this paradigm has to offer.
Call for papers
We invite contributions for either 10-minute oral presentations or poster presentations. Posters will typically describe ongoing or completed work, while oral presentations will be expected to describe needed or planned work. Both kinds of submissions should be in the form of an extended abstract. Accepted abstracts will be published on the workshop website prior to the workshop.
In line with the exploratory nature of this first national SWE-CLARIN workshop, we welcome submissions describing completed, ongoing, and planned e-HSS work – crucially involving the use of LT and LRs – on (but not limited to):
- specific e-HSS projects
- digitization efforts involving intangible cultural heritage
- concrete efforts as well as general methodology development aiming at adaptation of LT and LRs to, e.g., historical or non-standard language varieties and genres (e.g., social media)
- multilingual aspects of LT-based e-HSS
- efforts to create workflows and effective user interfaces
- research questions relevant to the LT-based e-HSS paradigm
- how to reconcile large-scale quantitative and "close-reading" qualitative research methods
Submissions should adhere to the format of NoDaLiDa 2013 (please ignore the abstract section) and consist of 2-5 pages (800-2000 words). Submissions need not be anonymous. Preferences regarding presentation format (oral/poster) may be indicated, but the program committee will decide which format best fits the overall programme.
Submissions will be handled through EasyChair.
https://easychair.org/conferences/?conf=sweclarinws2014
Important dates
- First call for papers: 19th June 2014
- Second call for papers/Submission opens: 4th August 2014
- Deadline for submissions: 22th September 2014
- Notification of acceptance: 6th October 2014
- Workshop: 13th November 2014, 9–12am
Workshop organizers
- Johanna Berg, Digisam
- Lars Borin, Språkbanken, University of Gothenburg
- Rickard Domeij, the Swedish Language Council
- Marianne Gullberg, Lund University
- David House, KTH
- Hans Jørgen Marker, SND
- Magnus Merkel, Linköping University
- Joakim Nivre, Uppsala University
- Mats Wirén, Stockholm University