LT-based e-HSS in Sweden – taking stock and looking ahead
Time and venue
The workshop will be organized in conjunction with SLTC 2014, the
Fifth Swedish Language Technology Conference, in Uppsala on the morning of the 13th of November 2014.
Room: Gunnar Johansson, 14:K120
For registration, please visit SLTC 2014 registration.
09.00-09.10 : Welcome and introduction
09.10-10.10 : Four paper presentations á 15 min (10 min presentation + 5 min questions)
10.10-10.40 : Coffee break + poster session
10.40-11.10 : Poster session continues.
11.10-11.25 : Paper presentation 15 min.
11.25-11.55 : Invited speaker: Krister Lindén - FIN-CLARIN
11.55-12.00 : Closing
Paper session 1 (9.10-10.10):
- Eva Pettersson, Beata Megyesi and Joakim Nivre: Verb Phrase Extraction in a Historical Context
- Lars Ahrenberg: Towards a Research Infrastructure for Translation Studies
- Maria Ågren and Jonas Lindström: ‘Peering into darkness’: the uses and usefulness of language technology to the Gender & Work project
- Mats Malm: Making all Swedish out-of copyright literary books available for mining: a digitization project in development
Paper session 2 (11.10-11.25):
- Jens Edlund, David House and Joakim Gustafson: Spoken language, speech technology & HSS in the 21st century
CLARIN (Common Language Resources and Technology Infrastructure;
www.clarin.eu) was established by the European Commission as
an ERIC (European Research Infrastructure Consortium) in 2012. In
2013, Vetenskapsrådet (the Swedish Research Council) approved a
proposal for Swedish membership in CLARIN, including a Swedish
national CLARIN organization. The process of setting up SWE-CLARIN is
ongoing, and it is likely that Sweden will be able to join the CLARIN
ERIC during 2014.
CLARIN aims at making language-based material available as primary
research data to the humanities and social sciences (HSS) research
communities with the help of the sophisticated language and speech
processing tools and language resources (LRs) that have been developed
over many years through research in language technology (LT), and
taking advantage of the fact that increasing amounts of text and
speech material – including historical material – are available in
digital form, thus allowing for the utilization of unprecedented
volumes of text and speech data in HSS research. The expectation is
that this LT-based e-HSS paradigm will lead to completely new kinds of
research as well as to new ways of addressing old research questions.
Purpose and audience
The purpose of this first national Swedish CLARIN workshop is to take
stock of ongoing activities falling within the remit of SWE-CLARIN, as
well as to look ahead, by formulating needs to be filled and research
avenues to be explored over the upcoming years.
The workshop is intended as a forum where LT and HSS researchers can
meet to exchange experiences and questions, both those who are
already pursuing work in LT-based e-HSS and those who are interested
in what this paradigm has to offer.
Call for papers
We invite contributions for either 10-minute oral presentations or
poster presentations. Posters will typically describe ongoing or
completed work, while oral presentations will be expected to describe
needed or planned work. Both kinds of submissions should be in the
form of an extended abstract. Accepted abstracts will be published on the workshop
website prior to the workshop.
In line with the exploratory nature of this first national SWE-CLARIN
workshop, we welcome submissions describing completed, ongoing, and
planned e-HSS work – crucially involving the use of LT and LRs
– on (but not limited to):
- specific e-HSS projects
- digitization efforts involving intangible cultural heritage
- concrete efforts as well as general methodology development aiming
at adaptation of LT and LRs to, e.g., historical or non-standard
language varieties and genres (e.g., social media)
- multilingual aspects of LT-based e-HSS
- efforts to create workflows and effective user interfaces
- research questions relevant to the LT-based e-HSS paradigm
- how to reconcile large-scale quantitative and "close-reading"
qualitative research methods
Submissions should adhere to the format of NoDaLiDa 2013 (please ignore the abstract section) and consist of 2-5 pages (800-2000 words). Submissions need not be anonymous. Preferences regarding presentation format (oral/poster) may be indicated, but the program committee will decide which format best fits the overall programme.
Submissions will be handled through EasyChair.
- First call for papers: 19th June 2014
- Second call for papers/Submission opens: 4th August 2014
- Deadline for submissions: 22th September 2014
- Notification of acceptance: 6th October 2014
- Workshop: 13th November 2014, 9–12am
- Johanna Berg, Digisam
- Lars Borin, Språkbanken, University of Gothenburg
- Rickard Domeij, the Swedish Language Council
- Marianne Gullberg, Lund University
- David House, KTH
- Hans Jørgen Marker, SND
- Magnus Merkel, Linköping University
- Joakim Nivre, Uppsala University
- Mats Wirén, Stockholm University