Parallel texts from the Swedish National Food Agency

SND-ID: ext0336-1.

Is part of collection at SND: Parallel Texts from Public Agencies

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Research principal

Institute for Language and Folklore - Language Council of Sweden rorId

Description

Parallel texts downloaded from the agency's website.

Parallel texts downloaded from the website of the Swedish National Food Agency.
The files that were downloaded were all pdf files. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.
Method and outcome

Sampling procedure

Multilingual parallel material.

Time period(s) investigated

2017-01-01 – 2017-01-31

Data format / data structure

Data collection
  • Mode of collection: Self-administered questionnaire: web based
  • Time period(s) for data collection: 2017-01-01 – 2017-01-31
Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Multilingual
  • Language

    • Swedish (swe)

      Texts: 8

    • English (eng)

      Texts: 8

    • Spanish (spa)

      Texts: 4

    • French (fra)

      Texts: 3

    • Polish (pol)

      Texts: 2

    • Finnish (fin)

      Texts: 1

    More..
  • Modality

    Written Language
  • Size

    Words: 60496 (TOT)

    Texts: 26 (TOT)

    Words: 18897 (swe)

    Texts: 8 (swe)

  • Original source

    livsmedelsverket
    www.livsmedelsverket.se
Geographic coverage

Geographic spread

Geographic location: Sweden

Administrative information

Responsible department/unit

Language Council of Sweden

Contributor(s)

Institute for Language and Folklore, Language Council of Sweden

Topic and keywords

Research area

Public health, global health, social medicine and epidemiology (Standard för svensk indelning av forskningsämnen 2011)

Nutrition and dietetics (Standard för svensk indelning av forskningsämnen 2011)

Languages and literature (Standard för svensk indelning av forskningsämnen 2011)

General health and well-being (CESSDA Topic Classification)

Agriculture and rural industry (CESSDA Topic Classification)

Public health (CESSDA Topic Classification)

Publications

Contact for questions about the data

This resource has the following relations

Related research data in SND's catalogue

Is part of collection at SND

CLARIN Virtual Collection Registry

Add to collection

A virtual collection is connected to a specific research purpose and contains links to data resources from various digital archives. It is easy to create, access, and cite the collection.

Read more about virtual collections on the CLARIN website.