
Gothenburg University Publications

The open lexical infrastructure of Språkbanken

Authors and department:
Lars Borin (Department of Swedish); Markus Forsberg (Department of Swedish); Leif-Jöran Olsson (Department of Swedish); Jonatan Uppström (Department of Swedish)
Published in:
Proceedings of the 8th International Conference on Language Resources and Evaluation : May 23-25, 2012 / eds. Nicoletta Calzolari, pp. 3598-3602
Conference paper, peer-reviewed
Abstract:
We present our ongoing work on Karp, Språkbanken’s (the Swedish Language Bank) open lexical infrastructure, which has two main functions: (1) to support the work on creating, curating, and integrating our various lexical resources; and (2) to publish daily versions of the resources, making them searchable and downloadable. An important requirement of the lexical infrastructure is also that we maintain a strong bidirectional connection to our corpus infrastructure. At the heart of the infrastructure is the SweFN++ project, with the goal of creating free Swedish lexical resources geared towards language technology applications. The infrastructure currently hosts 15 Swedish lexical resources, including historical ones, some of which have been created from scratch using existing free resources, both external and in-house. The resources are integrated through links to a pivot lexical resource, SALDO, a large morphological and lexical-semantic resource for modern Swedish. SALDO has been selected as the pivot partly because of its size and quality, but also because its form and sense units have been assigned persistent identifiers (PIDs) to which the lexical information in other lexical resources and in corpora is linked.
Subject (based on the Swedish National Agency for Higher Education's classification of research subjects):
Computer and Information Science ->
Language Technology (Computational Linguistics)
Languages and Literature ->
lexicon, infrastructure, Swedish language resources
Additional information:
International Conference on Language Resources and Evaluation ; 8 (Istanbul, Turkey) : 2012.05.23-25 LREC 2012 ; 8 (Istanbul, Turkey) : 2012.05.23-25 Conference website:
Record number:
Record created:
2012-03-21 11:31
Record modified:
2013-12-12 14:48
