Deliverable 4.3

First upload of language resources

Version No. 1.0

01/12/2011

 

An important aim of META-NORD is to upgrade and harmonize national language resources and tools in order to make them interoperable, within languages and across languages, with respect to their data formats and as far as possible also as regards their content.

A further central aim is the definition of standardized resource and tool metadata and mechanisms for making these metadata harvestable, so that distributed resources and tools can be effectively utilized in language technology applications, both in academic research and in industry. The META-SHARE metadata model has been used to describe the metadata. Based on this model and on the technical information about the data formats supported by the META-SHARE tool, the consortium has created META-SHARE XML for the first metadata upload. Since the XML format has been harmonized with the technical requirements provided by META-SHARE, we do not foresee any major problems importing the XML into META-SHARE once a stable version of the tool is released.
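As an illustration of the kind of check that can be run on a metadata record before import, the following is a minimal sketch in Python. It only tests that a record is well-formed XML and that a few elements are present and non-empty. The element names (`resourceInfo`, `resourceName`, `description`) and the sample record are assumptions made for the example; they are not taken from the META-SHARE schema, which defines its own structure.

```python
import xml.etree.ElementTree as ET

# Hypothetical element names, chosen for illustration only;
# the actual META-SHARE schema defines its own required elements.
REQUIRED_ELEMENTS = ("resourceName", "description")


def check_record(xml_text):
    """Return a list of problems found in one metadata record (empty if none)."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as err:
        return [f"not well-formed: {err}"]
    problems = []
    for name in REQUIRED_ELEMENTS:
        # Search the whole tree, ignoring any namespace prefix on the tag.
        found = any(
            el.tag.rsplit("}", 1)[-1] == name and (el.text or "").strip()
            for el in root.iter()
        )
        if not found:
            problems.append(f"missing or empty element: {name}")
    return problems


# Placeholder record, not an actual uploaded resource description.
sample = """<resourceInfo>
  <resourceName>Example resource</resourceName>
  <description>Placeholder description.</description>
</resourceInfo>"""

print(check_record(sample))
```

A check of this kind does not replace validation against the schema itself; it only catches records that would fail at the parsing stage or that lack basic descriptive fields.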

In total, the metadata of 67 resources has been described in XML format.

The table below lists the uploaded metadata by partner. It gives the name of each resource and the location of the XML schema, which is freely available to the public.

Eurotermbank

Data provider: Tilde

Metadata XML

Description of the resource

Data

 

Lithuanian-Latvian dictionary

Data provider: Tilde

Metadata XML

Description of the resource

Data

 

Latvian-Lithuanian dictionary

Data provider: Tilde

Metadata XML

Description of the resource

Data

Estonian-Latvian dictionary

Data provider: Tilde

Metadata XML

Description of the resource

Data

 

Latvian-English legislation corpus of Republic of Latvia

Data provider: Tilde

Metadata XML

Description of the resource

Data

 

Multilingual dictionary of person names

Data provider: Tilde

Metadata XML

Description of the resource

Data

 

Corpus of Latvian literature

Data provider: Tilde

Metadata XML

Description of the resource

Data

 

Danish wordnet, DanNet

Data provider: UCPH

Metadata XML

Description of the resource

Data

 

Copenhagen Dependency Treebank1

Data provider: UCPH
Metadata XML

Description of the resource

Data

 

Copenhagen Dependency Treebank2

Data provider: UCPH
Metadata XML

Description of the resource

Data

 

STO – LMF

Data provider: UCPH
Metadata XML

Description of the resource

Data

 

The Estonian Reference Corpus

Data provider: UT

Metadata XML

Description of the resource

Data

Estonian Treebank

Data provider: UT

Metadata XML

Description of the resource

Link to data

Estonian WordNet

Data provider: UT

Metadata XML

Description of the resource

Data

 

Corpora of morphologically disambiguated texts

Data provider: UT

Metadata XML

Description of the resource

Data

 

Corpora with shallow syntactic annotation

Data provider: UT

Metadata XML

Description of the resource

Data

 

English-Estonian and Estonian-English parallel corpus

Data provider: UT

Metadata XML

Description of the resource

Data

 

Semantically disambiguated corpus

Data provider: UT

Metadata XML

Description of the resource

Data

 

The database of Estonian verbal multi-word expressions

Data provider: UT

Metadata XML

Description of the resource

Data

 

Acoustic database for Danish

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Acoustic database for Norwegian

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Acoustic database for Swedish

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Lexical database for Danish

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Lexical database for Swedish

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Lexical database for Norwegian

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Norsk ordbank, Bokmål

Metadata provider: UIB

Metadata XML

Description of the resource

Data I, II

Oslo-Bergen tagger

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

SCARRIE lexicon

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Sofietrebanken

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Sofie parallel treebank

Data provider: UIB

Metadata XML

Description of the resource

Data

 

TRIS Spanish-German parallel corpus

Metadata provider: UIB

Metadata XML

Description of the resource

Data

 

Norsk ordbank, Nynorsk

Metadata provider: UIB

Metadata XML

Description of the resource

Data I, II

 

Written corpora of old literary Finnish (Vanha kirjasuomi)

Data provider: UHEL

Metadata XML

Description of the resource

Data

 

Corpus of early modern Finnish (Varhaisnykysuomen korpus)

Data provider: UHEL

Metadata XML

Description of the resource

Data

 

Finnish literature classics (Suomalaisen kirjallisuuden klassikoita)

Data provider: UHEL

Metadata XML

Description of the resource

Data

Up-to-date word list of modern Finnish (Ajantasainen nykysuomen sanalista)

Data provider: UHEL

Metadata XML

Description of the resource

Data

Frequency list of words in written Finnish (Kirjoitetun suomen kielen sanojen taajuuslista)

Data provider: UHEL

Metadata XML

Description of the resource

Data

Finnish wordnet

Data provider: UHEL

Metadata XML

Description of the resource

Data

 

Finnish TreeBank

Data provider: UHEL

Metadata XML

Description of the resource

Data

 

Icelandic Parsed Historical Corpus

Data provider: HI

Metadata XML

Description of the resource

Data

Icelandic Frequency Dictionary Corpus
UP10 7.2.1

Data provider: HI

Metadata XML

Description of the resource

Link to data


UP10 7.2.2

Data provider: HI

Metadata XML

Description of the resource

Data

 

Parliament Speech Corpus

Data provider: HI

Metadata XML

Description of the resource

Data

Hjal Speech Corpus

Data provider: HI

Metadata XML

Description of the resource

Data

 

Pronunciation Dictionary for Icelandic

Data provider: HI

Metadata XML

Description of the resource

Data

 

The Saga Corpus

Data provider: HI

Metadata XML

Description of the resource

Data

 

Modern Lithuanian Dictionary

Data provider: LKI

Metadata XML

Description of the resource

Data

 

Database of Neologisms

Data provider: LKI

Metadata XML

Description of the resource

Data

 

Database of Lithuanian Historical Ethnic Place Names

Data provider: LKI

Metadata XML

Description of the resource

Data

 

Geoinformational Database of Lithuanian Toponyms

Data provider: LKI

Metadata XML

Description of the resource

Data

 

The Dictionary of Lithuanian

Data provider: LKI

Metadata XML

Description of the resource

Data

 

Swedish Wikipedia Corpus

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Loan Word Typology list

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Semantic Information for Multifunctional Plurilingual Lexica

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Preparatory Action for Linguistic Resources Organization for Language Engineering

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Swesaurus

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Swedish FrameNet

Data provider: UGOT

Metadata XML

Description of the resource

Data

Examples from the Swedish Associative Thesaurus (SALDO)

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Swedish Associative Thesaurus' morphology

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Old Swedish morphology

Data provider: UGOT

Metadata XML

Description of the resource

Data

A diachronic computational lexical resource for 800 years of Swedish

Söderwall's Dictionary of old Swedish Supplement
Data provider: UGOT

Metadata XML

Description of the resource

Data

A diachronic computational lexical resource for 800 years of Swedish

 

Söderwall's Dictionary of old Swedish
Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Schlyter's Dictionary of old Swedish
Data provider: UGOT

Metadata XML

Description of the resource

A diachronic computational lexical resource for 800 years of Swedish

Data

 

Dalin's dictionary morphology
Data provider: UGOT

Metadata XML

Description of the resource

A diachronic computational lexical resource for 800 years of Swedish

Data

 

Dalin's dictionary
Data provider: UGOT

Metadata XML

Description of the resource

Data

Swedish Associative Thesaurus
Data provider: UGOT

Metadata XML

Description of the resource

Data

 

Keywords for Language Learning for Young and adults alike

Data provider: UGOT

Metadata XML

Description of the resource

Data

 

SALDO morphology

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II


SALDO examples

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II

Parole lexicon

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II

 

Dalin

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II

 

Söderwall

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II

 

Swesaurus

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II

Data III

 

Swedish LWT list

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II

 

Swedish Kelly list

Data provider: UGOT

Description of the resource

Metadata XML

Data

 

Simple

Data provider: UGOT

Description of the resource

Metadata XML

Data I

Data II

