LINGUIST List 20.2008

Thu May 28 2009

Software: ELRA - Language Resources Catalogue - Update

Editor for this issue: Fatemeh Abdollahi

        1.    Helene Mazo, ELRA - Language Resources Catalogue - Update

Message 1: ELRA - Language Resources Catalogue - Update
Date: 28-May-2009
From: Helene Mazo
Subject: ELRA - Language Resources Catalogue - Update

ELRA is happy to announce that 3 new WordNets and 1 new Sign Language database are now available in its catalogue:

ELRA-M0048 LatinWordNet
LatinWordNet contains information about the following aspects of the Latin and English lexicon: lexical relations between words, semantic relations between lexical concepts, correspondences between Latin and English lexical concepts. LatinWordNet covers nouns, verbs, adjectives and adverbs, and contains 8,978 synsets in correspondence with the English equivalents (and with all the MultiWordNet-based wordnets).
For more information, see:

ELRA-M0049 Basque WordNet
The Basque WordNet models nouns, verbs and adjectives. Each sense is linked to a so-called synset (for a total of 30,281 synsets). Every synset encodes the synonymy relation between (possibly) several words (synonyms) that belong to one and the same part of speech (specified in the POS tag value) and express the same lexical meaning. Each synset is related to the corresponding synset in the English WordNet 1.6 via its identification number (ID), which includes the synset number and the POS tag. The only exceptions are newly created synsets to account for cultural concepts not present in WordNet 1.6.
For more information, see:

ELRA-M0050 The MWN.PT - MultiWordnet of Portuguese
MWN.PT - MultiWordnet of Portuguese (version 1) spans over 17,200 manually validated concepts/synsets, linked under the semantic relations of hyponymy and hypernymy. These concepts are made of over 21,000 word senses/word forms and 16,000 lemmas from both European and American variants of Portuguese. They are aligned with the translationally equivalent concepts of the English Princeton WordNet and, transitively, of the MultiWordNets of Italian, Spanish, Hebrew, Romanian and Latin.
For more information, see:

ELRA-S0300 SIGNUM Database
The SIGNUM Database contains both isolated and continuous utterances of various signers. The corpus was recorded on video. For quick random access to individual frames, each video clip is stored as a sequence of images. The vocabulary comprises 450 basic signs in German Sign Language (DGS) representing different word types. Based on this vocabulary, a total of 780 sentences were constructed. Each sentence ranges from two to eleven signs in length. The entire corpus was performed once by 25 native signers of different sexes and ages. One of them was chosen to be the so-called reference signer. His performances were recorded three times.
For more information, see:

For more information on the catalogue, please contact Valérie

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates:

Linguistic Field(s): Computational Linguistics
                     Semantics
                     Text/Corpus Linguistics
Subject Language(s): Basque (eus)
                     English (eng)
                     German Sign Language (gsg)
                     Latin (lat)
                     Portuguese (por)