LINGUIST List 18.1667

Thu May 31 2007

Software: ELRA Language Resources Catalogue Update 05/07

Editor for this issue: Hannah Morales <hannahlinguistlist.org>


Directory         1.    Helene Mazo, ELRA Language Resources Catalogue Update 05/07


Message 1: ELRA Language Resources Catalogue Update 05/07
Date: 31-May-2007
From: Helene Mazo <mazoelda.org>
Subject: ELRA Language Resources Catalogue Update 05/07


ELRA is happy to announce that 2 new Speech Resources, 1 Written Corpus and1 Monolingual Lexicon are now available in its catalogue.

ELRA-S0238 MIST Multi-lingual Interoperability in Speech Technology database

The MIST Multi-lingual Interoperability in Speech Technology databasecomprises the recordings of 74 native Dutch speakers (52 males, 22 females)who uttered 10 sentences in Dutch, English, French and German, including 5sentences per language identical for all speakers and 5 sentences perlanguage per speaker unique. Dutch sentences are orthographically annotated.

For more information, see:http://catalog.elra.info/product_info.php?products_id=988&language=en

ELRA-S0239 N4 (NATO Native and Non Native) database

The (NATO Native and Non Native) database comprises speech data recorded inthe naval transmission training centers of four countries (Germany, TheNetherlands, United Kingdom, and Canada) during naval communicationtraining sessions in 2000-2002. The material consists of native andnon-native speakers using NATO Naval English procedure between ships, andreading from a text, 'The North Wind and the Sun,' in both English and thespeaker's native language. The audio material was recorded on DAT anddownsampled to 16kHz-16bit, and all the audio files have been manuallytranscribed and annotated with speakers identities using the tool, Transcriber.

For more information, see:http://catalog.elra.info/product_info.php?products_id=989&language=en

ELRA-W0047 Catalan Corpus of News Articles

The Catalan Corpus of News Articles comprises articles in Catalan from 1January 1999 to 31 March 2007. These articles are grouped per trimesterwithout chronological order inside.

For more information, see:http://catalog.elra.info/product_info.php?products_id=990&language=en

ELRA-L0075 Bulgarian Linguistic Database

This database contains 81,647 entries in Bulgarian with a linguisticenvironment tool (for WINDOWS XP). The data may be used for morphologicalanalysis and synthesis, syntactic agreement checking, phonetic stressdetermining.

For more information, see:http://catalog.elra.info/product_info.php?products_id=987&language=en

For more information on the catalogue, please contact Valérie Mapellimailto:mapellielda.org

Our on-line catalogue has moved to the following address:http://catalog.elra.info. Please update your bookmarks.

Linguistic Field(s): Computational Linguistics
Subject Language(s): Bulgarian (bul)
                            Catalan-Valencian-Balear (cat)                             Dutch (nld)                             English (eng)                             French (fra)                             Plautdietsch (pdt)