LINGUIST List 18.918

Tue Mar 27 2007

Software: ELRA Language Resources Catalogue Update 03/07

Editor for this issue: Hannah Morales <hannahlinguistlist.org>


Directory
1. Helene Mazo, ELRA Language Resources Catalogue Update 03/07


Message 1: ELRA Language Resources Catalogue Update 03/07
Date: 20-Mar-2007
From: Helene Mazo <mazoelda.org>
Subject: ELRA Language Resources Catalogue Update 03/07


ELRA is happy to announce that 3 new Speech Related Resources are now available in its catalogue. Moreover, we are pleased to announce that years 2005 and 2006 from the Text Corpus of 'Le Monde' (ELRA-W0015) are now available.

ELRA-S0235 LC-STAR Hebrew (Israel) phonetic lexicon
The LC-STAR Hebrew (Israel) phonetic lexicon comprises 109,580 words, including a set of 62,431 common words, a set of 47,149 proper names (including person names, family names, cities, streets, companies and brand names) and a list of 8,677 special application words. The lexicon is provided in XML format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=984&language=en

ELRA-S0236 LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon
The LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon comprises 10,520 phrases from the tourist domain. It is based on a list of short sentences obtained by translation from a US-English phrasal corpus of 10,449 phrases. The lexicon is provided in XML format.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=985&language=en

ELRA-S0237 LC-STAR US English phonetic lexicon
The LC-STAR US English phonetic lexicon comprises 102,310 words, including a set of 51,119 common words, a set of 51,111 proper names (including person names, family names, cities, streets, companies and brand names) and a list of 6,807 special application words. The lexicon is provided in XML format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=986&language=en

ELRA-W0015 Text corpus of 'Le Monde'
Corpus from 'Le Monde' newspaper. Years 1987 to 2002 are available in an ASCII text format. Years 2003 to 2006 are available in .XML format. Each month consists of some 10 MB of data (circa 120 MB per year).
For more information, see:
http://catalog.elra.info/product_info.php?products_id=438&language=en

For more information on the catalogue, please contact Valérie Mapelli: mailto:mapellielda.org

Our on-line catalogue has moved to the following address: http://catalog.elra.info. Please update your bookmarks.

Linguistic Field(s): Computational Linguistics