LINGUIST List 20.2136

Thu Jun 11 2009

Software: ELRA - Language Resources Catalogue - Update

Editor for this issue: Fatemeh Abdollahi

        1.    Helene Mazo, ELRA - Language Resources Catalogue - Update

Message 1: ELRA - Language Resources Catalogue - Update
Date: 11-Jun-2009
From: Helene Mazo
Subject: ELRA - Language Resources Catalogue - Update

ELRA is happy to announce that 1 new Written Corpus is now available in its catalogue:

ELRA-W0050 The CINTIL Corpus – International Corpus of Portuguese

CINTIL-Corpus Internacional do Português is a linguistically interpreted written and spoken corpus of European Portuguese. It is composed of one million annotated tokens, each of which has been verified by human expert annotators. The annotation comprises information on part-of-speech, open class lemma and inflection, multi-word expressions pertaining to the class of adverbs and to the closed POS classes, and multi-word proper names (for named entity recognition). The corpus is built on raw textual materials of several types, 30% of which are spoken materials.

For more information, see:

For more information on the catalogue, please contact Valérie

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates:

Linguistic Field(s): Computational Linguistics; Text/Corpus Linguistics