LINGUIST List 21.4878
Fri Dec 03 2010
Software: NooJ: Finite-State Language Processing
Editor for this issue: Susanne Vejdemo
<susannelinguistlist.org>
1. Chris Humphrey ,
NooJ: Finite-State Language Processing
Message 1: NooJ: Finite-State Language Processing
Date: 03-Dec-2010
From: Chris Humphrey <chumphreyc-s-p.org>
Subject: NooJ: Finite-State Language Processing
E-mail this message to a friend
http://www.nooj4nlp.net/pages/nooj.html
NooJ is both a corpus processing tool and a linguistic developmentenvironment: it allows linguists to formalize several levels of linguisticphenomena: orthography and spelling, lexicons for simple words, multiwordunits and frozen expressions, inflectional, derivational and productivemorphology, local, structural syntax and transformational syntax. For eachof these levels, NooJ provides linguists with one or more formal toolsspecifically designed to facilitate the description of each phenomenon, aswell as parsing tools designed to be as computationally efficient aspossible. This approach distinguishes NooJ from most computationallinguistic tools, which provide a single formalism that should describeeverything. As a corpus processing tool, NooJ allows users to applysophisticated linguistic queries to large corpora in order to build indicesand concordances, annotate texts automatically, perform statisticalanalyses, etc.
NooJ is freely available and linguistic modules can already be downloadedfor Acadian, Arabic, Armenian, Bulgarian, Catalan, Chinese, Croatian,French, English, German, Hebrew, Greek, Hungarian, Italian, Polish,Portuguese, Spanish and Turkish.
Linguistic Field(s):
Morphology
Syntax
Text/Corpus Linguistics
Subject Language(s): Armenian (hye)
Bulgarian (bul)
Chinese, Mandarin (cmn)
Catalan-Valencian-Balear (cat)
English (eng)
French (fra)
German, Standard (deu)
Greek (ell)
Hebrew (heb)
Hungarian (hun)
Italian (ita)
Portuguese (por)
Polish (pol)
Spanish (spa)
Turkish (tur)
Croatian (hrv)
Page Updated: 03-Dec-2010
|