LINGUIST List 21.4878

Fri Dec 03 2010

Software: NooJ: Finite-State Language Processing

Editor for this issue: Susanne Vejdemo <>

        1.     Chris Humphrey , NooJ: Finite-State Language Processing

Message 1: NooJ: Finite-State Language Processing
Date: 03-Dec-2010
From: Chris Humphrey <>
Subject: NooJ: Finite-State Language Processing
E-mail this message to a friend

NooJ is both a corpus processing tool and a linguistic developmentenvironment: it allows linguists to formalize several levels of linguisticphenomena: orthography and spelling, lexicons for simple words, multiwordunits and frozen expressions, inflectional, derivational and productivemorphology, local, structural syntax and transformational syntax. For eachof these levels, NooJ provides linguists with one or more formal toolsspecifically designed to facilitate the description of each phenomenon, aswell as parsing tools designed to be as computationally efficient aspossible. This approach distinguishes NooJ from most computationallinguistic tools, which provide a single formalism that should describeeverything. As a corpus processing tool, NooJ allows users to applysophisticated linguistic queries to large corpora in order to build indicesand concordances, annotate texts automatically, perform statisticalanalyses, etc.

NooJ is freely available and linguistic modules can already be downloadedfor Acadian, Arabic, Armenian, Bulgarian, Catalan, Chinese, Croatian,French, English, German, Hebrew, Greek, Hungarian, Italian, Polish,Portuguese, Spanish and Turkish.

Linguistic Field(s): Morphology                             Syntax                             Text/Corpus Linguistics
Subject Language(s): Armenian (hye)
                            Bulgarian (bul)                             Chinese, Mandarin (cmn)                             Catalan-Valencian-Balear (cat)                             English (eng)                             French (fra)                             German, Standard (deu)                             Greek (ell)                             Hebrew (heb)                             Hungarian (hun)                             Italian (ita)                             Portuguese (por)                             Polish (pol)                             Spanish (spa)                             Turkish (tur)                             Croatian (hrv)

Page Updated: 03-Dec-2010