Morphological analysis and generation: a first step in natural language processing

Ken Beesley
Computer programs that perform morphological analysis and generation are a useful bridge between language resources, such as corpora, lexicons and printed grammars, and the overall field of natural language processing, which includes tokenization, spelling checking, spelling correction, non-trivial dictionary lookup, language teaching and comprehension assistance, part-of-speech disambiguation, syntactic parsing, text-to-speech, speech recognition, and many other applications. This paper is an overview of morphological analysis/generation using finite-state techniques, listing available software, showing how existing language resources can be used in building and testing morphology systems, and explaining how root-guessing morphological analyzers can help expand those resources by actively suggesting new roots that need to be added to the lexicon.
Fourth International Conference on Language Resources and Evaluation, LREC 2004, Lisbon, Portugal, May 26-28