Publications
Authors:
  • Jean-Pierre Chanod , Pasi Tapanainen
Citation:
Proc. ECAI '96 workshop on 'Extended finite state models of language'Budapest,August 11-12, 1996.
Abstract:
This paper describes a non-deterministic tokeniser implemented and used for the development of a French
finite-state grammar. The tokeniser includes a finite-state automaton for simple tokens and a lexical transducer
that encodes a wide variety of multiword expressions, associated with multiple lexical descriptions when
required.
Year:
1996
Report number:
1996/013
Attachments: