Multi-language machine translation through interactive document normalization

Aurelien Max
Document normalization is an interactive process that transforms raw legacy documents into semantically controlled documents with the same communicative content. A paradigm for content analysis has been implemented to select candidate semantic representations of document content that account for the communicative content of an input document. This implementation reuses the formal content specification of a multilingual controlled authoring system. As a consequence, a candidate semantic representation can be associated with a text not only in the language of the input document, but also in every language supported by the system. This paper presents how multilingual versions of an input legacy document can be obtained interactively with the current implementation, and discusses this kind of normalizing translation.
EACL Workshop, Budapest, Hungary, 12-17 April 2003.

