Publications
Authors:
  • Aurelien Max
Citation:
EACL Workshop, Budapest, Hungary, 12-17 April 2003.
Abstract:
Document normalization is an interactive process that transforms raw legacy documents into semantically
controlled documents with the same communicative intention content . A paradigm for content analysis has
been implemented to select candidate document content semantic representations to account for the
communicative content of an input document. This implementation reuses the formal content specification
of a multilingual controlled authoring system. As a consequence, a candidate content semantic
representation can not only be associated with a text in the language of the input document, but also in all
the languages supported by the system. This paper presents how multilingual versions of an input legacy
document can be obtained interactively with the current implementation, and discusses of this kind of
normalizing translation.
Year:
2003
Report number:
2003/017
Attachments: