Reversing controlled document authoring to normalize documents
This paper introduces document normalization and considers whether a controlled document authoring system
can be used in reverse mode to normalize legacy documents. A paradigm for deep content analysis using
such a system is proposed, and an architecture for a document normalization system is described.
EACL 11th Conference of the European Chapter of the Association for Computational Linguistics, Budapest, Hungary, April 12-17, 2003.