Publications
Authors:
  • Xavier Tannier
Citation:
Appeared in Traitement Automatique des Langues, volume 47, issue 2
Abstract:
Some tags used in XML documents create arbitrary breaks in the natural flow of the text. This flexibility can cause difficulties for some document engineering techniques. This article presents the issue and proposes answers, first in theory, by introducing a new concept of reading context, and then in practice, with an automatic classification of tags and a generic tool for XML content handling.
Year:
2007
Report number:
2006/037