Publications
Authors:
  • HervĂ© Dejean , Jean-Luc Meunier
Citation:
DocEng 05, Bristol, UK, November 2-4, 2005.
Abstract:
In this paper we present a method for structuring a document according to the information present in its table of contents. The detection of the ToC as well as the determination of the parts it refers to in the document body rely on a series of generic properties characterizing any ToC, while its hierarchization is achieved using clustering techniques. We also report on the robustness and performance of the method before discussing it, in light of related work
Year:
2005
Report number:
2005/002
Attachments: