Publications
Authors:
  • HervĂ© Dejean , Jean-Luc Meunier
Citation:
DAS (Document Analysis System), Boston, MA, USA, 9-11 June, 2010
Abstract:
After two participations to the INEX competition in the Structure Extraction task, which consists in building navigation tools for digitised books by constructing hyperlinked table of contents from OCR text and layout information, we present in this paper some reflections about this competition regarding its
dataset, and its evaluation measure. We point out some issues, and propose some recommendations for improving the groundtruth and the measures.
Year:
2010
Report number:
2009/092