Home pageSite mapContact

INTERNSHIP PROJECT PROPOSAL

Using partitionning techniques for categories refinement in classification tasks:

Date: 2008-01-24

Unit: Grenoble/TVPA

Proposers:

Julien Ah-Pine:julien.ah-pine@xrce.xerox.com

Duration:

3-6 months

Start date:

February 2008 and after

Description:

The main research lines within the Textual and Visual Pattern Analysis (TVPA) area at XRCE are categorization and retrieval of text and images; multimodal and hybrid pattern analysis (text, images, cross-lingual); image clustering and visualization; machine learning; document object detection and image aesthetics. Much of our research along these lines has been delivered into innovative solutions with proven scientific and commercial performance. Examples of this are Xerox GVC (image categorization engine), CategoriX (text categorization engine) and Xerox AIE (Automatic Image Enhancement).

The general aim of this internship is to investigate how partitioning algorithms (relational or k-means methods...) can improve classification tasks according to the following decomposition process : first apply locally to each categories, a partitioning algorithm in order to have sub-categories; then analyse the interdependencies between sub-categories for disambiguation purposes; finally, assign a new object to the "nearest" sub-category or use a specific classifier in case of ambiguity

The successful candidate will combine competences in clustering, classification methods and matlab implementation. Knowledge and practice of text and/or image processing will be a plus.

XRCE provides an informal and relaxed working environment situated in the Parc de Maupertuis in Meylan. The successful students will be given the freedom and flexibility to find their own solutions and to work in a way that suits them but will have the guidance and support of experienced full-time Xerox researchers and thereby gain an introduction to the field of commercial research in a world-class research laboratory.

Back to list of subjects