2007/057 - Factored word-sequence kernels
- Pierre Mahé,Nicola Cancedda
ESANN 2008, European Symposium on artificial neural networks, Bruges, Belgium, 23-25 April, 2008.
In this paper we propose an extension of sequence kernels to the case where the symbols that define the sequences have multiple representations. This configuration occurs in natural language processing for instance, where words can be characterized according to different linguistic dimensions. The core of our contribution is to integrate early the different representations in the kernel, in a way that generates rich composite features defined across the various symbol dimensions.