Publications
Authors:
  • Eric Gaussier, Cyril Goutte
Citation:
The 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil, August 15-19, 2005.
Abstract:
The techniques of non-negative matrix factorisation (NMF, [5]) and probabilistic latent semantic analysis (PLSA, [4]) have been successfully applied to a number of text analysis tasks such as document clustering. Despite their different inspirations, these methods are both instances of multinomial PCA [1]. We further explore this relationship: we first show that PLSA solves the problem of NMF with KL divergence, and then examine the implications of this result.
Year:
2005
Report number:
2005/029