Publications
Authors:
  • Greg Grefenstette
Citation:
ASLIB, Translating and the Computer 21, London, Nov 10-11, 1999.
Abstract:
The WWW is two orders of magnitude larger than the largest corpora. Although noisy, web text presents
language as it is used, and statistics derived from the Web can have practical uses in many NLP applications.
For this reason, the WWW should be seen and studied as any other computationally available linguistic
resource. In this article, we illustrate this by showing that an Example-Based approach to lexical choice for
machine translation can use the Web as an adequate and free resource.
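
As a rough illustration of the idea in the abstract, the sketch below picks among candidate translations of a multi-word term by comparing how often each candidate phrase occurs in a corpus. It is a minimal sketch under stated assumptions, not the procedure of the paper itself: the function name `choose_translation`, the candidate dictionary, and the frequency counts are all hypothetical, and `count_fn` stands in for whatever source of Web frequencies (for example, search engine hit counts) is available.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple


def choose_translation(
    source_words: List[str],
    candidates: Dict[str, List[str]],
    count_fn: Callable[[str], int],
) -> Tuple[str, int]:
    """Return the candidate phrase with the highest frequency, and that frequency.

    source_words: source-language words, assumed already in target word order.
    candidates:   hypothetical bilingual dictionary (source word -> translations).
    count_fn:     returns how often a phrase occurs (assumed to come from the Web).
    """
    # Every combination of one translation per source word is a candidate phrase.
    options = [candidates[word] for word in source_words]
    phrases = [" ".join(combo) for combo in product(*options)]
    # Score each phrase by its frequency and keep the most frequent one.
    scored = [(phrase, count_fn(phrase)) for phrase in phrases]
    return max(scored, key=lambda pair: pair[1])


# Toy data, invented for illustration only; these are not real Web counts.
toy_counts = {
    "work group": 120,
    "labour group": 30,
    "work cluster": 5,
    "labour cluster": 1,
}
toy_dictionary = {
    "travail": ["work", "labour"],
    "groupe": ["group", "cluster"],
}

best_phrase, best_count = choose_translation(
    ["travail", "groupe"],
    toy_dictionary,
    lambda phrase: toy_counts.get(phrase, 0),
)
print(best_phrase, best_count)
```

Passing the frequency source in as a function keeps the selection logic independent of how the counts are obtained, so a local corpus index could be substituted for Web counts without changing the rest of the sketch.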
Year:
1999
Report number:
1999/004