Keywords

Authors

Year

Automatic Wrapper Generation for Web Search Engines

Authors: Boris Chidlovskii, Jon Ragetli, Maarten de Rijke
The 1st Intern. Conf. on Web-Age Information Management (WAIM'2000), Shanghai, China, June 2000
To facilitate effective search on the World Wide Web, several `meta search engines' have been developed which do not search the Web themselves, but use available search engines to find the required information. By means of wrappers, meta search engines retrieve relevant information from the HTML pages returned by search engines. In this paper we present an algorithm to create such wrappers automatically, that uses an adaptation of the "string edit distance". Our algorithm performs well; it is quick, it can be used for several types of result pages and it requires a minimal amount of interaction with the user.
Year: 2000
Report number: 2000/204

Attachments

waim00.ps (255.77 kB)