enter search term and/or author name
Building a distributed full-text index for the web
Sergey Melink, Sriram Raghavan, Beverly Yang, Hector Garcia-Molina
We identify crucial design issues in building a distributed inverted index for a large collection of Web pages. We introduce a novel pipelining technique for structuring the core index-building system that substantially reduces the index construction...
Scaling question answering to the web
Cody Kwok, Oren Etzioni, Daniel S. Weld
The wealth of information on the web makes it an attractive resource for seeking quick answers to simple, factual questions such as "e;who was the first American in space?"e; or "e;what is the second tallest mountain in the world?"e;...
WebQuilt: A proxy-based approach to remote web usability testing
WebQuilt is a web logging and visualization system that helps web design teams run usability tests (both local and remote) and analyze the collected data. Logging is done through a proxy, overcoming many of the problems with server-side and...
On the design of a learning crawler for topical resource discovery
Charu C. Aggarwal, Fatima Al-Garawi, Philip S. Yu
In recent years, the World Wide Web has shown enormous growth in size. Vast repositories of information are available on practically every possible topic. In such cases, it is valuable to perform topical resource discovery effectively. Consequently,...
A highly scalable and effective method for metasearch
Weiyi Meng, Zonghuan Wu, Clement Yu, Zhuogang Li
A metasearch engine is a system that supports unified access to multiple local search engines. Database selection is one of the main challenges in building a large-scale metasearch engine. The problem is to efficiently and accurately determine a...