Taxonomy generation for text segments: A practical web-based approach
Shui-Lung Chuang, Lee-Feng Chien
It is crucial in many information systems to organize short text segments, such as keywords in documents and queries from users, into a well-formed taxonomy. In this article, we address the problem of taxonomy generation for diverse text segments...
Set-based vector model: An efficient approach for correlation-based ranking
Bruno Pôssas, Nivio Ziviani, Wagner Meira, Jr., Berthier Ribeiro-Neto
This work presents a new approach for ranking documents in the vector space model. The novelty lies on two fronts. First, patterns of term co-occurrence are taken into account and are processed efficiently. Second, term weights are generated using a...
Learning to crawl: Comparing classification schemes
Gautam Pant, Padmini Srinivasan
Topical crawling is a young and creative area of research that holds the promise of benefiting from several sophisticated data mining techniques. The use of classification algorithms to guide topical crawlers has been sporadically suggested in the...
Evolution of web site design patterns
Melody Y. Ivory, Rodrick Megraw
The Web enables broad dissemination of information and services; however, the ways in which sites are designed can either facilitate or impede users' benefit from these resources. We present a longitudinal study of web site design from 2000 to 2003....