ACM Transactions on Information Systems (TOIS), Volume 26 Issue 2, March 2008

Classification-aware hidden-web text database selection
Panagiotis G. Ipeirotis, Luis Gravano
Article No.: 6
DOI: 10.1145/1344411.1344412

Many valuable text databases on the web have noncrawlable contents that are “hidden” behind search interfaces. Metasearchers are helpful tools for searching over multiple such “hidden-web” text databases at once through a...

Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace
Ahmed Abbasi, Hsinchun Chen
Article No.: 7
DOI: 10.1145/1344411.1344413

One of the problems often associated with online anonymity is that it hinders social accountability, as substantiated by the high levels of cybercrime. Although identity cues are scarce in cyberspace, individuals often leave behind textual...

Towards a belief-revision-based adaptive and context-sensitive information retrieval system
Raymond Y. K. Lau, Peter D. Bruza, Dawei Song
Article No.: 8
DOI: 10.1145/1344411.1344414

In an adaptive information retrieval (IR) setting, the information seekers' beliefs about which terms are relevant or nonrelevant will naturally fluctuate. This article investigates how the theory of belief revision can be used to model adaptive...

Locality-based pruning methods for web search
Edleno Silva de Moura, Celia Francisca dos Santos, Bruno dos Santos de Araujo, Altigran Soares da Silva, Pavel Calado, Mario A. Nascimento
Article No.: 9
DOI: 10.1145/1344411.1344415

This article discusses a novel approach developed for static index pruning that takes into account the locality of occurrences of words in the text. We use this new approach to propose and experiment on simple and effective pruning methods that...

DirichletRank: Solving the zero-one gap problem of PageRank
Xuanhui Wang, Tao Tao, Jian-Tao Sun, Azadeh Shakery, Chengxiang Zhai
Article No.: 10
DOI: 10.1145/1344411.1344416

Link-based ranking algorithms are among the most important techniques to improve web search. In particular, the PageRank algorithm has been successfully used in the Google search engine and has been attracting much attention recently. However, we...

On ranking techniques for desktop search
Sara Cohen, Carmel Domshlak, Naama Zwerdling
Article No.: 11
DOI: 10.1145/1344411.1344417

Users tend to store huge amounts of files, of various formats, on their personal computers. As a result, finding a specific, desired file within the file system is a challenging task. This article addresses the desktop search problem by...