ACM DL

Information Systems (TOIS)

Menu

Search Issue
enter search term and/or author name

Archive


ACM Transactions on Information Systems (TOIS), Volume 30 Issue 1, February 2012

Word-based self-indexes for natural language text
Antonio Fariña, Nieves R. Brisaboa, Gonzalo Navarro, Francisco Claude, Ángeles S. Places, Eduardo Rodríguez
Article No.: 1
DOI: 10.1145/2094072.2094073

The inverted index supports efficient full-text searches on natural language text collections. It requires some extra space over the compressed text that can be traded for search speed. It is usually fast for single-word searches, yet phrase...

Static index pruning in web search engines: Combining term and document popularities with query views
Ismail S. Altingovde, Rifat Ozcan, Özgür Ulusoy
Article No.: 2
DOI: 10.1145/2094072.2094074

Static index pruning techniques permanently remove a presumably redundant part of an inverted file, to reduce the file size and query processing time. These techniques differ in deciding which parts of an index can be removed safely; that is,...

Summarizing figures, tables, and algorithms in scientific publications to augment search results
Sumit Bhatia, Prasenjit Mitra
Article No.: 3
DOI: 10.1145/2094072.2094075

Increasingly, special-purpose search engines are being built to enable the retrieval of document-elements like tables, figures, and algorithms [Bhatia et al. 2010; Liu et al. 2007; Hearst et al. 2007]. These search engines present a thumbnail view...

Multiple testing in statistical analysis of systems-based information retrieval experiments
Benjamin A. Carterette
Article No.: 4
DOI: 10.1145/2094072.2094076

High-quality reusable test collections and formal statistical hypothesis testing together support a rigorous experimental environment for information retrieval research. But as Armstrong et al. [2009b] recently argued, global analysis of...

High-performance processing of text queries with tunable pruned term and term pair indexes
Andreas Broschart, Ralf Schenkel
Article No.: 5
DOI: 10.1145/2094072.2094077

Term proximity scoring is an established means in information retrieval for improving result quality of full-text queries. Integrating such proximity scores into efficient query processing, however, has not been equally well studied. Existing...

Large-scale validation and analysis of interleaved search evaluation
Olivier Chapelle, Thorsten Joachims, Filip Radlinski, Yisong Yue
Article No.: 6
DOI: 10.1145/2094072.2094078

Interleaving is an increasingly popular technique for evaluating information retrieval systems based on implicit user feedback. While a number of isolated studies have analyzed how this technique agrees with conventional offline evaluation...