enter search term and/or author name
Word-based self-indexes for natural language text
Antonio Fariña, Nieves R. Brisaboa, Gonzalo Navarro, Francisco Claude, Ángeles S. Places, Eduardo Rodríguez
Article No.: 1
The inverted index supports efficient full-text searches on natural language text collections. It requires some extra space over the compressed text that can be traded for search speed. It is usually fast for single-word searches, yet phrase...
Static index pruning in web search engines: Combining term and document popularities with query views
Ismail S. Altingovde, Rifat Ozcan, Özgür Ulusoy
Article No.: 2
Static index pruning techniques permanently remove a presumably redundant part of an inverted file, to reduce the file size and query processing time. These techniques differ in deciding which parts of an index can be removed safely; that is,...
Summarizing figures, tables, and algorithms in scientific publications to augment search results
Sumit Bhatia, Prasenjit Mitra
Article No.: 3
Increasingly, special-purpose search engines are being built to enable the retrieval of document-elements like tables, figures, and algorithms [Bhatia et al. 2010; Liu et al. 2007; Hearst et al. 2007]. These search engines present a thumbnail view...
Multiple testing in statistical analysis of systems-based information retrieval experiments
Benjamin A. Carterette
Article No.: 4
High-quality reusable test collections and formal statistical hypothesis testing together support a rigorous experimental environment for information retrieval research. But as Armstrong et al. [2009b] recently argued, global analysis of...
High-performance processing of text queries with tunable pruned term and term pair indexes
Andreas Broschart, Ralf Schenkel
Article No.: 5
Term proximity scoring is an established means in information retrieval for improving result quality of full-text queries. Integrating such proximity scores into efficient query processing, however, has not been equally well studied. Existing...
Large-scale validation and analysis of interleaved search evaluation
Olivier Chapelle, Thorsten Joachims, Filip Radlinski, Yisong Yue
Article No.: 6
Interleaving is an increasingly popular technique for evaluating information retrieval systems based on implicit user feedback. While a number of isolated studies have analyzed how this technique agrees with conventional offline evaluation...