enter search term and/or author name
Conjunctive Boolean queries are a key component of modern information retrieval systems, especially when Web-scale repositories are being searched. A conjunctive query q is equivalent to a |q|-way intersection over...
Engineering basic algorithms of an in-memory text search engine
Frederik Transier, Peter Sanders
Article No.: 2
Inverted index data structures are the key to fast text search engines. We first investigate one of the predominant operation on inverted indexes, which asks for intersecting two sorted lists of document IDs of different lengths. We explore...
Utilizing inter-passage and inter-document similarities for reranking search results
Eyal Krikon, Oren Kurland, Michael Bendersky
Article No.: 3
We present a novel language-model-based approach to reranking search results; that is, reordering the documents in an initially retrieved list so as to improve precision at top ranks. Our model integrates whole-document information with...
Improving graph-walk-based similarity with reranking: Case studies for personal information management
Einat Minkov, William W. Cohen
Article No.: 4
Relational or semistructured data is naturally represented by a graph, where nodes denote entities and directed typed edges represent the relations between them. Such graphs are heterogeneous, describing different types of objects and links. We...
Digital content production and distribution has radically changed our business models. An unprecedented volume of supply is now on offer, whetted by the demand of millions of users from all over the world. Since users cannot be expected to browse...
Extraction, characterization and utility of prototypical communication groups in the blogosphere
Munmun De Choudhury, Hari Sundaram, Ajita John, Doree Duncan Seligmann
Article No.: 6
This article analyzes communication within a set of individuals to extract the representative prototypical groups and provides a novel framework to establish the utility of such groups. Corporations may want to identify representative groups...