ACM Transactions on Information Systems (TOIS), Volume 32 Issue 1, January 2014

Suffix Array Construction in External Memory Using D-Critical Substrings
Ge Nong, Wai Hong Chan, Sen Zhang, Xiao Feng Guan
Article No.: 1
DOI: 10.1145/2518175

We present a new suffix array construction algorithm that aims to build, in external memory, the suffix array for an input string whose length n is on the order of tens of giga-characters, over a constant or integer alphabet. The core...

Document Score Distribution Models for Query Performance Inference and Prediction
Ronan Cummins
Article No.: 2
DOI: 10.1145/2559170

Modelling the distribution of document scores returned from an information retrieval (IR) system in response to a query is of both theoretical and practical importance. One of the goals of modelling document scores in this manner is the inference...

Indexing Word Sequences for Ranked Retrieval
Samuel Huston, J. Shane Culpepper, W. Bruce Croft
Article No.: 3
DOI: 10.1145/2559168

Formulating and processing phrases and other term dependencies to improve query effectiveness is an important problem in information retrieval. However, accessing word-sequence statistics using inverted indexes requires unreasonable processing...

Cost-Aware Collaborative Filtering for Travel Tour Recommendations
Yong Ge, Hui Xiong, Alexander Tuzhilin, Qi Liu
Article No.: 4
DOI: 10.1145/2559169

Advances in tourism economics have enabled us to collect massive amounts of travel tour data. If properly analyzed, this data could be a source of rich intelligence for providing real-time decision making and for the provision of travel tour...

Learning to Recommend Descriptive Tags for Questions in Social Forums
Liqiang Nie, Yi-Liang Zhao, Xiangyu Wang, Jialie Shen, Tat-Seng Chua
Article No.: 5
DOI: 10.1145/2559157

Around 40% of the questions in emerging social question answering forums have at most one manually labeled tag, a result of incomplete question understanding or informal tagging behavior. The incompleteness of...