ACM Transactions on Information Systems (TOIS), Volume 24, Issue 3, July 2006

Extraction of coherent relevant passages using hidden Markov models
Jing Jiang, Chengxiang Zhai
Pages: 295-319
DOI: 10.1145/1165774.1165775
In information retrieval, retrieving relevant passages, as opposed to whole documents, not only directly benefits the end user by filtering out the irrelevant information within a long relevant document, but also improves retrieval accuracy in...

Query enrichment for web-query classification
Dou Shen, Rong Pan, Jian-Tao Sun, Jeffrey Junfeng Pan, Kangheng Wu, Jie Yin, Qiang Yang
Pages: 320-352
DOI: 10.1145/1165774.1165776
Web-search queries are typically short and ambiguous. To classify these queries into certain target categories is a difficult but important problem. In this article, we present a new technique called query enrichment, which takes a short query and...

CLAIRE: A modular support vector image indexing and classification system
Chih-Fong Tsai, Ken McGarry, John Tait
Pages: 353-379
DOI: 10.1145/1165774.1165777
Many users of image retrieval systems would prefer to express initial queries using keywords. However, manual keyword indexing is very time-consuming. Therefore, a content-based image retrieval system which can automatically assign keywords to images...

A large scale, corpus-based approach for automatically disambiguating biomedical abbreviations
Hong Yu, Won Kim, Vasileios Hatzivassiloglou, John Wilbur
Pages: 380-404
DOI: 10.1145/1165774.1165778
Abbreviations and acronyms are widely used in the biomedical literature and many of them represent important biomedical concepts. Because many abbreviations are ambiguous (e.g., CAT denotes both chloramphenicol acetyl transferase and...