TY - GEN
T1 - What Snippets Say about Pages in Federated Web Search
AU - Demeester, Thomas
AU - Nguyen, Dong
AU - Trieschnigg, Dolf
AU - Develder, Chris
AU - Hiemstra, Djoerd
PY - 2012
Y1 - 2012
N2 - What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new federated IR test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Google, Yahoo!, and Bing, as well as a number of smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we are able to investigate the connection between snippet quality and page relevance. The dataset is strongly inhomogeneous, and although the assessors’ consistency is shown to be satisfying, care is required when comparing resources. To this end, a number of probabilistic quantities, based on snippet and page relevance, are introduced and evaluated.
AB - What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new federated IR test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Google, Yahoo!, and Bing, as well as a number of smaller search engines dedicated to multimedia, shopping, etc., and as such reflects a realistic Web environment. Using a large set of relevance assessments, we are able to investigate the connection between snippet quality and page relevance. The dataset is strongly inhomogeneous, and although the assessors’ consistency is shown to be satisfying, care is required when comparing resources. To this end, a number of probabilistic quantities, based on snippet and page relevance, are introduced and evaluated.
U2 - 10.1007/978-3-642-35341-3_21
DO - 10.1007/978-3-642-35341-3_21
M3 - Conference contribution
SN - 978-3-642-35340-6
T3 - Lecture Notes in Computer Science (LNCS)
SP - 250
EP - 261
BT - Information Retrieval Technology
PB - Springer
ER -