Logarithmic Merging, BM25F, PRF, DFR




CS267

Chris Pollett

Nov. 19, 2012

Outline

Introduction

Quiz

Which of the following is true?

  1. Index batch updates using REMERGE are easier to implement than batch updates using REBUILD.
  2. A NO MERGE-based index, all things being equal, would typically be faster than IMMEDIATE MERGE based index for query processing.
  3. Dynamic inverted indices that support document insertions and document deletions but allow a delay before updates are seen are called semi-static.

Hybrid Index Maintenance

Logarithmic Merging

How Logarithmic Merging Works

BM25F and Pseudo-Relevance Feedback

Modeling Relevance

Log-Odds

Generating Queries from documents