Logarithmic Merging, BM25F, PRF, DFR




CS267

Chris Pollett

Nov 5, 2018

Outline

Introduction

Quiz

Which of the following is true?

  1. LLRUN was an example of a global parametric gap compression technique that we discussed.
  2. There are no circumstances under which it makes sense to use the REBUILD index batch update strategy as opposed to the REMERGE strategy.
  3. If the gaps in our postings follow a geometric distribution then a `delta` code is the best choice for compression of posting delta lists.

Hybrid Index Maintenance

Logarithmic Merging

How Logarithmic Merging Works

BM25F and Pseudo-Relevance Feedback

Modeling Relevance

Log-Odds

Generating Queries from Documents