Ranking using Language Models




CS267

Chris Pollett

Apr 26, 2021

Outline

BM25F and Pseudo-Relevance Feedback

Modeling Relevance

Log-Odds

Generating Queries from documents

Language Models

Smoothing

Quiz

Which of the following is true?

  1. LLRUN is an example of a global parametric gap compression technique.
  2. Remerging an inverted index is always faster than rebuilding it.
  3. Hybrid index maintenance and logarithmic merging are two names for the same thing.

Ranking with Language Models

Massaging our equations

Substituting in a particular model

Kullback-Leibler Divergence