Ranking using Language Models




CS267

Chris Pollett

Apr 25, 2022

Outline

BM25F and Pseudo-Relevance Feedback

Modeling Relevance

Log-Odds

Generating Queries from Documents

Language Models

Smoothing

Quiz

Which of the following is true?

  1. Rebuilding an inverted index is always faster than remerging it.
  2. LLRUN is an example of a global parametric gap compression technique.
  3. A `gamma` code is a special kind of arithmetic code.

Ranking with Language Models

Massaging our equations

Substituting in a particular model

Kullback-Leibler Divergence