Document Quality Measures

CS267

Chris Pollett

Nov 18, 2019

Outline

Introduction

Traffic Rank

Online Page Importance Computation (OPIC)

Quiz

Which of the following is true?

  1. Intra-query parallelism is parallelism achieved by having whole queries computed by different machines.
  2. `P_1` in DFR was estimated using Laplace's law of succession.
  3. A document is said to be elite for the term `t` when it is "about" the topic associated with the term.

Page Rank

The Power Method

Tweaks on the basic matrix

Topic Specific Page Rank

More Algorithms -- HITS

SALSA