Document Quality Measures




CS267

Chris Pollett

Dec 3, 2012

Outline

Introduction

Traffic Rank

Online Page Importance Computation (OPIC)

Quiz

Which of the following is true?

  1. In DFR, Laplace's law of succession is used to estimate document eliteness.
  2. In DFR, we used a variant on the binomial coefficient to estimate document eliteness.
  3. One virtue of map reduce is that it is fault tolerant to failures of a reducer, so we don't need to store a reducer's inputs in a distributed file system.

Page Rank

The Power Method

Tweaks on the basic matrix

Topic Specific Page Rank

More Algorithms -- HITS

SALSA