Document Quality Measures




CS267

Chris Pollett

Nov 27, 2023

Outline

Introduction

Traffic Rank

Online Page Importance Computation (OPIC)

Quiz

Which of the following is true?

    1. Adding machines to a document partitioning multi-machine index will always speed up query processing.
    2. When a term partitioning multi-machine index is used for query processing, a central receptionist requests the top k query results from each partition and combines them into a final score.
    3. In a basic map reduce job between the map and reduce phases there is a shuffle phase where the pairs produced during the map phase are sorted by their key, and all values for the same key are grouped together.

Page Rank

The Power Method

Tweaks on the basic matrix

Topic Specific Page Rank

More Algorithms -- HITS

SALSA