Dictionaries, Posting Lists, Index Construction




CS267

Chris Pollett

Sep 30, 2019

Outline

Introduction

Dictionary Types

Storing Dictionary Terms

Dictionary As a String Example

Sort-based versus Hash-based dictionaries

Posting Lists

Random Accesses of Posting Lists

Prefix Queries

Interleaving Dictionary and Posting Lists

An example of the dictionary interleaving strategy

Quiz

Which of the following is true?

  1. Class autoloading in PHP is done using the Class:forName() method.
  2. ASCII characters in UTF-8 take two bytes to store.
  3. The page rank of a document might be stored in the document map part of the index.

Dropping the distinction between terms and postings