Static Inverted Indices




CS267

Chris Pollett

Mar 21, 2022

Outline

Inverted Index Intro

The Dictionary

Dictionary Types

Storing Dictionary Terms

Dictionary As a String Example

Sort-based versus Hash-based dictionaries

Posting Lists

Random Accesses of Posting Lists

Prefix Queries

Interleaving Dictionary and Posting Lists

An example of the dictionary interleaving strategy

Quiz

Which of the following is true?

  1. ASCII characters in UTF-8 sometimes take two bytes to store.
  2. Segmentation could be useful in separating the web-site keywords from a url.
  3. Class autoloading in PHP is done based on the path given by a package command.

Dropping the distinction between terms and postings