Static Inverted Indices




CS267

Chris Pollett

Mar 15, 2021

Outline

Inverted Index Intro

The Dictionary

Dictionary Types

Storing Dictionary Terms

Dictionary As a String Example

Sort-based versus Hash-based Dictionaries

Posting Lists

Random Accesses of Posting Lists

Prefix Queries

Interleaving Dictionary and Posting Lists

An Example of the Dictionary Interleaving Strategy

Quiz

Which of the following is true?

  1. Class autoloading in PHP is done using the Class:load() method.
  2. ASCII characters in UTF-8 sometimes take two bytes to store.
  3. Segmentation is often done on text for languages that don't traditionally use whitespace to separate terms.

Dropping the Distinction Between Terms and Postings