Posting Lists, Index Construction




CS267

Chris Pollett

Oct. 12, 2011

Outline

Posting Lists

Random Accesses of Posting Lists

Prefix Queries

Interleaving Dictionary and Posting Lists

An example of the dictionary interleaving strategy

Dropping the distinction between terms and postings

Index Construction

In-memory Index Construction

buildIndex

buildIndex (indexTokenizer)
{
   position := 0;
   while (inputTokenizer.hasNext()) {
      T := inputTokenizer.getNext();
      obtain dictionary entry for T; create new entry, if necessary;
      append new posting position to T's posting list;
      position ++;
   }
   sort all dictionary entries in lex order
   for each term T in the dictionary {
      write T's postings list to disk
   }
   write the dictionary to disk
}
return

Next day we will continue our discussion of index construction.