Arithmetic Coding, Gap Compression




CS267

Chris Pollett

Apr 19, 2021

Outline

Motivating Arithmetic Coding

Arithmetic Coding

More Arithmetic Coding

Redux

Finishing Up General Text Compression

Compressing Posting Lists: `Delta`-values

Nonparametric Gap Compression

Quiz

Which of the following is true?

  1. A static, symbol-wise compression method compresses symbols in the same way independent of message being compressed.
  2. To code a string using a `gamma`-code we first build a Huffman tree.
  3. The trec_eval program makes use of a sub-module to automatically judge relevance of documents without human involvement.

Parametric Gap Compression

Geometric Distributions and Posting Lists

Golomb/Rice Codes

Finding the Modulus

Byte-Aligned Codes