Language Modeling, Test Collections, Open-Source IR Systems, Inverted Indexes




CS267

Chris Pollett

Aug 28, 2019

Outline

Higher-order Models

On Monday, we were talking about language modeling. Let's briefly recall some of the things we learned...

Example

In-Class Exercise

Suppose the phrase "hammer time" appears 2 times in Shakespeare and "hammer" appear 256 times.

What is `M_0(mbox("hammer time"))`? What is `M_1(mbox("time") | mbox("hammer"))`?

Post your solutions to the Aug. 28 In-Class Exercise Thread.

Markov Models

Markov Model for to be or not to be

More Markov Models

Test Collections

TREC Tasks

Open-Source IR Systems

Inverted Indices

diagram with dictionary and posting list of an inverted index

ADT Example: Phrase Search