Char-gramming, Language Processing, Static Inverted Indices




CS267

Chris Pollett

Sep 25, 2019

Outline

Finishing up Stemming

Example of stemmed text

Stopping

Characters

Understanding Unicode

Character n-grams

European Languages

CJK(V) Languages

In-Class Exercise

Inverted Index Intro

The Dictionary