CS267 Fall 2023 Lecture Notes
Topics in Database Systems
Videos of lectures are available.
Below are my lecture notes for the class so far.
They should serve as a rough guide to what was covered on any given day.
Frequently, however, I say more in class than is in these notes.
Also, I tend to dynamically correct typos on the board
that might appear in these lecture notes. So caveat emptor.
Week 1: [Aug 21 - Introduction to Information
Retrieval] [Aug 23 - Text Formats, Tokenization, Term Distributions, Language Models]
Week 2: [Aug 28 - Language Modeling, Test Collections, Open-Source IR Systems, Inverted Indexes] [Aug 30 - Learning to Crawl]
Week 3: [Sep 4 - Labor Day] [Sep 6 - PHP]
Week 4: [Sep 11 - Finish PHP; Back to Inverted Index ADT] [Sep 13 - Galloping/Exponential Search, Document-Oriented Indexes]
Week 5: [Sep 18 - Index Types, VSM Ranking] [Sep 20 - Proximity Ranking, Boolean Retrieval]
Week 6: [Sep 25 - Evaluating Results, Token and Term Processing] [Sep 27 - Text Preprocessing and More PHP]
Week 7: [Oct 2 - PHP Autoloading - Yioop as an IR Library] [Oct 4 - Char-gramming, Language Processing]
Week 8: [Oct 9 - Review] [Oct 11 - Midterm]
Week 9: [Oct 16 - Static Inverted Indices] [Oct 18 - Index Construction]
Week 10: [Oct 23 - Merge Based Index Construction - Query Processing] [Oct 25 - Heap query processing, Accumulator Pruning, Concordance Lists]
Week 11: [Oct 30 - Finish GC-Lists, trec_eval, Start Index Compression] [Nov 1 - Huffman Coding]
Week 12: [Nov 6 - Arithmetic Coding, Gap Compression] [Nov 8 - Finish Compression, Dynamic Inverted Indexes]
Week 13: [Nov 13 - Ranking using Language Models] [Nov 15 - Divergence-from-randomness, Parallel Information Retrieval]
Week 14: [Nov 20 - More Parallel Information Retrieval]
Week 15: [Nov 27 - Document Quality Measures] [Nov 29 - Doc Quality, Page Rank via Map Reduce]
Week 16: [Dec 4 - Map Reduce on Hadoop - Categorization and Filtering] |