Chris Pollett> CS267

CS267 Spring 2022 Lecture Notes

Topics in Database Systems

Videos of lectures are available.

Below are my lecture notes for the class so far. They should serve as a rough guide to what was covered on any given day. However, I frequently say more in class than is in these notes, and I tend to correct typos on the board during lecture that may still appear here. So caveat emptor.

Week 1: [Jan 26 - Introduction to Information Retrieval]

Week 2: [Jan 31 - Text Formats, Tokenization, Term Distributions, Language Models] [Feb 2 - Language Modeling, Test Collections, Open-Source IR Systems, Inverted Indexes]

Week 3: [Feb 7 - Learning to Crawl] [Feb 9 - PHP]

Week 4: [Feb 14 - Finish PHP; Back to Inverted Index ADT] [Feb 16 - Galloping/Exponential Search, Document-Oriented Indexes]

Week 5: [Feb 21 - VSM, Proximity Ranking] [Feb 23 - Finish Proximity Ranking, Boolean Retrieval]

Week 6: [Feb 28 - Evaluating Results, Token and Term Processing] [Mar 2 - Text Preprocessing and More PHP]

Week 7: [Mar 7 - PHP Autoloading - Yioop as an IR Library] [Mar 9 - Char-gramming, Language Processing]

Week 8: [Mar 14 - Practice Midterm] [Mar 16 - Midterm]

Week 9: [Mar 21 - Static Inverted Indices] [Mar 23 - Inverted Index Construction]

Week 10: [Mar 28 - Spring Break] [Mar 30 - Spring Break]

Week 11: [Apr 4 - Merge Based Index Construction - Query Processing] [Apr 6 - Accumulator Pruning, Concordance Lists]

Week 12: [Apr 11 - Finish GC-Lists, trec_eval, Start Index Compression] [Apr 13 - Huffman Coding]

Week 13: [Apr 18 - Arithmetic Coding, Gap Compression] [Apr 20 - Finish Compression, Dynamic Inverted Indexes]

Week 14: [Apr 25 - Ranking using Language Models] [Apr 27 - Divergence-from-randomness, Parallel Information Retrieval]

Week 15: [May 2 - More Parallel Information Retrieval] [May 4 - Document Quality Measures]

Week 16: [May 9 - Doc Quality, Page Rank via Map Reduce, Hadoop] [May 11 - More Categorization and Filtering]