Chris Pollett > Students >
Padia

Project Blog

Week 11 : April 11 2023

1. Review K-Means Patch

2. Review Word Encoding

Things to do:

Start Working on Report

Week 8 : March 21 2023

1. Show K-Means Algorithm Implementation

Things to do:

Implement Word Encoding

Week 7 : March 14 2023

1. Topic Model Discussion

Things to do:

Start Implementing K-Means algorithm

Week 6 : March 07 2023

1. Topic Model Discussion

2. Discussed LDA Implementation

3. Discussed K-Means Implementation

Things to do:

1. Continue working on topic model

Week 5 : February 28 2023

1. Create code pull request

Things to do:

Start implementing LDA

Week 4 : February 21 2023

1. Showed performance evaluation of current implementation

2. Code review of performance evaluation and test classes

Things to do:

1. Fix review comments

2. Create issue on Mantis

3. Work on dataset creation for LDA

Week 3 : February 14 2023

1. Showed partial implementation of performance evaluation module

Things to do:

1. Continue working on implementation

Week 2 : February 07 2023

1. Submit Proposal

Things to do:

1. Start working on implementation

Week 1 : January 31 2023

1. Decide on deliverables

Things to do:

1. Start working on proposal


Week 15 : December 06 2022

1. Demonstrate result of Static-Semi-Static-Dynamic Cache

2. Review CS 297 Report

3. Demonstrate Cache Simulator in PHP

Things to do:

1. Fix deliverables on the website: add purpose, code, and result snippets

2. Fix review comments on report

Week 14 : November 29 2022

1. Demonstrate Popularity results in Topical Cache

2. Present Paper "Exploiting temporal changes in query submission behavior for improving the search engine result cache performance"

Things to do:

1. Report first draft

2. Try to recode in PHP

Week 13 : November 22 2022

1. Demonstrate Topical Cache Implementation

Things to do:

1. Implement Topic Popularity in STD Cache

2. Read paper - "Exploiting temporal changes in query submission behavior for improving the search engine result cache performance"

3. Report first draft

4. Try to recode in PHP

Week 12 : November 15 2022

1. Continue implementing Topical cache

2. Figure out list of query statistics to be generated from Yioop

3. Look at current implementation of cache and figure out changes to be done at Yioop

4. Decide on strategy for static cache

Things to do:

1. Complete Topical Cache Implementation

2. Try to recode in PHP

3. Read paper - "Energy-Efficient Query Processing in Web Search Engines"

Week 11 : November 8 2022

1. Present "Topical result caching in web search engines"

2. Demonstrate Cache Simulator and baseline caching algorithm

3. Discuss training results

Things to do:

1. Start Implementing Topical cache

Week 10 : November 1 2022

1. Extract features from the query logs as per the paper

Things to do:

1. Train linear regression model on the extracted features

2. Implement Cache Simulator and baseline caching algorithms

3. Start working on deliverable 3, read paper "Topical result caching in web search engines"
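As a rough illustration of what a cache simulator with a baseline policy might look like, here is a minimal sketch. It assumes LRU as the baseline and hit ratio as the metric; the notes do not say which baseline algorithms or metrics were actually used, and the names `LRUCache` and `simulate` are hypothetical.

```python
from collections import OrderedDict


class LRUCache:
    """Fixed-capacity query result cache with least-recently-used eviction.

    Assumption: LRU stands in for the unspecified baseline policy.
    """

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self.entries = OrderedDict()  # oldest entry first, newest last

    def lookup(self, query):
        """Return True on a hit; on a miss, admit the query and return False."""
        if query in self.entries:
            self.entries.move_to_end(query)  # mark as most recently used
            return True
        self.entries[query] = True
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict least recently used
        return False


def simulate(queries, cache):
    """Replay a list of query strings through the cache and return the hit ratio."""
    if not queries:
        return 0.0
    hits = sum(1 for q in queries if cache.lookup(q))
    return hits / len(queries)


if __name__ == "__main__":
    log = ["a", "b", "a", "c", "b", "a"]  # toy query log, not real data
    print(simulate(log, LRUCache(capacity=2)))
    print(simulate(log, LRUCache(capacity=3)))
```

Replaying the same log against caches of different sizes, or against a different policy class exposing the same `lookup` method, gives a simple way to compare strategies.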

Week 8 : October 18 2022

1. Demonstrated query cleaning using Python and pandas

Things to do:

1. Complete feature extraction part

Week 7 : October 11 2022

1. Presented paper "A machine learning approach for result caching in web search engines"

Things to do:

1. Start implementing MLSDC approach for deliverable 2

Week 6 : September 27 2022

1. Created patch for media job

2. Presented paper "Cost-Aware Strategies for Query Result Caching in Web Search Engines"

Things to do:

1. Search datasets

2. Read paper "A machine learning approach for result caching in web search engines"

Week 5 : September 20 2022

1. Demonstrated Media Job

2. Demonstrated Crawling

Things to do:

1. Fix review comments in media job

2. Read paper "Cost-Aware Strategies for Query Result Caching in Web Search Engines"

Week 4 : September 13 2022

1. Discussed paper "Scalability Challenges in Web Search Engines"

2. Discussed Media Jobs and Yioop Indexing

Things to do:

1. Run Crawl

2. Query Refresh media job

Week 3 : September 6 2022

1. Formatting changes in proposal

2. Discussed reading material for next week and further

Things to do:

1. Submit slides for paper: B. Cambazoglu and R. Baeza-Yates, "Scalability Challenges in Web Search Engines," in Synthesis Lectures on Information Concepts, Retrieval, and Services, vol. 7, 2011, pp. 27-50. doi: 10.1007/978-3-642-20946-8_2.

2. Read about media jobs and how they are called, and explain an existing media job

3. Read and present Yioop Indexing and Searching

Week 2 : August 29 2022

1. Formatting changes in proposal

2. Concrete algorithm requirement in deliverables

3. Introduced to student portal

Things to do:

1. Mention concrete algorithms of caching

2. Read and present paper: B. Cambazoglu and R. Baeza-Yates, "Scalability Challenges in Web Search Engines," in Synthesis Lectures on Information Concepts, Retrieval, and Services, vol. 7, 2011, pp. 27-50. doi: 10.1007/978-3-642-20946-8_2.

Week 1 : August 24 2022

1. Decide on project topic

2. Discussion on project deliverables

3. Discussion on course requirements and logistics

Things to do:

1. Create project proposal draft

2. Decide 3-4 deliverables