Project Blog
Week 11 : April 11 2023
1. Review KMeans Patch
1. Review Word Encodding
Things to do:
Start Working on Report
Week 8 : March 21 2023
1. Show K-Means Algorithm Implementation
Things to do:
Implement Word Encoding
Week 7 : March 14 2023
1. Topic Model Discussion
Things to do:
Start Implementing K-Means algorithm
Week 6 : March 07 2023
1. Topic Model Discussion
1. Discussed LDA Implementation
1. Discussed K-means Implementation
Things to do:
1. Continue working on topic model
Week 5 : February 28 2023
1. Create code pull request
Things to do:
Start implmenting LDA
Week 4 : February 21 2023
1. Showed performance evaluation of current implmentation
1. Code review of performance evaluation and test classes
Things to do:
1. Fix review comments
2. Create issue on Mantis
3. Work on dataset creation of LDA
Week 3 : February 14 2023
1. Showed partial implementation of performance evaluation module
Things to do:
1. Continue working on implementation
Week 2 : February 07 2023
1. Submit Proposal
Things to do:
1. Start working on implementation
Week 1 : January 31 2023
1. Decide on deliverables
Things to do:
1. Start working on proposal
Week 15 : December 06 2022
1. Demonstrate result of Static-Semi-Static-Dynamic Cache
2. Review CS 297 Report
3. Demonstrate Cache Simulator in PHP
Things to do:
1. Fix deliverables on the website, add purpose and code and result snippets
2. Fix review comments on report
Week 14 : November 29 2022
1. Demonstrate Popularity results in Topical Cache
2. Present Paper "Exploiting temporal changes in query submission behavior for improving the search engine result cache performance"
Things to do:
1. Report first draft
2. Try to recode in php
Week 13 : November 22 2022
1. Demonstrate Topical Cache Implementation
Things to do:
1. Implement Topic Popularity in STD Cache
2. Read paper - "Exploiting temporal changes in query submission behavior for improving the search engine result cache performance"
3. Report first draft
4. Try to recode in php
Week 12 : November 15 2022
1. Continue implementing Topical cache
2. Figure out list of query statistics to be generated from Yioop
3. Look at current implementation of cache and figure out changes to be done at Yioop
4. Decide on strategy for static cache
Things to do:
1. Complete Topical Cache Implementaion
2. Try to recode in php
3. Read paper - "Energy-Efficient Query Processing in Web Search Engines"
Week 11 : November 8 2022
1. Present "Topical result caching in web search engines"
2. Demonstrate Cache Simulator and baseline caching algorithm
3. Discuss training results
Things to do:
1. Start Implementing Topical cache
Week 10 : November 1 2022
1. Extract features from the query logs as per the paper
Things to do:
1. Train linear regression model on the extracted features
2. Implement Cache Simulator and baseline caching algorithms
3. Start working on deliverable 3, read paper "Topical result caching in web search engines"
Week 8 : October 18 2022
1. Demonstrated query cleaning using python and pandas
Things to do:
1. Complete feature extraction part
Week 7 : October 11 2022
1. Presented paper "A machine learning approach for result caching in web search engines"
Things to do:
1. Start implementing MLSDC approach for deliverable 2
Week 6 : September 27 2022
1. Created patch for media job
2. Presented paper Cost-Aware Strategies for Query Result Caching in Web Search Engines
Things to do:
1. Search datasets
2. Read paper "A machine learning approach for result caching in web search engines"
Week 5 : September 20 2022
1. Demonstrated Media Job
2. Demonstrated Crawling
Things to do:
1. Fix review comments in media job
2. Read paper Cost-Aware Strategies for Query Result Caching in Web Search Engines
Week 4 : September 13 2022
1. Discussed Paper 'Scalability Challenges in Web Search Engines'
2. Discussed Media Jobs and Yioop Indexing
Things to do:
1. Run Crawl
2. Query Refresh media job
Week 3 : September 6 2022
1. Formatting changes in proposal
2. Discussed reading material for next week and further
Things to do:
1. Submit Slides for paper - "B. Cambazoglu and R. Baeza-Yates, "Scalability Challenges in Web Search Engines" in Synthesis Lectures on Information Concepts, Retrieval, and Services, vol. 7, 2011, pp. 27-50. doi: 10.1007/978-3-642-20946-8_2."
2. Read Media Jobs, how they are called and explain existing media job
3. Read and present Yioop Indexing and Searching
Week 2 : August 29 2022
1. Formatting changes in proposal
2. Concrete algorithm requirement in deliverables
3. Introduced to student portal
Things to do:
1. Mention concrete algorithms of caching
2. Read and present paper - "B. Cambazoglu and R. Baeza-Yates, "Scalability Challenges in Web Search Engines" in Synthesis Lectures on Information Concepts, Retrieval, and Services, vol. 7, 2011, pp. 27-50. doi: 10.1007/978-3-642-20946-8_2."
Week 1 : August 24 2022
1. Decide on project topic
2. Discussion on project deliverables
3. Discussion on course requirements and logistics
Things to do:
1. Create project proposal draft
2. Decide 3-4 deliverables |