Project Blog

Week 11 : April 11 2023
1. Review K-Means patch
2. Review word encoding
Things to do:
1. Start working on report

Week 8 : March 21 2023
1. Show K-Means algorithm implementation
Things to do:
1. Implement word encoding

Week 7 : March 14 2023
1. Topic model discussion
Things to do:
1. Start implementing K-Means algorithm

Week 6 : March 07 2023
1. Topic model discussion
2. Discussed LDA implementation
3. Discussed K-Means implementation
Things to do:
1. Continue working on topic model

Week 5 : February 28 2023
1. Create code pull request
Things to do:
1. Start implementing LDA

Week 4 : February 21 2023
1. Showed performance evaluation of current implementation
2. Code review of performance evaluation and test classes
Things to do:
1. Fix review comments
2. Create issue on Mantis
3. Work on dataset creation for LDA

Week 3 : February 14 2023
1. Showed partial implementation of performance evaluation module
Things to do:
1. Continue working on implementation

Week 2 : February 07 2023
1. Submit proposal
Things to do:
1. Start working on implementation

Week 1 : January 31 2023
1. Decide on deliverables
Things to do:
1. Start working on proposal

Week 15 : December 06 2022
1. Demonstrate result of Static-Semi-Static-Dynamic Cache
2. Review CS 297 report
3. Demonstrate Cache Simulator in PHP
Things to do:
1. Fix deliverables on the website; add purpose, code, and result snippets
2. Fix review comments on report

Week 14 : November 29 2022
1. Demonstrate popularity results in Topical Cache
2. Present paper "Exploiting temporal changes in query submission behavior for improving the search engine result cache performance"
Things to do:
1. Report first draft
2. Try to recode in PHP

Week 13 : November 22 2022
1. Demonstrate Topical Cache implementation
Things to do:
1. Implement topic popularity in STD Cache
2. Read paper - "Exploiting temporal changes in query submission behavior for improving the search engine result cache performance"
3. Report first draft
4. Try to recode in PHP

Week 12 : November 15 2022
1. Continue implementing Topical Cache
2. Figure out list of query statistics to be generated from Yioop
3. Look at current implementation of cache and figure out changes to be made in Yioop
4. Decide on strategy for static cache
Things to do:
1. Complete Topical Cache implementation
2. Try to recode in PHP
3. Read paper - "Energy-Efficient Query Processing in Web Search Engines"

Week 11 : November 8 2022
1. Present "Topical result caching in web search engines"
2. Demonstrate Cache Simulator and baseline caching algorithm
3. Discuss training results
Things to do:
1. Start implementing Topical Cache

Week 10 : November 1 2022
1. Extract features from the query logs as per the paper
Things to do:
1. Train linear regression model on the extracted features
2. Implement Cache Simulator and baseline caching algorithms
3. Start working on deliverable 3; read paper "Topical result caching in web search engines"

Week 8 : October 18 2022
1. Demonstrated query cleaning using Python and pandas
Things to do:
1. Complete feature extraction part

Week 7 : October 11 2022
1. Presented paper "A machine learning approach for result caching in web search engines"
Things to do:
1. Start implementing MLSDC approach for deliverable 2

Week 6 : September 27 2022
1. Created patch for media job
2. Presented paper "Cost-Aware Strategies for Query Result Caching in Web Search Engines"
Things to do:
1. Search datasets
2. Read paper "A machine learning approach for result caching in web search engines"

Week 5 : September 20 2022
1. Demonstrated media job
2. Demonstrated crawling
Things to do:
1. Fix review comments in media job
2. Read paper "Cost-Aware Strategies for Query Result Caching in Web Search Engines"

Week 4 : September 13 2022
1. Discussed paper "Scalability Challenges in Web Search Engines"
2. Discussed media jobs and Yioop indexing
Things to do:
1. Run crawl
2. Query Refresh media job

Week 3 : September 6 2022
1. Formatting changes in proposal
2. Discussed reading material for next week and beyond
Things to do:
1. Submit slides for paper - B. Cambazoglu and R. Baeza-Yates, "Scalability Challenges in Web Search Engines," in Synthesis Lectures on Information Concepts, Retrieval, and Services, vol. 7, 2011, pp. 27-50. doi: 10.1007/978-3-642-20946-8_2.
2. Read about media jobs and how they are called, and explain an existing media job
3. Read and present Yioop indexing and searching

Week 2 : August 29 2022
1. Formatting changes in proposal
2. Concrete algorithm requirement in deliverables
3. Introduced to student portal
Things to do:
1. Mention concrete caching algorithms
2. Read and present paper - B. Cambazoglu and R. Baeza-Yates, "Scalability Challenges in Web Search Engines," in Synthesis Lectures on Information Concepts, Retrieval, and Services, vol. 7, 2011, pp. 27-50. doi: 10.1007/978-3-642-20946-8_2.

Week 1 : August 24 2022
1. Decide on project topic
2. Discussion on project deliverables
3. Discussion on course requirements and logistics
Things to do:
1. Create project proposal draft
2. Decide on 3-4 deliverables