Chris Pollett >
Students >
Gargi
( Print View)
[Bio]
[Blog]
[C297 Proposal]
[Paper 1: Large-scale IRLBot crawl PDF]
[Paper 2: Distributed Crawler Architecture PDF]
[Paper 3: Scalability Challenges PDF]
[Paper 4: High Performance Priority Queues PDF]
[Deliverable 1: Yioop Ranking Mechanisms PDF]
[Deliverable 2(ii): Modifying Yioop's UI Editor]
[Deliverable 3: Modifying Yioop's queuing process]
[Deliverable 4: Yandex Signal PDF]
[CS 297 Report PDF]
[C298 Proposal]
[C298: Yandex-inspired Search Factors]
[C298: Latest Page Version in SERP]
[C298: Disjunctive Queries in Yioop Search]
[CS 298 Report PDF]
[CS 298 Report Slides PDF]
|
Project Blog
Week#13 (May 2)
Meeting log:
- Completed Deliverable#4 (Yandex signals research)
- Went over Paper [4]: High-Performance Priority Queues for Parallel Web Crawlers
For the upcoming weeks:
- Submit 297 report
Week#13 (Apr 26)
Meeting log:
- Completed Deliverable#3 (queueing modifications) implementation
For the upcoming weeks:
- Begin with Deliverable#4 (read up on Yandex signals)
- Read [4]: High-performance priority queues
Week#12 (Apr 19)
Meeting log:
- Discussed Deliverable#3 (queueing modifications) implementation
Week#11 (Apr 11)
Meeting log:
- Discussed Deliverable#3 (queueing modifications) implementation
For the upcoming weeks:
- Continue implementation for Deliverable#3
- Fix PHP deprecation warnings in executables/QueueServer.php
Week#9/10 (Apr 4)
Meeting log:
- Discussed strategy to implement Deliverable#3 (queueing modifications)
For the upcoming weeks:
- Upload patch for UI wiki editor fixes
- Start implementation for Deliverable#3
Week#8 (Mar 21)
Meeting log:
- Discussed UI fix (wiki editor) modifications
- Discussed approach to begin Deliverable#3 (queueing modifications)
For the upcoming weeks:
- UI fixes
- Change A/B test documentation to html
- Chalk out strategy for Deliverable#3 (queueing modifications)
Week#7 (Mar 14)
Meeting log:
- Discussed a/b testing for UI fix (wiki editor)
For the upcoming week:
- Upload patch for UI fix and document a/b testing results
Week#6 (Mar 7)
Meeting log:
- Went over Paper [2]: Distributed web crawler architecture and [3]: Scalability Challenges in Web Search Engines summaries
For the upcoming week:
- Begin UI fix for wiki editor (part of Deliverable#2)
Week#5 (Feb 28)
Meeting log:
- Checked PHP 8.2 deprecation bug fix patch
For the upcoming week:
- Read Paper [2]: Distributed web crawler architecture
- Read Paper [3]: Scalability Challenges in Web Search Engines
Week#4 (Feb 21)
Meeting log:
- Went over Paper [1]: Around the web in six weeks: Documenting a large-scale crawl summary
- Discussed how to begin with Deliverable#2 (Fixing PHP 8.2 deprecation bugs)
For the upcoming week:
- Modify and upload summary for Paper [1]: Around the web in six weeks: Documenting a large-scale crawl
- Create account on Yioop's issue tracker
- Begin working on PHP 8.2 deprecation bug fix
Week#3 (Feb 14)
Meeting log:
- Went over Ranking Mechanisms summary
- Discussed possibilities for bug selection for Deliverable#2
For the upcoming week:
- Wrap up Deliverable#1
- Read Paper [1]: Around the web in six weeks: Documenting a large-scale crawl
Week#2 (Feb 7)
Meeting log:
- Set up account and project page
- Finalized project schedule
- Discussed reference papers
- Discussed overview of relevant Yioop files in codebase
For the upcoming week:
- Finish project page on student portal
- Study Yioop's ranking mechanisms and codebase
Week#1 (Jan 31)
Meeting Log:
- Introductory meeting
- Finalized project deliverables, weekly meeting schedule
For the upcoming week:
- Complete proposal
- Find research papers for reference
|