CS297 Proposal
Improved User News Feed Customization for an Open Source Search Engine
Timothy Chow (timothy.chow@sjsu.edu)
Advisor: Dr. Chris Pollett
Description:
Yioop is an open source search engine project hosted on a site of the same name. It offers several features outside of searching, with one such feature being a newsfeed. The current newsfeed system aggregates articles from a curated list of news sites determined by the owner. However in its current state, the feed is limited in size, being able to utilize around 50 sources. One of the goals for my project will be to increase this amount. I will also be implementing the ability for users to personalize the newsfeed for their own use.
Schedule:
Week 1:
Feb 5 - Feb 11 | First meeting |
Week 2:
Feb 12 - Feb 18 | Look through the IndexShard class and run preexisting tests to see what it does. |
Week 3:
Feb 19 - Feb 25 | Deliverable 1 due: Create additional unit test cases for IndexShard class. |
Week 4:
Feb 26 - Mar 4 | Read through Ch. 1 and 2(JRFL) of Relevance Ranking for Vertical Search Engines[3] |
Week 5:
Mar 5 - Mar 11 | Read through Ch. 2(clustering) of Relevance Ranking for Vertical Search Engines[3] and finish up presentation. |
Week 6:
Mar 12 - Mar 18 | Work on Deliverable 2. |
Week 7:
Mar 19 - Mar 25 | Continue work for Deliverable 2. |
Week 8:
Mar 26 - Apr 1 | Deliverable 2 due: Implement a word tracker for Yioop that should return the top n keywords and corresponding index after running a crawl job. |
Week 9:
Apr 2 - Apr 8 | Spring Break!! |
Week 10:
Apr 9 - Apr 15 | Deliverable 3 due: Prototype a user interface necessary for registered users to filter and add news sources to the feed. |
Week 11:
Apr 16 - Apr 22 | Look through the user account code |
Week 12:
Apr 23 - Apr 29 | Create simple unit tests for users. |
Week 13:
Apr 30 - May 6 | Deliverable 4: Creating a prototype framework for a NewsFeedBundle to store news related IndexShards. |
Week 14:
May 7 - May 13 | Work on CS 297 report |
Week 15:
May 14 - May 20 | CS 297 report due |
Deliverables:
The full project will be done when CS298 is completed. The following will
be done by the end of CS297:
1. Create additional unit test cases for IndexShard class.
2. Implement a word tracker for Yioop that should return the top 25 keywords after running a crawl job.
3. Prototype a user interface necessary for registered users to filter and add news sources to the feed.
4. Creating a prototype framework for a NewsFeedBundle to store news related IndexShards.
5. Final CS 297 Report
References:
[1] Jongdeog Lee, Daniel Xu, Md Tanvir Al Amin, Tarek Abdelzaher; iApollo: A Newsfeed Summary Service on NDN; iEEE, 2017.
[2] Nicola Ferro, Yubin Kim, Mark Sanderson; Using Collection Shards to Study Retrieval Performance Effect Sizes; ACM, 2019.
[3] Bo Long, Yi Change; Relevance Ranking for Vertical Search Engines; 2014
|