Chris Pollett > Students >
Tim

    ( Print View)

    [Bio]

    [Blog]

    [CS 297 Proposal]

    [CS 298 Proposal]

    [Deliverable 1]

    [Deliverable 2]

    [Deliverable 3]

    [Deliverable 4]

    [Relevance Ranking(JRFL) slides - pdf]

    [Relevance Ranking(cluster) slides - pdf]

    [CS 297 Report.pdf]

    [CS 298 Report.pdf]

    [CS 298 Slides - pdf]

CS297 Proposal

Improved User News Feed Customization for an Open Source Search Engine

Timothy Chow (timothy.chow@sjsu.edu)

Advisor: Dr. Chris Pollett

Description:

Yioop is an open source search engine project hosted on a site of the same name. It offers several features outside of searching, with one such feature being a newsfeed. The current newsfeed system aggregates articles from a curated list of news sites determined by the owner. However in its current state, the feed is limited in size, being able to utilize around 50 sources. One of the goals for my project will be to increase this amount. I will also be implementing the ability for users to personalize the newsfeed for their own use.

Schedule:

Week 1: Feb 5 - Feb 11First meeting
Week 2: Feb 12 - Feb 18Look through the IndexShard class and run preexisting tests to see what it does.
Week 3: Feb 19 - Feb 25Deliverable 1 due: Create additional unit test cases for IndexShard class.
Week 4: Feb 26 - Mar 4Read through Ch. 1 and 2(JRFL) of Relevance Ranking for Vertical Search Engines[3]
Week 5: Mar 5 - Mar 11Read through Ch. 2(clustering) of Relevance Ranking for Vertical Search Engines[3] and finish up presentation.
Week 6: Mar 12 - Mar 18Work on Deliverable 2.
Week 7: Mar 19 - Mar 25Continue work for Deliverable 2.
Week 8: Mar 26 - Apr 1Deliverable 2 due: Implement a word tracker for Yioop that should return the top n keywords and corresponding index after running a crawl job.
Week 9: Apr 2 - Apr 8 Spring Break!!
Week 10: Apr 9 - Apr 15Deliverable 3 due: Prototype a user interface necessary for registered users to filter and add news sources to the feed.
Week 11: Apr 16 - Apr 22Look through the user account code
Week 12: Apr 23 - Apr 29Create simple unit tests for users.
Week 13: Apr 30 - May 6Deliverable 4: Creating a prototype framework for a NewsFeedBundle to store news related IndexShards.
Week 14: May 7 - May 13Work on CS 297 report
Week 15: May 14 - May 20CS 297 report due

Deliverables:

The full project will be done when CS298 is completed. The following will be done by the end of CS297:

1. Create additional unit test cases for IndexShard class.

2. Implement a word tracker for Yioop that should return the top 25 keywords after running a crawl job.

3. Prototype a user interface necessary for registered users to filter and add news sources to the feed.

4. Creating a prototype framework for a NewsFeedBundle to store news related IndexShards.

5. Final CS 297 Report

References:

[1] Jongdeog Lee, Daniel Xu, Md Tanvir Al Amin, Tarek Abdelzaher; iApollo: A Newsfeed Summary Service on NDN; iEEE, 2017.

[2] Nicola Ferro, Yubin Kim, Mark Sanderson; Using Collection Shards to Study Retrieval Performance Effect Sizes; ACM, 2019.

[3] Bo Long, Yi Change; Relevance Ranking for Vertical Search Engines; 2014