CS298 Proposal
Smart Search: A Firefox Add-On to Compute a Web Traffic Ranking
Vijaya Pamidi (vijaya_pamidi@yahoo.com)
Advisor: Dr. Chris Pollett
Committee Members: Dr. T. Y. Lin and Dr. Robert K. Chun
Abstract:
The main goal of this project is to improve the Web traffic ranking for the Yioop search engine and to give user-centric results.
Search engine results are presented to the user according to the ranking and indexing strategies of the engine's search algorithm. Tools such as www.alexa.com, www.ranking.com, and www.compete.com currently provide analytic data for ranking web sites by web traffic, that is, by the number of users who visit a site. Alexa computes the traffic rank of a web site from two factors: the number of users who view the site and the number of pages viewed. This project will create Smart Search, a Firefox add-on for the Yioop search engine, an open source search engine developed by my project advisor, Dr. Chris Pollett. The add-on will provide similar analytic data to Yioop. Smart Search considers three main factors to rank a web site:
- The number of users who view a web site.
- The probability that a user leaves another web site to enter the current one, and the probability that the user clicks a link provided on the current page.
- The user's review of a result: a user who finds a particular result useful clicks the + sign, which increments the rank of that page or web site.
Smart Search takes these three factors into account and periodically sends the collected data to the Yioop search engine. Yioop uses the data received from the Smart Search tool to refine its search results, so users ultimately benefit from better results.
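As an illustration only, the three factors could be combined into a single score as sketched below. The proposal does not specify a formula, so the weighted sum, the weights, the normalizations, and all names (`SiteStats`, `smart_score`) are assumptions made for this sketch.

```python
from dataclasses import dataclass


@dataclass
class SiteStats:
    """Hypothetical per-site counters collected by the add-on."""
    visitors: int        # factor 1: users who viewed the site
    inbound_visits: int  # factor 2a: arrivals from some other site
    link_clicks: int     # factor 2b: clicks on links on the site's pages
    page_views: int      # total page views, used to estimate probabilities
    plus_votes: int      # factor 3: number of "+" clicks


def smart_score(stats, total_visitors, w_traffic=0.5, w_nav=0.3, w_votes=0.2):
    """Combine the three factors into a score in [0, 1].

    The weights and the normalization are assumptions, not part of the proposal.
    """
    traffic = stats.visitors / total_visitors if total_visitors else 0.0
    if stats.page_views:
        p_enter = stats.inbound_visits / stats.page_views
        p_click = stats.link_clicks / stats.page_views
    else:
        p_enter = p_click = 0.0
    navigation = (p_enter + p_click) / 2.0
    votes = min(1.0, stats.plus_votes / stats.visitors) if stats.visitors else 0.0
    return w_traffic * traffic + w_nav * navigation + w_votes * votes


example = SiteStats(visitors=50, inbound_visits=20, link_clicks=10,
                    page_views=100, plus_votes=5)
print(smart_score(example, total_visitors=200))
```

A weighted sum keeps each factor's contribution easy to inspect and tune during the result study; other combinations, such as a product or a learned model, would be equally compatible with the description above.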
CS297 Results
- Modeled Google's PageRank algorithm on a 10×10 matrix.
- Built a simple Firefox extension with alert functionality.
- Captured user activity through a XUL toolbar button.
- Sent the captured links to Yioop.
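For readers unfamiliar with the first result, the following is a minimal textbook sketch of PageRank by power iteration on a small link graph. It is not the CS297 code; the damping factor of 0.85, the convergence tolerance, and the 10-page ring used as the example graph are assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Return PageRank scores; links[i] lists the pages that page i links to."""
    n = len(links)
    rank = [1.0 / n] * n
    for _ in range(max_iter):
        # Every page receives the teleportation share first.
        new = [(1.0 - damping) / n] * n
        for i, outs in enumerate(links):
            if outs:
                share = damping * rank[i] / len(outs)
                for j in outs:
                    new[j] += share
            else:
                # Dangling page: spread its rank uniformly over all pages.
                share = damping * rank[i] / n
                for j in range(n):
                    new[j] += share
        if sum(abs(a - b) for a, b in zip(new, rank)) < tol:
            return new
        rank = new
    return rank


# A 10-page ring, equivalent to a 10x10 link matrix: page i links to page i+1.
ring = [[(i + 1) % 10] for i in range(10)]
print(pagerank(ring))
```

Because every page in the ring has one inbound and one outbound link, all ten pages end up with the same rank; a less symmetric graph produces differentiated scores.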
Proposed Schedule
Week 1 | Jan 26 - Feb 1 | Get the CS298 proposal approved and upload it.
Week 2 | Feb 2 - Feb 8 | Study Firefox's SQLite support and design the database for data captured on the user's end.
Week 3-4 | Feb 9 - Feb 22 | Capture user activity for all three factors; build a configuration panel for the toolbar.
Week 5 | Feb 23 - March 1 | Send the captured data periodically to Yioop.
Week 6-7 | March 2 - March 8 | Re-rank the Yioop index based on the user results received.
Week 8 | March 9 - March 15 | Send back results based on the new ranking when a search query is given to Yioop.
Week 9 | March 16 - March 22 | Conduct a result study and do the statistical analysis.
Week 10 | March 23 - March 29 | Prepare a report of the results.
Week 11 | March 30 - April 5 | Work on the CS298 report.
Week 12 | April 6 - April 12 | Submit the first draft of the CS298 report.
Week 13 | April 13 - April 19 | Complete the report for committee review.
Week 14-15 | April 20 - May 3 | Submit the final report and prepare presentation slides.
Week 16 | May 4 - May 10 | Present to the committee.
Key Deliverables:
- Software
  - Deliverable 1: A Firefox component that uses an SQLite database to store the search history captured on the user's end.
  - Deliverable 2: A component that sends data from Firefox to Yioop periodically.
  - Deliverable 3: A Yioop component that refines the ranking algorithm based on the received data.
  - Deliverable 4: A component that gives refined results to users for their search queries.
- Report
  - Deliverable 5: CS298 project report.
  - Deliverable 6: Project code documentation.
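To make Deliverables 1 and 2 more concrete, the sketch below shows one way captured visits could be stored in SQLite and collected into batches for periodic upload. It uses Python's `sqlite3` module purely for illustration; the actual add-on would run inside Firefox, and the table layout, column names, and function names here are all hypothetical.

```python
import sqlite3
import time

# Hypothetical schema: one row per captured page visit.
SCHEMA = """
CREATE TABLE IF NOT EXISTS visits (
    id         INTEGER PRIMARY KEY AUTOINCREMENT,
    url        TEXT NOT NULL,
    referrer   TEXT,
    plus_vote  INTEGER NOT NULL DEFAULT 0,
    visited_at REAL NOT NULL,
    sent       INTEGER NOT NULL DEFAULT 0
)
"""


def open_store(path=":memory:"):
    """Open (or create) the local store; the default is an in-memory database."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def record_visit(conn, url, referrer=None, plus_vote=False):
    """Store one visit, with the page it came from and whether "+" was clicked."""
    conn.execute(
        "INSERT INTO visits (url, referrer, plus_vote, visited_at) VALUES (?, ?, ?, ?)",
        (url, referrer, int(plus_vote), time.time()),
    )
    conn.commit()


def pending_batch(conn, limit=100):
    """Return up to `limit` visits that have not yet been sent to the server."""
    cur = conn.execute(
        "SELECT id, url, referrer, plus_vote FROM visits "
        "WHERE sent = 0 ORDER BY id LIMIT ?",
        (limit,),
    )
    return cur.fetchall()


def mark_sent(conn, ids):
    """Flag rows as sent once the server has acknowledged the batch."""
    conn.executemany("UPDATE visits SET sent = 1 WHERE id = ?", [(i,) for i in ids])
    conn.commit()


conn = open_store()
record_visit(conn, "http://a.example/")
record_visit(conn, "http://b.example/", referrer="http://a.example/", plus_vote=True)
batch = pending_batch(conn)
print(batch)
mark_sent(conn, [row[0] for row in batch])
```

Keeping a `sent` flag rather than deleting rows after upload means a failed transmission can simply be retried on the next period, and bounded batches limit the load each toolbar instance places on the server.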
Innovations and Challenges
- Building a Firefox traffic toolbar that communicates with the open source Yioop search engine is innovative, because existing traffic toolbars communicate with closed source search engines.
- Many installed instances of our toolbar may send results to the Yioop server simultaneously, so making the Yioop server scale to handle these results is challenging.
- Indexing these results on Yioop is challenging.
References:
[LM2006] Amy N. Langville and Carl D. Meyer. Google's PageRank and Beyond. Princeton University Press. 2006.
[GBM2009] Gavin Bell. Building Social Web Applications. O'Reilly Media. 2009.
[KF2007] Kenneth C. Feldt. Programming Firefox: Building Rich Internet Applications with XUL. O'Reilly. 2007.
[SH1998] Steven Holzner. JavaScript Complete. 1998.
Building an Extension. Official Mozilla page. http://developer.mozilla.org/en/docs/Building_an_Extension