CS297 Proposal

Yioop! Introducing autosuggest and spell check

Sandhya Vissapragada (sandyvissa@gmail.com)

Advisor: Dr. Chris Pollett

Description:

The project aims to add autosuggest, autocomplete, and spell-check suggestions to queries in Yioop, a PHP-based search engine. These features would reduce the user's typing, catch spelling mistakes, and make it easy to repeat an earlier search. Popular commercial search engines search large indices or lists of popular queries within a few milliseconds, so the user sees results pop up while typing. Efficient storage of the indices across many servers keeps these response times low. Yioop currently runs on three machines. This project aims to implement these features without adding load to the servers, thereby keeping query response times to a minimum.
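As a rough illustration of the two kinds of suggestion described above, the sketch below shows one common approach to each: a prefix lookup over a sorted list of past queries for autosuggest, and Levenshtein edit distance for spell check. This is not the design the project commits to, since choosing the algorithms is part of Deliverable 1. The function names, the sample query list, and the distance threshold are all assumptions made for the example, and Python is used only for brevity.

```python
from bisect import bisect_left


def suggest(prefix, sorted_queries, limit=5):
    """Return up to `limit` stored queries that start with `prefix`.

    `sorted_queries` must be sorted, so every match sits in one
    contiguous run beginning at the position found by bisect_left.
    """
    start = bisect_left(sorted_queries, prefix)
    matches = []
    for query in sorted_queries[start:start + limit]:
        if not query.startswith(prefix):
            break
        matches.append(query)
    return matches


def edit_distance(a, b):
    """Levenshtein distance between strings a and b (row-by-row DP)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # delete ch_a
                               current[j - 1] + 1,       # insert ch_b
                               previous[j - 1] + cost))  # substitute
        previous = current
    return previous[-1]


def correct(word, vocabulary, max_distance=2):
    """Return the closest word in a non-empty vocabulary, or None
    if even the closest one is more than max_distance edits away."""
    best = min(vocabulary, key=lambda v: edit_distance(word, v))
    return best if edit_distance(word, best) <= max_distance else None


# Hypothetical query log, invented for this example.
QUERIES = sorted(["search engine", "search index", "seattle weather", "yioop"])

if __name__ == "__main__":
    print(suggest("sea", QUERIES))
    print(correct("serch", ["search", "index", "yioop"]))
```

The sorted-list lookup takes a logarithmic number of comparisons per keystroke, while the spell-check sketch compares the word against the whole vocabulary, so a real implementation would likely need an index structure that limits the candidates examined.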

Schedule:

Week 1 (Feb. 1-7): Discuss the project in detail with the advisor
Week 2 (Feb. 8-14): Install Yioop and understand how it works
Weeks 3-4 (Feb. 15-28): Deliverable 1: Study the algorithms for implementing autosuggest and spell check on queries
Week 5 (Feb. 29-Mar. 6): Deliverable 2: Conduct experiments to estimate how other commercial search engines implement autosuggest functionality
Weeks 6-7 (Mar. 7-20): Deliverable 3: Code the drop-down functionality for Yioop using JavaScript
Week 8 (Mar. 21-27): Study the indexing implemented in Yioop in detail
Weeks 9-10 (Mar. 28-Apr. 10): Deliverable 4: Modify the index storage
Weeks 11-12 (Apr. 11-24): Start implementing the autocomplete algorithm
Weeks 13-14 (Apr. 25-May 8): Work on the CS297 report
Week 15 (May 9-15): Deliverable 5 due: CS297 report

Deliverables:

The full project will be done when CS298 is completed. The following will be done by the end of CS297:

1. Study algorithms useful for implementing the autosuggest functionality

2. Estimate, based on server hits, how other commercial search engines implement autocomplete and spell check

3. Code the drop-down list that suggests relevant searches to the user

4. Modify the index storage in a way that helps meet the project goals

5. Project write-up for CS297

References:

[1] Information Retrieval: Implementing and Evaluating Search Engines. Stefan Büttcher, Charles L. A. Clarke and Gordon V. Cormack. The MIT Press. 2010.

[2] Information Retrieval: Searching in the 21st Century. Ayse Goker, John Davies. John Wiley and Sons. 2009.

[3] Methods and systems for implementing auto-complete in a web page. United States Patent No. 7185271 B2. 2007.