Chris Pollett > Students > Niravkumar
[Bio] [Blog] [Question Answering System Patch - PATCH] |
CS297 ProposalQuestion Answering SystemNiravkumar Patel (niravkumar.patel1989@gmail.com) Advisor: Dr. Chris Pollett Description: Yioop is an open source search engine developed and managed by Dr. Christopher Pollett. Currently, when a query is searched, it suggests relevant documents based on the query. A summarizer is a process which extracts creates a short summary from a potentially long text document. The Yioop crawler when processing pages runs a summarizer and then only index the contents of the summary it produces. There are times when a user queries or tries to search for specific information. So the information in this summary part can be used to answer those queries. However, the summary itself might not have sentences arranged as question answer pairs.
Schedule:
Deliverables: The project would be considered as done when CS 298 is completed. The following will be completed by the end of CS 297: 1. A test set for Question-Answering System 2. A stemmer for Portuguese Locale 3. A basic implementation of parse tree generation 4. A basic implementation of extraction of triplet 5. CS 297 Report References: Yioop Documentation: Yioop Documentation [2015] Portuguese stemming algorithm. Portuguese Stemmer 2015 [2015] Information Extraction From Text Information Extraction From Text by Steven Bird, Ewan Klein, and Edward Loper 2015 [2007] Triplet Extraction From Sentences by Delia Rusu*, Lorand Dali*, Blaž Fortuna°, Marko Grobelnik°, Dunja Mladenić°. 2007. [2003] Integrating Web-based and Corpus-based Techniques for Question Answering by Boris Katz, Jimmy Lin, Daniel Loreto, Wesley Hildebrandt, Matthew Bilotti, Sue Felshin, Aaron Fernandes, Gregory Marton, Federico Mora. TREC 2003. [2001] Gathering Knowledge for a Question Answering System from Heterogeneous Information Sources by Boris Katz and Jimmy Lin and Sue Felshin. ACL Workshop 2001. |