CS297 Proposal
Enhancing XML Support in PostgreSQL
Khang Nguyen (khangng@yahoo.com)
Advisor: Dr. Chris Pollett
Description:
PostgreSQL is a database management system derived from the POSTGRES system
developed at UC Berkeley. PostgreSQL is one of most popular open source
databases available today. Currently, there are tools available to import XML
data into PostgreSQL tables. However, PostgreSQL does not support XML data
natively. The purpose of this project will be to extend PostgreSQL to natively
store XML data and to implement some recent algorithms to allow for efficient
XPath based retrieval of stored XML data.
Schedule:
Week 1: 01/29 - 02/04 |
Read [IT02] |
Week 2: 02/05 - 02/11 |
Read chapter 25 from [KD06], how tsearch2 works |
Week 3: 02/12 - 02/18 |
Programming Deliverable(1) |
Week 4: 02/19 - 02/25 |
Deliverable(1) due |
Week 5: 02/26 - 03/04 |
Read [RK03] |
Week 6: 03/05 - 03/11 |
Read [SL02] |
Week 7: 03/12 - 03/18 |
Programming Deliverable(2) |
Week 8: 03/19 - 03/25 |
Deliverable(2) due |
Week 9: 03/26 - 04/01 |
Read chapter 49 (GiST Indexing) from [PDG05] |
Week 10: 04/02 - 04/08 |
Read [JS] |
Week 11: 04/09 - 04/15 |
Programming Deliverable(3) |
Week 12: 04/16 - 04/22 |
Deliverable(3) due |
Week 13: 04/23 - 04/29 |
Read [JS99] |
Week 14: 04/30 - 05/06 |
Writing Deliverable(4) |
Week 15: 05/07 - 05/13 |
Writing Deliverable(4) |
Week 16: 05/14 - 05/20 |
Deliverable(4) due |
Deliverables:
The full project will be done when CS298 is completed. The following will be
done by the end of CS297:
1. Use PostgreSQL XML2 package to import Dr. Pollett’s digital library into
PostgreSQL database.
2. Write a program based on the stored data to do text search.
3. Store documents in tree-formats, using PostgreSQL indexing mechanism.
4. The CS297 project report.
References:
[IT02] Igor Tatarinov, Statis D. Viglas. Storing and Querying Ordered XML Using
a Relational Database System. ACM SIGMOD. 2002.
http://www.cs.cornell.edu/people/jai/papers/OrderedXML.pdf
[KD06] Korry Douglas, Susan Douglas. PostgreSQL. Sams Publishings. 2006.
[RK03] Rajasekar Krishnamurthy. Processing XML data using a relational database
: Schema-Based XML Storage. ICDT. 2003. Processing XML data using a relational
database
[SL02] Shiyong Lu, Yezhou Sun, Mustafa Atay, Farshad Fotouhi. A New Inlining
Algorithm for Mapping XML DTDs to Relational Schemas. ANSI. 2002.
http://wwwedit.cs.wayne.edu:8080/~shiyong/papers/xsdm02.pdf
[PDG05] Postgres Developement Group. PostgreSQL 8.1.0 Documentation. 2005.
[JS] Jayavel Shanmugasundaram. A General Technique for Querying XML Documents
using a Relational Database System. IBM.
[JS99] Jayavel Shanmugasundaram. Relational Databases for Querying XML
Documents: Limitations and Opportunities. University of Wisconsin-Madison. 1999
|