Chris Pollett > Students >
Khang

    ( Print View )

    [Bio]

    [Project Blog]

    [CS297 Proposal]

    [Relational XML-PPT]

    [tsearch2-PPT]

    [ProcessXML-PPT]

    [Del1]

    [Del2]

    [CS297Report-PDF]

    [CS298Proposal]

                          

























CS297 Proposal

Enhancing XML Support in PostgreSQL

Khang Nguyen (khangng@yahoo.com)

Advisor: Dr. Chris Pollett

Description:

PostgreSQL is a database management system derived from the POSTGRES system developed at UC Berkeley. PostgreSQL is one of most popular open source databases available today. Currently, there are tools available to import XML data into PostgreSQL tables. However, PostgreSQL does not support XML data natively. The purpose of this project will be to extend PostgreSQL to natively store XML data and to implement some recent algorithms to allow for efficient XPath based retrieval of stored XML data.

Schedule:

Week 1: 01/29 - 02/04 Read [IT02]
Week 2: 02/05 - 02/11 Read chapter 25 from [KD06], how tsearch2 works
Week 3: 02/12 - 02/18 Programming Deliverable(1)
Week 4: 02/19 - 02/25 Deliverable(1) due
Week 5: 02/26 - 03/04 Read [RK03]
Week 6: 03/05 - 03/11 Read [SL02]
Week 7: 03/12 - 03/18 Programming Deliverable(2)
Week 8: 03/19 - 03/25 Deliverable(2) due
Week 9: 03/26 - 04/01 Read chapter 49 (GiST Indexing) from [PDG05]
Week 10: 04/02 - 04/08 Read [JS]
Week 11: 04/09 - 04/15 Programming Deliverable(3)
Week 12: 04/16 - 04/22 Deliverable(3) due
Week 13: 04/23 - 04/29 Read [JS99]
Week 14: 04/30 - 05/06 Writing Deliverable(4)
Week 15: 05/07 - 05/13 Writing Deliverable(4)
Week 16: 05/14 - 05/20 Deliverable(4) due

Deliverables:

The full project will be done when CS298 is completed. The following will be done by the end of CS297:

1. Use PostgreSQL XML2 package to import Dr. Pollett’s digital library into PostgreSQL database.

2. Write a program based on the stored data to do text search.

3. Store documents in tree-formats, using PostgreSQL indexing mechanism.

4. The CS297 project report.

References:

[IT02] Igor Tatarinov, Statis D. Viglas. Storing and Querying Ordered XML Using a Relational Database System. ACM SIGMOD. 2002. http://www.cs.cornell.edu/people/jai/papers/OrderedXML.pdf

[KD06] Korry Douglas, Susan Douglas. PostgreSQL. Sams Publishings. 2006.

[RK03] Rajasekar Krishnamurthy. Processing XML data using a relational database : Schema-Based XML Storage. ICDT. 2003. Processing XML data using a relational database

[SL02] Shiyong Lu, Yezhou Sun, Mustafa Atay, Farshad Fotouhi. A New Inlining Algorithm for Mapping XML DTDs to Relational Schemas. ANSI. 2002. http://wwwedit.cs.wayne.edu:8080/~shiyong/papers/xsdm02.pdf

[PDG05] Postgres Developement Group. PostgreSQL 8.1.0 Documentation. 2005.

[JS] Jayavel Shanmugasundaram. A General Technique for Querying XML Documents using a Relational Database System. IBM.

[JS99] Jayavel Shanmugasundaram. Relational Databases for Querying XML Documents: Limitations and Opportunities. University of Wisconsin-Madison. 1999