CS 157B - Data Base Management System

Spring 2009


Green Sheet
Course Schedule
Midterms and Final
Ø      Midterm 1  Tuesday, February 17, 2009. Do not come late!!!
Ø      Midterm 2  Thursday, March 5, 2009. Do not come late!!!
Ø      Midterm 3  Tuesday, April 28, 2009. Do not come late!!!
Ø      Final Exam Review/Study Guide  Tuesday, May 19, 2009. Do not come late!!!
Homework and Program Assignments
Ø      Ø      Program Assignment 1  Due Thursday, February 26, 2009.
Ø      Ø      Program Assignment 2  Due Tuesday, April 2, 2009
Ø      Ø      Program Assignment 3  Due Friday, May 1, 2009
Ø      Ø    Earthquake example 1  ProjectReport example 1     (run the batch file inside the example zip file)
Ø      Ø    Earthquake example 2  ProjectReport example 2     (run the batch file inside the example zip file)
Ø      Ø    Earthquake example 3  ProjectReport example 3     (run the batch file inside the example zip file)
Lecture Notes:
Ø      Ø                Lecture 1 (1/27/2009)
 
Ø      Ø                Obtaining LaTeX: http://www.latex-project.org/ftp.html
 
Ø      Ø                LaTeX Documentation: http://www.latex-project.org/guides/
 
Ø      Ø                Using PowerPoint [Spring 09]
 
Ø      Ø                How to Draw Entity Relationship Diagrams using PowerPoint [Spring 09]
 
Ø      Ø                Lecture 2 (1/29)
 
Ø      Ø                Lecture 3 (2/3) 
 
Ø      Ø                Neural Networks (Robert Correll)
 
Ø      Ø                Lecture 4 (2/5)
 
                                      Practice assignment 1 (2/5)
 
Ø      Ø                Lecture 5 (2/10)
 
                                      Practice assignment 2 (2/7)
 
Ø      Ø         Lecture 6 (2/12)
 
Ø      Ø                Introduction to MySQL (Jianshi Chen)
Ø      Ø                SQL
 
Ø      Ø                Lecture 7 (2/19)
 
Ø      Ø                Functional Dependency Theory (Kat Powell)
Ø      Ø                Entity Relationship Diagrams (Kelvin Nishikawa)
Ø      Ø                 Decision Tree (Ray Hopper)
                                             
Ø      Ø                Lecture 8 (2/24)
 
Ø      Ø                 1NF, 2NF, 3NF, and Normalization (Rajhdeep Jandir)
Ø      Ø                 BCNF (Mabel Thong)
 
Ø      Ø                Lecture 9 (2/26)
 
Ø      Ø                Lecture 10 (3/3)
 
Ø      Ø                Lecture 11 (3/10)
 
Ø      Ø                Lecture 12 (3/12)
 
Ø      Ø                Lecture 13 (3/17)
 
Ø      Ø                Lecture 14 (3/19)
 
Ø      Ø                Data Mining (Sangeeta Devadiga)
Ø      Ø                Online Analytical Processing (Hweichao Lu)
 
Ø      Ø                Lecture 15 Association Rule 2 (4/2)
 
Ø      Ø                Lecture 16 (4/7)
 
Ø      Ø               Object-Oriented Database Management System (Nguyen, Yen P)
 
Ø      Ø               Data Mining Association rule (Nathanael Chow)
 
Ø      Ø                Lecture 17 (4/9)
 
Ø      Ø              Data Warehousing (Alex Ostrovsky)
 
Ø      Ø                Lecture 18 (4/14)
 
Ø      Ø               Clustering in Data Mining (Phuong Tran)
 
Ø      Ø                Lecture 19 (4/16)
 
Ø      Ø           Lecture 20 (4/21)
 
Ø      Ø               Data Mining with Clustering (Rajgopal)
 
Ø      Ø                Lecture 21 (4/23)
 
Ø      Ø                Data Mining and Clustering (Benjamin Lam)
 
Ø      Ø                Lecture 22 (4/30)
 
Ø      Ø               ID3 Algorithm (Medha, Pradhan)
 
Ø      Ø                Lecture 23 (5/5)
 
Ø      Ø               Hashing (Gogoi, Trideep)
 
Ø      Ø                Lecture 24 (5/6)
 
                                    Earthquake Prediction Methods (Jason Long)
                                    Clustering (Timofey Shulepov)
 
Ø      Ø                Lecture 25 (5/8)
Ø      Ø                Iterative Dichotomiser (ID3) Algorithm (Phuong H. Nguyen)
Ø      Ø                ID3 Algorithm (Allan Neymark)
Ø      Ø                Indexing (Vu Phan)
 
Ø      Ø                Lecture 26 (5/10)
 
Ø      Ø                Clustering revision (Falguni Negandhi)
Ø      Ø                Apriori Algorithm Review for Finals (Gaurang Negandhi)
Ø      Ø                ID3 Algorithm (Phuong H. Nguyen)
Ø      Ø                Predict Earthquakes (Lois Desplat)
 
Ø      Ø                Transaction concept
Ø      Ø                Concurrency Control
Ø      Ø                Query Processing
Ø      Ø                Two Phase Locking
Ø      Ø                Distributed Database 1
Ø      Ø                Distributed Database 2
Ø      Ø                Object Oriented Database
Ø      Ø                Data Warehouse
Ø      Ø                Data Mining Knowledge (Posted 12/2/2003)
 
 
Ø      Ø                Normalization
 
                                      Normalization
                                      Lossless Decomposition
                                      Normalization Forms
                                      Fourth Normal Form
                                      Dependency Preservation
                                      Fourth Normal Form Continued
 
Ø      Ø                Distributed Database
 
                                      Distributed Database 1
                                      Distributed Database 2
                                      Distributed Database 3
 
Ø      Ø                Fundamentals and Basic SQL  
 
Ø      Ø                B-Trees and B+ Trees
 
                                      B-Trees
                                      B-Trees Continued
                                      B+ and B-Trees
                                      Animation of B+ Tree
 
Ø      Ø                Midterm 2 - Revision  
Ø      Ø                Midterm 3 - Revision (Updated 11/6/2003)  
 
Ø      Ø                Object Oriented Database
 
                                      Object Oriented Database
                                      Object Oriented Database Cont.
 
Ø      Ø                Transaction Concepts  
 
Ø      Ø                Motivation of Obj. Oriented DB
 
Ø      Ø                Data Mining
 
                                      Data Mining 1
                                      Data Mining 2
                                      Data Mining 3
                                      Data Mining 4
 
Ø      Ø                Data Mining - Association Rules
 
                                      Association Rules 2
 
  
 Reading Homework
Ø                Ø          Reading Assignment 1  
Ø                Ø                Reading Assignment 2 [R. Agrawal]
Homework Submission
Ø                Ø                Program Assignment 1
Ø                Ø                Program Assignment 2  
Ø                Ø                Program Assignment 3
 
  
 
Student Presentations Spring 2009
Ø      Blue Collar Database by David Twamley (2/5)
Ø      Relational Data Model by Lei Li (2/12)
Ø      1NF 2NF 3NF by Vishal Punjabi (2/12)
Ø      Using Neural Network in Data Mining by Tino Jimenez (2/19)
Ø      DataMining and Association Rules by Dongyi Jia (2/19)
Ø      Intro to SQL & PHP by David Sands (2/24)
Ø      Relational Data Model by Nidhi Patel (2/24)
Ø      Clustering Algorithms by Michael Smaili (2/26)
Ø      Clustering by Jonathan Silva (3/17)
Ø      4NF and Multivalue Dependency by Kristina Miguel
Ø      Database Encapsulation by Devon Wright
Ø      Data Mining by Farzana Forhad
Ø      Minimum Spanning Trees by Arya Soltanieh
Ø      Spatial Data Mining by Satoru Hozumi
Ø      Decision Trees by Asher Moody
 
 
  
 
Student Presentations Spring 2008
            Chapter 5.1 5.2 by Brian Cobarrubia (1/31)
Chapter 5.3 5.4 by Deborah Costa (1/31)
Advanced Data Models by Richard Fleischman & Sharon Young (2/7)
Chapter 6.1 by Li Ch'i OOi (2/5)
 
Relational Data Model by Yue Lu (2/14)
 
Techniques for Manipulating Relational Data Model by Herbert A. Evans (2/19)
 
SQL by Soid Quintero & Ervi Bongso (2/26)
 
SQL by David Wortham (2/28)
 
SQL by Chris Nelson (2/28)
 
Chapter 18 by Ali Parandian & Ashira Khera (3/11)
 
Query Optimization by Dennis Le & Weishan Wang (3/18)
 
Database Transactions by Charles Balkon (4/1)
 
Reliability and Security by Samuel Njoroge (4/15)
 
Iterative Dichotomiser 3 by Christopher Archibald (4/17)
 
Data Compression by Keerthi Gundapaneni (4/17)
 
OODB by Shantanu Narang (4/29)
 
Lossless Decomposition by Kenneth Leung (5/06)
 
Reliability and Security by Neyha Amar (5/06)
 
ID3 and Decision Tree by Tuan Nguyen (5/06)
 
Distributed Databases by Jorge Pombar (5/08)
 
 
 
 
Student Presentations Spring 2006
            LaTeX by Wen Ying Gao (2/7)
            3NF & BCNF by Jasbir Jassu (2/9)
            Indexing and B+-Tree by Kenneth Cheung (2/9)
            B Tree by Takahiko Saito (2/14)
            B Tree Index Files by Huy Nguyen (2/14)
            Multivalued Dependencies by Tony Palladino (2/21)
            Hashing by Rafael Jaffarove (3/14)
            B+ Tree by Li Wen (2/21)
            LaTeX by John Eagle (2/21)
            LaTeX Graphics Inclusion Techniques (2/23)
            Lossless Decomposition 4NF by Karen McVay (2/23)
            Using Graphics LaTeX by Matt Edwards (2/28)
            B Trees and B+ Trees by Jay Yim (2/28)
            Object-Based Databases by Jose Reyes Jose (3/2)
            B-Trees by Brian Lee (3/7)
            Data Mining and Decision Tree by Masumi Shimoda (3/7)
            Introduction to Data Analysis and Mining by Laura Jordana (3/14)
            Data Analysis And Mining by Kat Powell (3/21)
            Data Mining by Larry Varela (3/23)
            Data Mining by Tracy Juang (3/23)
            Data Mining by Johan Johansson (4/4)
            Data Mining by Rui Zhao (4/6)
            Data Mining by Timothy Vu (4/6)
            Data Mining: Association Rules (4/6)
            Data Mining: Association Rules by Thanh Troung (4/11)
             Data Mining with JDM API by Regina Wang (4/11)
            Data Mining by Glen Shih (4/11)
            Data Preparation for Data Mining by Yuenho Leung (4/13)
            Data Mining by Mandeep Jandir (4/13)
            Data Mining Association by Andrew Williamson (4/13)
            Data Mining by Brandon Leonardo (4/18)
            Data Mining by Surya Bhagvat (4/18)
            Data Mining - Techniques and Applications by Charlie Chough (5/2)
            Information Gain (5/2)
            Data Mining by Carol Zhou (5/2)
            Decision Trees & the Iterative Dichotomiser 3 (ID3) Algorithm by David Ramos (5/4)
            Data Mining by Hae-Sun Jung (5/4)
            Decision Trees by Michael Jovero (5/4)
            Decision Trees by Victor Chan (5/24) : Decision Tree Applet tutorial video
            Data Mining by Cecilia Prang (5/11)
            Data Mining and Neural Networks by Danny Leung (5/11)
            Data Mining by Jemini Islam (5/11)
            Data Mining by Andrie Suherman (5/11)
            Data Mining by Dave Maung (5/11)
            Data Mining by Keng Teng Lao (5/11)
            Clustering Algorithms by Mu Yu Lu (5/11)
            Clustering Algorithms by Jia Huang (5/11)
 
 
Student Presentations Fall 2004
Ø      Ø      Nimi Berman - Object Relational Bridge
Ø      Ø      Arnold Messa - Indexing
Ø      Ø      Jenny M -  PHP and MySQL
Ø      Ø      Dashiel Fryer
Ø      Ø      Anthony Galano - MySQL
Ø      Ø      Sandeep Jangity - XML
Ø      Ø      Thomas - Security and Integrity
Ø      Ø      Wen-Lin Chang - SQL
Ø      Ø      Distributed Databases by Alex Genadinik
Ø      Ø      Object Oriented Database by Ashish Kaul
Ø        Object Oriented Database Management Systems by Wendy Wooters
Ø      Ø      Distributed Databases by Chien-Pin Hsu
Ø        Data Mining by Joseph Yu
Ø             Data Mining by Sze Ming (Leo) Tung
Ø             Data Mining by Rajnikant Modi
Ø        Query Processing by Melissa Jamili
Ø        Data Mining by Minh Osborne
Ø        Data Mining by Yanhua Xue
Ø        Distributed Databases and Twelve Objectives by Yingying Wu
Ø        Database System Architectures by Wei Jiang
PREVIOUS RESOURCES
 
 
Student Presentations Fall 2003
Ø      Ø      Relational Models: A Historical Perspective
Ø      Ø      Chapter 5 SQL Data Manipulation
Ø      Ø      Chapter 6: SQL Data Definition
Ø      Ø      Query Examples in Relational Algebra and SQL
Ø      Ø      Chapter 7 Query by Example
Ø      Ø      Relational Algebra to SQL
Ø      Ø      Commercial RDBMSs: Access and Oracle
Ø      Ø      Relational Calculus
Ø      Ø      Programmatic SQL
Ø      Ø      Security
Ø      Ø      Database Security
Ø      Ø      Locking and Deadlock
Ø      Ø      Query Processing
Ø      Ø      Transaction Management
Ø      Ø      Chapter 22 Distributed DBMSs
Ø      Ø      Chapter 22 Distributed DBMSs Concepts and Design
Ø      Ø      Introduction to Object DBMS
Ø      Ø      Security and Transaction
Ø      Ø      Object-Relational DBMSs
Ø      Ø      Database Security
Ø      Ø      Web Technology & DBMSs
Students' Presentation:
Ø      Ø                Concurrency Control
Ø      Ø                Data Mining Presentation
Ø      Ø                Decision Tree Presentation
Ø      Ø                Index and Hashing
Ø      Ø                B+ Tree
Ø      Ø                Transactions
Ø      Ø                Transactions 2
Ø      Ø                Query Processing
Ø      Ø                Query Optimization
Ø      Ø                Database System Architectures
 
Study Guide for Exams
Ø                Ø               Midterm 1 Friday 14, 2003 - Do not forget, and do not come late!!!
Ø                Ø               Midterm 2 Friday March 7, 2003 - Do not forget, and do not come late!!!
Ø                Ø               Midterm 3 Monday April 28, 2003 - Do not forget to be on time!!!
Ø                Ø               Final Friday May 16, 2003 - Arm yourself, don't be late!!!
 
Some Useful Links:

 

Peter P. Chen. The entity-relationship model - Towards a unified view of data. ACM Transactions on Database Systems (TODS), 1(1):9-36, 1976.

 

E. F. Codd. A relational model of data for large shared data banks. Communications of the ACM, 13(6):377-387, 1970. (.pdf) (.html)

 

E. F. Codd. Data models in database management. In Proc. ACM SIGMOD/PODS Workshop on Data Abstraction, Databases, and Conceptual Modeling, pp. 112-114,  1980.

 

William Kent. A simple guide to five normal forms in relational database theory. Communications of the ACM, 26(2):120-125, 1983.

 

J. Gray. The Transaction Concept: Virtues and Limitations. In Proc. 7th VLDB Conference, Cannes, France, pp. 144-154, 1981. (MS Word document)

 

Moshe M. Zloof. Query-by-Example: A data base language. IBM Systems Journal, 16(4):324-343, 1977.

Ø                Ø                B-trees
Ø                Ø                B-trees and 2-3-trees
Ø                Ø                B+-trees
Ø                Ø                Hash Tables
Ø                Ø                Databases Glossary
Ø                Ø                Normal form (1, 2, 3, 4)
Ø                Ø                Understanding Normalization
Ø                Ø                Anomalies
Ø                Ø                Normalization - formal definition
Ø                Ø                Decomposition, Normalization
Ø                Ø                Designing Relational databases
Ø                Ø                Database theory
Ø                Ø                BCNF decomposition (interesting algorithm for decomposition)
Ø                Ø                How to use Pro*C and write programs for a database
Ø                Ø                Functional Dependency
Ø                Ø                Normalization
Ø                Ø                Relation Design
Ø                Ø                Database Design
Ø                Ø                Lossless Join Decomposition
Ø                Ø                Relation Decomposition
Ø                Ø                 Data Mining White Paper
Ø                Ø                 Data warehouse
Ø                Ø                 Data mining 3
Ø                Ø                 Introduction to Machine Learning and Data Mining
Ø                Ø                Advanced Topics in Database Management System - High Dimensional Indexing, Similarity Join and Data Mining
Ø                Ø                Principles of Knowledge Discovery in Database
Ø                Ø                B+ tree definition, animation program link
Ø                Ø                Oracle Call Interface Programmer's Guide + SQL (CS157A)
 
Posted on September 30, 2003
Ø                Ø                Database Design
Ø                Ø                Transform EER to relational Model
Ø                Ø                Functional Dependencies and Normalization  

Ø               Ø               Relational Algebra

Ø               Ø               The relational model: operations

Ø               Ø               The SQL language

Ø               Ø               Simple SQL

Ø               Ø               Data Manipulation: Retrieval Operations

Ø               Ø               Advanced SQL

Ø               Ø               More Queries

Ø               Ø               Object-relational Data model

Ø               Ø               Data Warehouse

 

Ø               Ø               ER diagrams 4

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec7.pdf

Ø               Ø               ER-diagrams 5

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec8.pdf

Ø               Ø               ER and EER

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec9.pdf

Ø               Ø               Relational Algebra and SQL

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch6.pdf

Ø               Ø               Relational Calculus

http://www.cs.unc.edu/Courses/comp130-s03/comp130-8.pdf

Ø               Ø               Relational Calculus and visual query language

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch7.pdf

Ø               Ø               Theory of functional dependencies

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec12.pdf

Ø               Ø               Theory of normalization

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec13.pdf

Ø               Ø               Normalization 2

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec10.pdf

Ø               Ø               Normalization 3

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec11.pdf

Ø               Ø               Database Design: Normalization

http://www.cs.unc.edu/Courses/comp130-s03/comp130-11.pdf

Ø               Ø               Relational Normalization

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch8.pdf

Ø               Ø               Distributed Databases

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch18.pdf

Ø               Ø               OLAP and data mining

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch19.pdf

Ø               Ø               Query Processing

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch13.pdf

Ø               Ø               An overview of Query Optimization

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch14.pdf

Ø               Ø               Transactions

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec14.pdf

Ø               Ø               Transaction and Concurrency

http://www.cis.ohio-state.edu/~srini/670/Lectures/lec15.pdf

Ø               Ø               Object Database

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch16.pdf

Ø               Ø               XML and databases

http://www.ug.cs.sunysb.edu/~kifer/cse532/slides/ch17.pdf

Ø                Ø                Guidance note on Plagiarism - from the School of Computer Science, The University of Birmingham
Ø                Ø                Talking in Code [Interview with B. Stroustrup]
Ø                Ø                Beautiful Life
Ø                Ø                SARS