Finish OLAP - Data Mining




CS157b

Chris Pollett

May 8, 2023

Outline

Introduction

Star Schemas

Illustration of a Star Schema

Quiz

Which of the following statements is true?

  1. It is possible for different distributed transactions in the same distributed database to use a different machine as the lock coordinator.
  2. A federated database is a software component that supports a virtual database, which the user may query. It stores no data itself, but instead, queries its sources and synthesizes from them a response.
  3. A raw data cube would contain aggregates of the data in all subsets of dimensions, as well as the data itself.

Slicing and Dicing Data

[Figures: Data Cube Dicing; Data Cube Slicing]

Example of Slicing and Dicing

The Cube Operator

How a cube operator augments a data cube
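
As a rough illustration of the idea in this slide title, the sketch below augments a tiny fact table with aggregates over every subset of its dimensions, using `*` to mark a dimension that has been summed out. The function name `cube`, the `sales` rows, and the column names are all hypothetical and chosen only for this example. It is a minimal sketch of the concept, not how a database engine implements CUBE.

```python
from collections import defaultdict
from itertools import combinations


def cube(rows, dims, measure):
    """Sum `measure` for every subset of `dims`; '*' marks an aggregated dimension."""
    totals = defaultdict(int)
    for row in rows:
        # Each row contributes to one cell per subset of kept dimensions.
        for k in range(len(dims) + 1):
            for kept in combinations(dims, k):
                key = tuple(row[d] if d in kept else "*" for d in dims)
                totals[key] += row[measure]
    return dict(totals)


# Hypothetical fact table: two dimensions (model, color) and one measure (sold).
sales = [
    {"model": "sedan", "color": "red", "sold": 3},
    {"model": "sedan", "color": "blue", "sold": 2},
    {"model": "truck", "color": "red", "sold": 4},
]

result = cube(sales, ["model", "color"], "sold")
for key in sorted(result):
    print(key, result[key])
```

The cells with no `*` are the raw data; the cells containing `*` are the aggregates the operator adds, with `('*', '*')` holding the grand total.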

SQL CUBE and ROLLUP operations

MySQL ROLLUP operations

Data Mining

Frequent Sets of Items

[Figure: Viewers of Bourne and Harry Potter movies]

Association Rules

The Computation Model for Frequent Itemsets

Naive Algorithms for Frequent Item Sets

A Priori Algorithm
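
One possible way to sketch the A Priori idea in code: count single items first, then on each later pass count only those candidate k-itemsets whose (k-1)-subsets were all frequent on the previous pass. The function name `apriori`, the `baskets` data, and the support threshold below are hypothetical and chosen only for illustration. The sketch assumes items are sortable and that everything fits in memory, so it ignores the pass-over-disk cost that the computation model cares about.

```python
from itertools import combinations


def apriori(baskets, min_support):
    """Return {itemset: count} for every itemset in at least `min_support` baskets."""
    baskets = [frozenset(b) for b in baskets]

    # Pass 1: count individual items.
    counts = {}
    for basket in baskets:
        for item in basket:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: n for s, n in counts.items() if n >= min_support}
    result = dict(frequent)

    k = 2
    while frequent:
        # Candidates: k-itemsets whose (k-1)-subsets are all frequent.
        items = sorted({i for s in frequent for i in s})
        candidates = [
            frozenset(c)
            for c in combinations(items, k)
            if all(frozenset(sub) in frequent for sub in combinations(c, k - 1))
        ]
        # Pass k: count only the surviving candidates.
        counts = {c: sum(1 for b in baskets if c <= b) for c in candidates}
        frequent = {s: n for s, n in counts.items() if n >= min_support}
        result.update(frequent)
        k += 1
    return result


# Hypothetical market baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "beer"},
    {"bread", "milk", "beer"},
    {"milk"},
]
found = apriori(baskets, min_support=2)
for itemset, count in found.items():
    print(sorted(itemset), count)
```

In this toy data the pair {milk, beer} appears in only one basket, so the triple {bread, milk, beer} is never counted: it is pruned because one of its subsets is not frequent.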

A Priori Example