Information Integration - OLAP




CS157b

Chris Pollett

May 3, 2023

Outline

Information Integration

Use Cases of Information Integration

The Heterogeneity Problem

Three Information Integration Techniques

Below are three common techniques for integrating data source:

  1. Federated Databases. Here we have multiple independent data sources, but can use one source to call on the others to supply info.
  2. Warehousing. Copies of data from several sources are stored in a single database called a warehouse. This data has usually been processed to a common schema, filtered, and aggregated. The warehouse is usually updated periodically (say once a day at off peak times).
  3. Mediation. This is a software component that supports a virtual database, which the user may query. It stores no data itself, but instead, queries its sources and synthesizes from them a response.

Federated Database Systems

Example Federated Database Architecture

Data Warehouses

Example Data Warehouse Architecture

Warehouses Example

In-Class Exercise

Mediators

Example Mediator Mechanics

Mediator Example

Mediator Wrappers - Templates

Mediator Wrappers - Filters

Global as View Optimization

Online Analytical Processing

Example OLAP Application

Multi Dimensional View of OLAP data

Car dealer example of multidimensional data

Approaches to OLAP