Joins, Log-Structured Merge Trees, Enterprise Storage




CS257

Chris Pollett

Nov 2, 2020

Outline

Sort-based Join

Partitioning Relations by Hashing

Hashed based Duplicate Elimination

Hash-Join

Index-based Selection

Index-based Join

Quiz

Which of the following is true?

  1. It is impossible for a table to be in more than one tablespace.
  2. Sorting a table cannot be done if the table cannot fit in RAM.
  3. In the map reduce paradigm, a combiner runs the reduce job on the local machine for each sorted file, producing sorted reduced files which can then be sent to their respective machines.

Obtaining Estimates for Size Parameters

Incremental Computation of Statistics

Incremental Indexing

Building a Log-Structured Merge Tree

Enterprise Storage Subsystems and Business Continuity

Intermittent Failures

Checksums

Stable Storage

Implementing Stable Storage

Failure Model for Disks

Mirroring as a Redundancy Technique

Parity Blocks

RAID 4 and Failures

RAID 5