Parallel and Distributed Databases




CS157b

Chris Pollett

Apr 27, 2020

Outline

Introduction

Models of Parallelism

Parallel Algorithms on Relations

Tuple-at-a-Time Operations in Parallel

Quiz

Which of the following statements is true?

  1. It is impossible to have a write-too-late situation when multi-version timestamping is used for concurrency control.
  2. A DBMS handling a transaction at the serializable isolation level might need to make use of IX or IS locks.
  3. Serializable schedules are always recoverable.

Parallel Algorithms for Full-Relation Operations - Distinct Case

Parallel Algorithms for Full-Relation Operations - General Case

Cost of Parallel Algorithms for Full-Relation Operations

Map-Reduce Parallelism Framework

The Basic Framework

Distinct Phases of a Map-Reduce Job

Example Map-Reduce Job for Counting

[Figure: example Map-Reduce job for counting]
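
Since the slide's figure did not survive extraction, here is a minimal single-process sketch of what a counting job might look like. It assumes the job counts word occurrences across documents, which the slide title does not confirm. The names map_phase, shuffle, reduce_phase, and run_job are hypothetical and do not belong to any particular Map-Reduce framework; a real framework would run the map and reduce calls on many machines in parallel.

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Map: emit a (word, 1) pair for every word in one input document."""
    for word in text.lower().split():
        yield (word, 1)


def shuffle(pairs):
    """Shuffle: group all intermediate values by key.

    A real framework does this across the network; here it is a dict.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return (key, sum(values))


def run_job(documents):
    """Run map, shuffle, and reduce in sequence (hypothetical driver)."""
    intermediate = []
    for doc_id, text in documents.items():
        intermediate.extend(map_phase(doc_id, text))
    grouped = shuffle(intermediate)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    docs = {"d1": "a b a", "d2": "b c"}
    print(run_job(docs))
```

The three functions correspond to the distinct phases of a job: each map call sees only its own document, the shuffle brings together all pairs with the same key, and each reduce call sees only the values for one key, which is what lets the map and reduce steps run independently on separate processors.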

Distributed Databases