Parallel and Distributed Databases




CS157b

Chris Pollett

Apr 25, 2018

Outline

Introduction

Models of Parallelism

Parallel Algorithms on Relations

Tuple-at-a-Time Operations in Parallel

In-Class Exercise

Parallel Algorithms for Full-Relation Operations - Distinct Case

Parallel Algorithms for Full-Relation Operations - General Case

Cost of Parallel Algorithm for Full-Relations

Map-Reduce Parallelism Framework

The Basic Framework

Distinct Phases of a Map-Reduce Job

Example Map-Reduce Job for Counting

Example map reduce job for counting

Distributed Databases