- Parallel: Running multiple computations (often on the same computer) at the same time
- Distributed: Running a calculation across multiple, networked computers
- Often used together, or even interchangeably, especially in HPC
- Instruction pipelining
- Completely transparent (invisible) parallelism
- Interleaves the steps of independent operations; may execute "speculatively"
- Data parallelism: SIMD (Single Instruction, Multiple Data)
- Instructions operate on multiple values simultaneously
- Vectorization: MMX, SSE, AVX
- Sometimes inferred by the compiler from loops
- Or exposed explicitly: hand-written assembly, intrinsics/special functions, libraries (see the sketch below)
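As a concrete illustration, here is a minimal sketch using NumPy (a library choice assumed here, not named above): the whole-array expression dispatches to compiled kernels that can use SSE/AVX, while the equivalent Python loop processes one element at a time.

```python
# Hypothetical illustration: NumPy is one library that exposes SIMD-backed,
# whole-array operations; the explicit loop below cannot be vectorized.
import numpy as np

n = 1_000_000
a = np.random.rand(n)
b = np.random.rand(n)

# Scalar loop: one element per iteration, interpreted Python.
c_loop = np.empty(n)
for i in range(n):
    c_loop[i] = a[i] * b[i] + 1.0

# Vectorized: one call over all elements, typically compiled to SSE/AVX code.
c_vec = a * b + 1.0

assert np.allclose(c_loop, c_vec)
```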
- Everything runs in a process
- Defines global state: memory contents, open files, network connections, etc.
- Only makes use of one core by default
- Parallel execution sharing resources in a single process
- (Global) variables, open files, global state: all shared
- Easy to read the same data
- Hard to write to the same data safely (race conditions)
- Synchronization primitives (locks/mutexes) coordinate writes (see the sketch below)
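A minimal sketch with Python's threading module of why shared writes are hard: four threads increment one shared counter, and the lock (mutex) is what makes the result deterministic; without it, updates can race and be lost.

```python
import threading

counter = 0                # shared global state, visible to every thread
lock = threading.Lock()    # mutex protecting the shared counter

def add_many(n):
    global counter
    for _ in range(n):
        with lock:         # synchronize: one thread writes at a time
            counter += 1

threads = [threading.Thread(target=add_many, args=(100_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(counter)             # 400000 with the lock; without it, often less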
- Some libraries turn single function calls into multi-threaded calculations
- Don't require any explicit code changes
- Consider the interaction with explicit parallelism (thread counts multiply!); see the sketch below
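One common way to keep such hidden thread pools in check is to set the relevant environment variables before the library is imported; which variables apply depends on the installed backend, so treat this as an assumption about your setup.

```python
# These variables are read by common BLAS/OpenMP backends behind NumPy
# (OpenMP, OpenBLAS, Intel MKL); which ones matter depends on your install.
import os
os.environ["OMP_NUM_THREADS"] = "1"
os.environ["OPENBLAS_NUM_THREADS"] = "1"
os.environ["MKL_NUM_THREADS"] = "1"

import numpy as np

# Without the caps above, this single call may fan out across every core;
# run it from several processes at once and the thread count multiplies.
a = np.random.rand(2000, 2000)
b = a @ a
```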
- Parallel execution in separate resource spaces
- Separate copies of all data
- Need to explicitly communicate (send messages) to coordinate
- On one machine: same filesystem, fast IPC, "shared" memory segments
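A minimal sketch of process-level parallelism using Python's multiprocessing module: each worker gets its own copy of its chunk of the data and sends its partial result back as a message on a queue.

```python
from multiprocessing import Process, Queue

def worker(chunk, out):
    out.put(sum(x * x for x in chunk))       # communicate result as a message

if __name__ == "__main__":
    data = list(range(1_000_000))
    out = Queue()
    chunks = [data[i::4] for i in range(4)]  # divide the work into 4 pieces
    procs = [Process(target=worker, args=(c, out)) for c in chunks]
    for p in procs:
        p.start()
    total = sum(out.get() for _ in procs)    # collect the messages
    for p in procs:
        p.join()
    print(total)
```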
- Various machines working together
- Must send messages to communicate, coordinate
- Homogeneous cluster of machines
- Low-latency network allows fast communication
- Shared network filesystems
- Low-latency networks (10x slower than local memory, 100x faster than SSD)
- Makes distributed computing similar to process-level parallelism
- Processes running on separate hardware
- No shared memory
- Tightly-coupled execution, often running the same code
- Run more at once than fits on a single machine (more memory, more computation)
- Share intermediate results throughout computation
- Running many independent (though often parallel) computations
- Collect and store results across many inputs or parameter values
- A number of ranks work in parallel, usually running the same code
- Divide up work among themselves
- Coordinate/communicate to share intermediate values
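A minimal sketch of this rank-based style using mpi4py (one possible library, assumed here): every rank runs the same script, selects its share of the work from its rank number, and a reduction combines the partial results. Launched with something like `mpirun -n 4 python script.py`.

```python
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()   # this process's rank (ID)
size = comm.Get_size()   # total number of ranks

n = 1_000_000
# Divide up the work: each rank sums a strided slice of the range.
local = sum(i * i for i in range(rank, n, size))

# Coordinate/communicate: combine partial sums on rank 0.
total = comm.reduce(local, op=MPI.SUM, root=0)
if rank == 0:
    print(total)
```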
- Run one main process
- Hand off pieces of work to a pool of workers
- Coordination happens in main process
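A minimal sketch of the main-process/worker-pool pattern with Python's concurrent.futures: the main process owns the inputs and the collected results and hands parameter sets to a pool of worker processes (the `simulate` function is a made-up stand-in).

```python
from concurrent.futures import ProcessPoolExecutor

def simulate(params):
    # Stand-in for one independent computation on a parameter set.
    return sum(i * params for i in range(1_000))

if __name__ == "__main__":
    parameter_sets = range(100)
    with ProcessPoolExecutor(max_workers=4) as pool:
        # Main process hands off work; coordination stays here.
        results = list(pool.map(simulate, parameter_sets))
    print(len(results), "results collected by the main process")
```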