Skip to main content

Section outline

    • Spark vs Hadoop, Lambda Architecture

    • RDD Operations, Caching, Checkpointing

    • Spark Internals: DAG, Partitions, Shuffling

    • Performance Tuning and Cluster Setup

    • Lab: RDD manipulation, caching, metrics analysis