Skip to main content

Section outline

    • What is Apache Ozone and how it differs from HDFS/S3

    • Ozone architecture: OM, SCM, containers, block storage

    • S3 API support and CLI/Web UI usage

    • Lab: Manage volumes, buckets, keys; configure fault tolerance

    • Monitoring: Recon, Prometheus, Grafana

    • Replication, Erasure Coding, Multi-tenancy, Disaster Recovery

    • Spark/Flink/Hive integration with Ozone

    • Lab: Spark jobs on Ozone, performance tuning, Teragen/TeraSort

    • Kerberos & Ranger-based RBAC

    • Encryption in transit/at rest, TLS setup

    • Kafka → Flink → Ozone streaming pipeline

    • Lab: Secure access control, end-to-end data lake project

    • Capstone: RDBMS → Kafka → Flink/Spark → Ozone pipeline

    • Q&A and certification quiz