HRDC Reg. No: 10001547563
Duration: 5 days (35 hours)
Course Overview
This intensive hands-on course provides a comprehensive understanding of Apache Kafka and the Confluent platform. Participants will gain expertise in building scalable, real-time streaming architectures, managing Kafka clusters, creating custom producers/consumers, integrating Kafka with Spark, and leveraging Confluent tools like Kafka Connect and KSQL DB for streamlined data pipeline development.
Who Should Attend?
-
Data Engineers
-
Backend Developers
-
DevOps Engineers
-
Solution Architects
-
System Integrators
Targeted Industries
-
Financial Services & Banking
-
Telecommunications
-
E-Commerce & Retail
-
Media & Entertainment
-
Government & Smart Infrastructure
-
Logistics & Manufacturing
Why Choose This Course
HRDC Claimable – [Insert HRDC Claimable ID once registration number is available]
A practical guide to building enterprise-grade, real-time data platforms with Kafka and Confluent tools, optimized for mission-critical event-driven systems and data pipelines.
Learning Outcomes
By the end of this course, participants will be able to:
-
Deploy and manage multi-node Kafka clusters
-
Create custom Kafka producers and consumers using Java
-
Build and manage streaming pipelines using Spark and Kafka Streams
-
Secure Kafka with SSL, SASL, and ACLs
-
Use Kafka Connect for integration with external systems
-
Query and process real-time streams using KSQL DB
-
Monitor and tune Kafka for performance and reliability
Prerequisites
-
Knowledge of distributed computing
-
Basic understanding of Hadoop and Spark
-
Programming experience in Java or Python
-
Familiarity with Linux and command-line tools
-
Awareness of enterprise architecture concepts
Lab Setup
Each participant will receive a dedicated environment with:
-
3-node Kafka cluster (includes Zookeeper, Kafka, Spark, and connectors)
-
Hardware Requirements:
-
Processor: Intel i5 (8 cores)
-
RAM: 32 GB
-
Storage: 200 GB SSD (2,000 IOPS, 100 Mbps)
-
OS: Ubuntu 22.04
-
Software: IntelliJ, PyCharm, Docker, Java 8/11, Maven, Python 3.8+, Chrome
-
Access: Internet (GitHub, Google Drive), SSH, sudo access
-
Note: AWS setup, IP whitelisting, and proxy configuration as needed
Teaching Methodology
-
Instructor-led architecture deep dives
-
Hands-on coding and labs with real-time data
-
Project simulations and daily quizzes
-
Scenario-based exercises using Twitter or finance data streams