Live Course Module: Luigi Course for Data Engineering
Total Duration: 40 Hours (4 Weeks)
Week 1: Introduction to Luigi & Core Concepts
Duration: 8 hours (4 sessions × 2 hrs)
Topics:
- Introduction to Workflow Orchestration (2 hrs)
  - What is workflow orchestration?
  - Why Luigi? Comparison with Airflow and Prefect
  - Overview of Luigi architecture: Scheduler, Worker, Task, Target
- Luigi Installation & Environment Setup (2 hrs)
  - Installing Luigi in a Python environment
  - Setting up the Luigi central scheduler (luigid)
  - Exploring the Luigi web UI
- Building Your First Luigi Task (2 hrs)
  - Creating a simple Luigi pipeline
  - Understanding the requires(), output(), and run() methods (see the sketch after this list)
  - Task dependencies and data flow
- Mini Project + Q&A (2 hrs)
  - Build a two-stage Luigi pipeline (data extraction + transformation)
  - Run and monitor it through the Luigi dashboard
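A minimal sketch of what this week's mini project could look like, assuming local CSV files and hypothetical task and path names; it shows how requires(), output(), and run() fit together in a two-stage extract-and-transform pipeline.

```python
import luigi


class ExtractSales(luigi.Task):
    """Stage 1: write raw data to a local file (stands in for a real extract)."""

    def output(self):
        # The Target Luigi checks to decide whether this task already ran.
        return luigi.LocalTarget("data/raw_sales.csv")

    def run(self):
        with self.output().open("w") as f:
            f.write("order_id,amount\n1,100\n2,250\n")


class TransformSales(luigi.Task):
    """Stage 2: read the extracted file and write a transformed copy."""

    def requires(self):
        # Declares the dependency; Luigi runs ExtractSales first.
        return ExtractSales()

    def output(self):
        return luigi.LocalTarget("data/transformed_sales.csv")

    def run(self):
        with self.input().open("r") as src, self.output().open("w") as dst:
            for line in src:
                dst.write(line.upper())


if __name__ == "__main__":
    luigi.run()
```

Saved as, say, pipeline.py, this can be run with `python pipeline.py TransformSales --local-scheduler`; pointing the same command at a running luigid instead makes both tasks visible in the web UI.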
Week 2: Intermediate Luigi – Task Dependencies & Data Pipelines
Duration: 10 hours (5 sessions × 2 hrs)
Topics:
- Task Parameters and Dynamic Pipelines (2 hrs)
  - Using luigi.Parameter for configurable workflows (see the sketch after this list)
  - Dynamic dependencies and parameterized tasks
- Input/Output Targets (2 hrs)
  - FileSystemTarget, LocalTarget, S3Target, and HdfsTarget
  - Managing inputs and outputs in pipelines
- Error Handling & Task Re-Runs (2 hrs)
  - Handling task failures
  - Task retry policies
  - Incremental pipeline execution
- Integrating Luigi with Databases (2 hrs)
  - Reading and writing data with PostgreSQL, MySQL, and SQLite
  - Example: ETL task using SQL queries in Luigi
- Mini Project + Q&A (2 hrs)
  - Build a three-step ETL pipeline using Luigi and SQL databases
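A minimal sketch of a parameterized task with a per-task retry policy, using a hypothetical daily-report task; the date parameter flows into the output path, so each date gets its own target and already-completed dates are skipped on re-runs.

```python
import luigi


class DailyReport(luigi.Task):
    # Parameters make one task class reusable: one concrete task per date.
    report_date = luigi.DateParameter()

    # With the central scheduler, a failed run of this task can be re-queued
    # up to 2 times (a global default can also be set in luigi.cfg).
    retry_count = 2

    def output(self):
        # One target per parameter value enables incremental execution.
        return luigi.LocalTarget(f"reports/{self.report_date}.csv")

    def run(self):
        with self.output().open("w") as f:
            f.write(f"sales report for {self.report_date}\n")
```

From the command line this could be invoked as `luigi --module daily_reports DailyReport --report-date 2024-06-01 --local-scheduler` (module name hypothetical).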
Week 3: Advanced Luigi – Integration with Big Data Tools
Duration: 10 hours (5 sessions × 2 hrs)
Topics:
- Luigi with Apache Spark (2 hrs)
  - Submitting Spark jobs through Luigi (see the sketch after this list)
  - Managing distributed data transformations
- Luigi with Hadoop / HDFS (2 hrs)
  - Using HdfsTarget for file I/O
  - Integrating Luigi with existing Hadoop workflows
- Luigi with Kafka for Streaming Data (2 hrs)
  - Real-time ingestion pipeline design
  - Triggering Luigi tasks to process micro-batches from Kafka topics
- Monitoring & Logging Pipelines (2 hrs)
  - Luigi logging configuration
  - Job status monitoring and troubleshooting
- Mini Project + Q&A (2 hrs)
  - Build a data pipeline integrating Spark, HDFS, and Luigi
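A minimal sketch of driving spark-submit from Luigi via luigi.contrib.spark.SparkSubmitTask; the script path, HDFS paths, and master setting are hypothetical, and the task is assumed complete once the Spark job writes its _SUCCESS flag.

```python
import luigi
from luigi.contrib.hdfs import HdfsTarget
from luigi.contrib.spark import SparkSubmitTask


class CleanSales(SparkSubmitTask):
    # Settings Luigi uses to build the spark-submit command.
    app = "jobs/clean_sales.py"   # hypothetical PySpark script
    master = "yarn"

    def app_options(self):
        # Arguments forwarded to the PySpark script.
        return ["--input", "/raw/sales", "--output", "/clean/sales"]

    def output(self):
        # Completion check: the flag file the Spark job writes on success.
        return HdfsTarget("/clean/sales/_SUCCESS")
```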
Week 4: Deployment, Scaling & Capstone Project
Duration: 12 hours (6 sessions × 2 hrs)
Topics:
- Scheduling Luigi Workflows (2 hrs)
  - Running Luigi pipelines on a schedule
  - Using cron and the Luigi daemon mode
- Scaling Luigi in Production (2 hrs)
  - Multi-worker setup
  - Parallel task execution
  - Optimizing performance and resource usage
- Luigi in Cloud Environments (2 hrs)
  - Running Luigi on AWS/GCP/Azure
  - Luigi with S3/GCS for input/output data management (see the sketch after this list)
- CI/CD for Luigi Pipelines (2 hrs)
  - Version control with Git
  - Automating Luigi deployments with Docker
- Capstone Project Development (2 hrs)
  - Build an end-to-end data pipeline using Luigi, Spark, and cloud storage
- Capstone Presentation & Review (2 hrs)
  - Pipeline presentation and feedback
  - Best practices and career use cases
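A minimal sketch of using S3 as the pipeline's storage layer with luigi.contrib.s3.S3Target; the bucket name is hypothetical, and AWS credentials are assumed to come from the environment or an IAM role.

```python
import luigi
from luigi.contrib.s3 import S3Target


class PublishDailyExtract(luigi.Task):
    run_date = luigi.DateParameter()

    def output(self):
        # Completion is decided by whether this object exists in S3, so a cron
        # entry can safely re-trigger the pipeline without redoing finished work.
        return S3Target(f"s3://example-sales-datalake/extracts/{self.run_date}.csv")

    def run(self):
        with self.output().open("w") as f:
            f.write("order_id,amount\n1,100\n")
```

Raising the worker count, e.g. `luigi --module cloud_tasks PublishDailyExtract --run-date 2024-06-01 --workers 4`, lets independent tasks in a larger pipeline run in parallel; a cron entry invoking the same command covers the scheduling session.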
🧩 Capstone Project Example
Project Title: Automated Data Pipeline for Daily Sales Analytics
Goal: Ingest sales data from APIs → store in S3 → transform via Spark → load into a data warehouse → orchestrate with Luigi
Tech Stack: Luigi, Python, Spark, AWS S3, PostgreSQL
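A hypothetical skeleton of how the capstone stages could be wired together; the class names and storage layout are illustrative, and the output()/run() bodies are left for students to implement during the project.

```python
import luigi


class FetchSalesFromApi(luigi.Task):
    run_date = luigi.DateParameter()
    # output(): raw JSON on local disk; run(): call the sales API.


class StageToS3(luigi.Task):
    run_date = luigi.DateParameter()

    def requires(self):
        return FetchSalesFromApi(self.run_date)
    # output(): an S3Target under the raw prefix; run(): upload the fetched file.


class TransformWithSpark(luigi.Task):
    run_date = luigi.DateParameter()

    def requires(self):
        return StageToS3(self.run_date)
    # Typically a SparkSubmitTask (see Week 3) writing curated data back to S3.


class LoadToWarehouse(luigi.Task):
    run_date = luigi.DateParameter()

    def requires(self):
        return TransformWithSpark(self.run_date)
    # run(): load the curated data into the PostgreSQL warehouse.
```

Scheduling LoadToWarehouse once per day then pulls the entire chain through Luigi's dependency resolution.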