Packt

Ultimate AWS Data Engineering Bootcamp - 15 Real-World Labs

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Process and visualize real-time data using Kinesis, Spark Streaming, and Streamlit.

  • Automate workflow execution using ECS, Lambda, Step Functions, and GitHub Actions.

  • Build and manage lakehouses using Glue, S3, Athena, and Delta Lake architecture.

  • Design, deploy, and orchestrate AWS-native batch and real-time data pipelines.

Details to know

Shareable certificate

Add to your LinkedIn profile

Recently updated!

February 2026

Assessments

16 assignments

Taught in English

There are 16 modules in this course

In this module, we will set the foundation for your journey through AWS data engineering. You'll gain clarity on the course structure, explore the tech stack—including Docker, AWS CLI, and more—and ensure your local environment is ready for executing the real-world labs. This introduction is critical to align expectations and configure the tools required for success.

What's included

3 videos, 1 reading

In this module, we will implement a batch data processing project for music streaming data. You'll learn to use Airflow for orchestration and Redshift Serverless for storage and querying, culminating in a full pipeline execution. The focus is on understanding the interaction between orchestration tools and AWS services.

What's included

9 videos, 1 assignment

In this module, we will process music stream data using a distributed system that combines PySpark and DynamoDB. You'll use Airflow to orchestrate the workflow and execute jobs using the AWS Glue Docker image locally. This project introduces scalable and parallel data processing techniques.

What's included

5 videos, 1 assignment

In this module, we will build a robust ETL pipeline for rental apartment data. You will set up MySQL in AWS Aurora, use Glue for data transformation, and orchestrate the workflow using Step Functions and EventBridge. This lab emphasizes automation and modular pipeline execution.

What's included

9 videos, 1 assignment
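
As a rough illustration of the orchestration step described in this module, the sketch below builds a Step Functions state machine definition (Amazon States Language) that runs a single Glue job and waits for it to finish. The job name, retry settings, and function name are hypothetical and are not taken from the course material; the real lab may structure its workflow differently.

```python
import json

# Hypothetical job name; the course does not name its Glue job.
GLUE_JOB_NAME = "rental-apartments-etl"


def build_state_machine(job_name: str) -> dict:
    """Return an Amazon States Language definition that runs one Glue job."""
    return {
        "Comment": "Run a Glue ETL job and wait for it to finish",
        "StartAt": "RunGlueJob",
        "States": {
            "RunGlueJob": {
                "Type": "Task",
                # The .sync suffix makes Step Functions wait for job completion.
                "Resource": "arn:aws:states:::glue:startJobRun.sync",
                "Parameters": {"JobName": job_name},
                # Retry values are illustrative assumptions, not course settings.
                "Retry": [
                    {
                        "ErrorEquals": ["States.TaskFailed"],
                        "IntervalSeconds": 60,
                        "MaxAttempts": 2,
                        "BackoffRate": 2.0,
                    }
                ],
                "End": True,
            }
        },
    }


# JSON text that could be supplied as a state machine definition.
definition_json = json.dumps(build_state_machine(GLUE_JOB_NAME), indent=2)
print(definition_json)
```

An EventBridge rule or schedule could then start this state machine, which is one plausible way to get the automated, modular execution the module emphasizes.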

In this module, we will create a data lake for a rental vehicle store using scalable services like EMR and Athena. You'll execute PySpark both locally and in the cloud, integrate metadata using Glue crawlers, and automate the pipeline using Step Functions.

What's included

8 videos, 1 assignment

In this module, we will develop an event-driven data pipeline tailored for an e-commerce application. You'll containerize Python apps, deploy them using ECS, and automate workflows using Step Functions and EventBridge. This lab blends DevOps and data pipeline principles.

What's included

7 videos, 1 assignment

In this module, we will build a lakehouse architecture combining the flexibility of data lakes and the performance of data warehouses. You will use PySpark with Delta Lake, manage metadata with Glue Catalog, and query data through Athena and Redshift.

What's included

5 videos, 1 assignment

In this module, we will implement real-time processing of taxi trip data using a serverless approach. You'll set up Kinesis streams, deploy Lambda functions, and execute a complete pipeline. This lab reinforces serverless computing and event-driven design.

What's included

5 videos, 1 assignment
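
To give a sense of the serverless pattern in this module, here is a minimal sketch of a Lambda handler that consumes a batch of Kinesis records. Lambda delivers each record's payload base64-encoded under `Records[].kinesis.data`; the JSON payload shape and the `fare` field are assumptions made for illustration, not the course's actual taxi trip schema.

```python
import base64
import json


def handler(event, context):
    """Decode Kinesis records delivered to Lambda and summarize the batch."""
    trips = []
    for record in event["Records"]:
        # Kinesis payloads arrive base64-encoded inside the Lambda event.
        payload = base64.b64decode(record["kinesis"]["data"])
        trips.append(json.loads(payload))

    # "fare" is a hypothetical field name used only for this example.
    total_fare = sum(trip.get("fare", 0.0) for trip in trips)
    return {"trip_count": len(trips), "total_fare": total_fare}
```

In a real pipeline the handler would typically write its results to a downstream store rather than return them, but the decoding loop stays the same.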

In this module, we will process mobile network logs using real-time technologies and deliver interactive insights via Streamlit. You'll build and deploy dashboards to ECS, leveraging Spark for streaming data and Glue Catalog for metadata management.

What's included

6 videos, 1 assignment

In this module, we will set up CI/CD pipelines to automate deployment of AWS Glue jobs, ECS tasks, and Lambda functions using GitHub Actions. You'll learn how to build and manage version-controlled workflows for repeatable deployments.

What's included

5 videos, 1 assignment

In this module, we will ingest real-time clickstream data using Kinesis Firehose and enrich it using Lambda before storing it in Redshift. You'll build a robust pipeline suitable for web analytics or behavioral tracking applications.

What's included

4 videos, 1 assignment
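
The enrichment step in this module can be sketched as a Firehose data-transformation Lambda. Firehose passes records with a `recordId` and base64-encoded `data`, and expects each one back with the same `recordId`, a `result` of `Ok`, `Dropped`, or `ProcessingFailed`, and re-encoded `data`. The `user_agent` and `is_mobile` fields are hypothetical; the course's actual clickstream schema and enrichment logic are not shown in this listing.

```python
import base64
import json


def handler(event, context):
    """Enrich clickstream records in a Firehose transformation Lambda."""
    output = []
    for record in event["records"]:
        try:
            click = json.loads(base64.b64decode(record["data"]))
            # Hypothetical enrichment: flag clicks from mobile user agents.
            click["is_mobile"] = "Mobile" in click.get("user_agent", "")
            # A trailing newline keeps one JSON object per line downstream.
            data = (json.dumps(click) + "\n").encode("utf-8")
            output.append({
                "recordId": record["recordId"],
                "result": "Ok",
                "data": base64.b64encode(data).decode("utf-8"),
            })
        except (ValueError, TypeError, AttributeError):
            # Return unparseable records unchanged and mark them as failed.
            output.append({
                "recordId": record["recordId"],
                "result": "ProcessingFailed",
                "data": record.get("data", ""),
            })
    return {"records": output}
```

Marking bad records as `ProcessingFailed` instead of raising lets the rest of the batch continue toward Redshift, which is one reasonable design choice among several.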

In this module, we will challenge you to independently set up a MySQL database on AWS Aurora. This assignment reinforces database fundamentals and AWS RDS deployment skills.

What's included

2 videos, 1 assignment

In this module, you will independently implement a lakehouse architecture for a commercial flights dataset. This assignment consolidates your understanding of data lakes, Delta tables, and metadata integration with Glue.

What's included

4 videos, 1 assignment

In this module, you'll build a real-time system that dynamically adjusts pricing for e-commerce users based on events. This assignment emphasizes practical business applications of event-driven data processing.

What's included

2 videos, 1 assignment

In this module, you'll build a real-time streaming job to process Spotify metrics. This assignment helps you apply PySpark and AWS Glue in real-world streaming scenarios.

What's included

2 videos, 1 assignment

In this final module, you'll implement CI/CD automation for Lambda functions using GitHub Actions. This assignment solidifies your DevOps knowledge and prepares you for real-world deployment automation.

What's included

2 videos, 2 assignments

Instructor

Packt - Course Instructors
Packt
1,471 courses, 392,127 learners

Offered by

Packt
