Big Data Engineer Roadmap
A Practical, Step-by-Step Guide to Becoming a Job-Ready Big Data Engineer
By Akber Shaikh 📺 YouTube 💼 LinkedIn
Important Advice
Many students feel excited when they first see a roadmap. But excitement alone doesn’t bring
results. What matters is consistency—showing up every day, even when you don’t feel
motivated.
1. Start Small, But Daily – Study for at least 1 hour every day. Even small progress
adds up faster than waiting for the “perfect time.”
2. Track Progress – Keep a notebook or use a simple tracker. Ticking off tasks
gives you a sense of achievement and keeps you motivated.
3. Learn by Doing – Don’t just read or watch videos. Practice by coding, building
small projects, and solving problems.
4. Stay Consistent, Not Perfect – Missing one day is okay, but never miss two days
in a row. Getting back on track quickly is the real secret.
5. Find a Learning Buddy or Community – Share your goals with a friend or online
group. Accountability makes it much harder to quit.
Remember: Motivation starts you, but discipline finishes the journey. Follow this roadmap with
consistency, and you’ll be amazed at how much you can achieve in just a few months.
Stage 1 – Basics of Programming and Data
🐍 Python – The Core Language of Data
Why learn: It’s the most widely used language for data processing, cleaning, automation, and analysis.
Learn these basics:
● Syntax & basics → variables, data types, loops, conditionals, functions
● Data structures → lists, dictionaries, sets, tuples
● File handling → read/write CSV, JSON, text files
● Libraries:
○ pandas → data cleaning & manipulation
○ numpy → numerical operations
○ datetime → date/time handling
● Error handling → try/except
● Basic OOP (optional)
● Scripting & automation → run Python scripts automatically
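A minimal sketch that ties several of these basics together (pandas, datetime, file handling, try/except); the file name ratings.csv and its columns are made up for illustration:

```python
import pandas as pd            # data cleaning & manipulation
from datetime import datetime  # date/time handling

def load_ratings(path):
    """Read a CSV, drop incomplete rows, and stamp when it was loaded."""
    try:
        df = pd.read_csv(path)             # file handling: read CSV
    except FileNotFoundError:              # error handling: try/except
        print(f"{path} not found; returning an empty table")
        return pd.DataFrame()
    df = df.dropna()                       # basic cleaning
    df["loaded_at"] = datetime.now()       # add a timestamp column
    return df

ratings = load_ratings("ratings.csv")      # hypothetical input file
print(ratings.head())
```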
🔗 Best Resources:
● freeCodeCamp Python Course
● W3Schools Python Tutorial
● Kaggle Python Micro-Course
🗃️ SQL – Querying and Managing Data
Why learn: Used to store, retrieve, and manipulate data efficiently in databases.
Learn these basics:
● CRUD: SELECT, INSERT, UPDATE, DELETE
● Filtering & sorting: WHERE, LIKE, ORDER BY
● Aggregation: GROUP BY, COUNT, SUM, AVG
● Joins: INNER, LEFT, RIGHT
● Subqueries & CTEs
● Indexes for optimization
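You can practice every clause above locally with Python's built-in sqlite3 module; the movies table and its rows below are invented for illustration:

```python
import sqlite3

conn = sqlite3.connect(":memory:")       # throwaway in-memory database
cur = conn.cursor()

# CRUD: create and populate a toy table (schema is hypothetical)
cur.execute("CREATE TABLE movies (title TEXT, genre TEXT, rating REAL)")
cur.executemany("INSERT INTO movies VALUES (?, ?, ?)", [
    ("Movie A", "Drama", 8.1),
    ("Movie B", "Comedy", 6.9),
    ("Movie C", "Drama", 7.4),
])

# Filtering, aggregation, and sorting in one query
cur.execute("""
    SELECT genre, COUNT(*) AS n, AVG(rating) AS avg_rating
    FROM movies
    WHERE rating > 6.0
    GROUP BY genre
    ORDER BY avg_rating DESC
""")
print(cur.fetchall())    # [('Drama', 2, 7.75), ('Comedy', 1, 6.9)]
conn.close()
```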
🔗 Best Resources:
● Kaggle SQL Micro-Course
● Mode SQL Tutorial
● freeCodeCamp SQL Full Course
💻 Linux – Command Line & System Basics
Why learn: Big Data tools primarily run on Linux-based systems.
Learn these basics:
● Navigate folders & files → cd, ls, pwd
● File management → touch, rm, mv, cp
● Permissions → chmod, chown
● Data commands → grep, awk, sed, sort
● Processes → ps, top, kill
● Basic shell scripting
● Edit files → nano, vim
● Software installation & environment variables
🔗 Best Resources:
● Traversy Media Linux Crash Course
● Linux Journey
● OverTheWire Bandit Practice
🔧 Git – Version Control & Collaboration
Why learn: To track, manage, and collaborate on code.
Learn these basics:
● git init, add, commit
● git status, git log
● Branching & merging
● git remote add/push/pull
● Undo mistakes → git reset, git revert
● Collaboration → pull requests, code reviews
🔗 Best Resources:
● freeCodeCamp Git & GitHub Full Course
● Atlassian Git Tutorial
● Git Documentation
🧩 Mini Project – Movie Data Analyzer
Goal: Combine Python, SQL, Linux, and Git.
Dataset: Movies dataset from Kaggle
Steps:
1. Load CSV using Python → clean and analyze data (top genres, ratings).
2. Store cleaned data in SQLite/PostgreSQL and run SQL queries.
3. Use Linux commands to automate runs.
4. Push code to GitHub.
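Here is a hedged skeleton for steps 1 and 2; the file name movies.csv and its columns are assumptions, so adjust them to whichever Kaggle dataset you pick:

```python
import sqlite3
import pandas as pd

# Step 1: load and clean (movies.csv and its columns are hypothetical)
df = pd.read_csv("movies.csv")
df = df.dropna(subset=["genre", "rating"])

# Quick analysis: top genres by average rating
print(df.groupby("genre")["rating"].mean().sort_values(ascending=False).head())

# Step 2: store the cleaned data in SQLite and query it with SQL
conn = sqlite3.connect("movies.db")
df.to_sql("movies", conn, if_exists="replace", index=False)
top = conn.execute(
    "SELECT genre, AVG(rating) FROM movies GROUP BY genre ORDER BY 2 DESC LIMIT 5"
).fetchall()
print(top)
conn.close()
```

From there, step 3 is a one-line cron entry that runs the script on a schedule, and step 4 is an ordinary git add, commit, and push.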
Stage 2 – Real-World Data Flow
🧱 Databases (DBMS)
Concepts: Tables, rows, columns, primary/foreign keys, joins, indexes, constraints.
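These concepts are easiest to see in a tiny schema. A sketch using Python's sqlite3 (the users/orders tables are invented for illustration):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces FKs only when enabled

# Primary key, foreign key, and a NOT NULL constraint
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute("""
    CREATE TABLE orders (
        id      INTEGER PRIMARY KEY,
        user_id INTEGER NOT NULL REFERENCES users(id),
        amount  REAL
    )
""")
conn.execute("CREATE INDEX idx_orders_user ON orders(user_id)")  # index for faster lookups

conn.execute("INSERT INTO users VALUES (1, 'Asha')")
conn.execute("INSERT INTO orders VALUES (1, 1, 499.0)")

# Join rows across the two tables via the key relationship
for row in conn.execute(
    "SELECT u.name, o.amount FROM users u JOIN orders o ON o.user_id = u.id"
):
    print(row)    # ('Asha', 499.0)
conn.close()
```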
Resources:
● Gate Smashers DBMS Playlist
● GeeksforGeeks DBMS Notes
🏗️ Data Warehouses
Learn:
● Difference between DBMS & Data Warehouse
● Fact & Dimension tables
● Partitioning, clustering
● Tools: BigQuery, Redshift, Snowflake
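The fact/dimension split becomes concrete with a small star-schema query. A sketch in sqlite3 for practice (the same query shape works in BigQuery, Redshift, or Snowflake; all names are invented):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Dimension table: descriptive attributes
    CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT);
    -- Fact table: measurable events, keyed to the dimension
    CREATE TABLE fact_sales (product_id INTEGER, sale_date TEXT, amount REAL);

    INSERT INTO dim_product VALUES (1, 'Books'), (2, 'Toys');
    INSERT INTO fact_sales VALUES (1, '2024-01-05', 20.0),
                                  (1, '2024-01-06', 15.0),
                                  (2, '2024-01-05', 30.0);
""")

# Typical warehouse query: aggregate facts, grouped by a dimension attribute
for row in conn.execute("""
    SELECT d.category, SUM(f.amount) AS revenue
    FROM fact_sales f JOIN dim_product d USING (product_id)
    GROUP BY d.category
"""):
    print(row)    # ('Books', 35.0) then ('Toys', 30.0)
conn.close()
```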
Resources:
● BigQuery Basics – Google Cloud Skills Boost
● Snowflake Free Training
● AWS Redshift Tutorials
⚙️ ETL Pipelines (Extract, Transform, Load)
Learn:
● Extract: CSV, APIs, databases
● Transform: clean, normalize, standardize
● Load: store in databases or data warehouses
● Automation with Python scripts
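A minimal end-to-end ETL sketch in plain Python; the API URL and field names are placeholders, not a real service:

```python
import sqlite3
import pandas as pd
import requests  # pip install requests

# Extract: pull JSON from an API (URL and fields here are hypothetical)
resp = requests.get("https://example.com/api/orders", timeout=30)
resp.raise_for_status()
df = pd.DataFrame(resp.json())

# Transform: clean and standardize
df = df.dropna(subset=["order_id"])
df["country"] = df["country"].str.strip().str.upper()

# Load: append into a local database (swap in a warehouse loader later)
with sqlite3.connect("warehouse.db") as conn:
    df.to_sql("orders", conn, if_exists="append", index=False)
```

Scheduling this script with cron (or, later, Airflow) covers the automation bullet.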
Resources:
● Simplilearn ETL Basics
● YouTube – ETL Pipeline Project
⚡ Batch vs Real-Time Processing
● Batch = scheduled jobs that process accumulated data in chunks (e.g., daily/weekly reports)
● Real-Time = processing events as they arrive (e.g., live dashboards, instant notifications)
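The difference is about when the work happens, not what the work is. A toy contrast in plain Python (the numbers are made up):

```python
# Batch: process everything that has accumulated, on a schedule
def batch_total(events):
    return sum(events)              # runs once per day/week over the full set

# Real-time: update the result as each event arrives
def stream_totals(events):
    total = 0.0
    for amount in events:           # imagine each iteration is a live event
        total += amount
        yield total                 # an always-current value for dashboards/alerts

sales = [10.0, 25.0, 5.0]
print(batch_total(sales))            # 40.0, computed after the fact
print(list(stream_totals(sales)))    # [10.0, 35.0, 40.0], computed as data arrives
```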
Resources:
● Stream vs Batch Processing – DataCamp
● Google Cloud Pub/Sub Intro
🌐 Distributed Systems Basics
Learn: Consistency, Partitioning, Replication, CAP Theorem
Resources:
● System Design Basics – Tech Dummies
● ByteByteGo YouTube Channel
Stage 3 – Big Data Tools
🔥 Apache Spark
Learn:
● RDDs & DataFrames
● Transformations & Actions (map, filter, groupBy)
● Reading/writing CSV, Parquet
● Spark SQL, Spark Streaming
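A minimal PySpark sketch covering DataFrames, lazy transformations, an action, and Parquet output. It assumes pyspark is installed locally; sales.csv and its columns are placeholders:

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("spark-basics").getOrCreate()

# Read a CSV into a DataFrame (path and columns are hypothetical)
df = spark.read.csv("sales.csv", header=True, inferSchema=True)

# Transformations are lazy: nothing runs until an action is called
result = (df.filter(F.col("amount") > 0)            # transformation
            .groupBy("country")                     # transformation
            .agg(F.sum("amount").alias("revenue")))

result.show()                                       # action: triggers execution
result.write.mode("overwrite").parquet("out/revenue")  # columnar output
spark.stop()
```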
Resources:
● Databricks Free Spark Course
● freeCodeCamp Spark Crash Course
📡 Apache Kafka
Learn: Topics, partitions, producers/consumers, streaming data ingestion
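A hedged sketch using the third-party kafka-python package; it assumes a broker running on localhost:9092 and uses a made-up topic called events:

```python
from kafka import KafkaProducer, KafkaConsumer  # pip install kafka-python

# Producer: write messages to a topic
producer = KafkaProducer(bootstrap_servers="localhost:9092")
producer.send("events", b'{"user": 1, "action": "click"}')
producer.flush()

# Consumer: read messages back from the same topic
consumer = KafkaConsumer(
    "events",
    bootstrap_servers="localhost:9092",
    auto_offset_reset="earliest",   # start from the oldest message
    consumer_timeout_ms=5000,       # stop iterating after 5s of silence
)
for message in consumer:
    print(message.partition, message.offset, message.value)
```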
Resources:
● Confluent Kafka Tutorials
● Kafka in 1 Hour – freeCodeCamp
🐘 Hadoop & Hive
Learn: HDFS basics, MapReduce, Hive queries, file formats (Parquet, Avro, ORC)
Resources:
● Simplilearn Hadoop Tutorial
● Hive Tutorial – TutorialsPoint
Stage 4 – Automate & Optimize Pipelines
🪶 Apache Airflow
Learn: DAGs, task scheduling, monitoring, error handling
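A minimal DAG sketch in the Airflow 2.x style (the task bodies are placeholders; older 2.x versions spell the schedule argument schedule_interval):

```python
from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pulling data")    # placeholder task body

def load():
    print("loading data")    # placeholder task body

with DAG(
    dag_id="daily_etl",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",       # task scheduling
    catchup=False,
) as dag:
    t1 = PythonOperator(task_id="extract", python_callable=extract)
    t2 = PythonOperator(task_id="load", python_callable=load)
    t1 >> t2                 # dependency: extract runs before load
```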
Resources:
● Airflow Docs
● YouTube: Airflow Crash Course
🧱 Databricks
Learn: Workspaces, notebooks, cluster setup, job scheduling
Resource: Databricks Academy
🐳 Docker
Learn: Containers, Dockerfiles, local testing
Resources:
● freeCodeCamp Docker Full Course
● Docker Docs
🔁 CI/CD Basics
Learn: GitHub Actions / Jenkins / GitLab CI
Resource: GitHub Actions Docs
✅ Data Quality Tools
● Great Expectations → Data validation framework
● Soda → Optional, scalable data testing
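Both tools boil down to the same idea: declare expectations about your data, then fail loudly when they break. Since the great_expectations API has changed a lot between versions, here is the underlying idea in plain pandas rather than either tool's API (file and column names are invented):

```python
import pandas as pd

df = pd.read_csv("orders.csv")   # hypothetical input

# The core of data-quality tooling: declare expectations, then assert them
checks = {
    "order_id is never null": df["order_id"].notna().all(),
    "amount is non-negative": (df["amount"] >= 0).all(),
    "status has known values": df["status"].isin(["paid", "refunded"]).all(),
}
failed = [name for name, ok in checks.items() if not ok]
if failed:
    raise ValueError(f"Data quality checks failed: {failed}")
```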
Resources:
● Great Expectations Docs
● Soda Academy
Stage 5 – Cloud & Deployment
☁️ Cloud Platforms
Pick one: AWS / GCP / Azure
Recommended: AWS (the most in-demand)
Learn:
● Data storage → S3 and Redshift on AWS, BigQuery on GCP
● Compute → EC2 on AWS, Dataproc on GCP
● IAM roles, monitoring, and alerts
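If you start with AWS, a good first hands-on exercise is moving files into S3 with the boto3 library. A minimal sketch (assumes boto3 is installed, credentials are configured, and the bucket name is a placeholder for your own):

```python
import boto3  # pip install boto3; credentials come from `aws configure` or an IAM role

s3 = boto3.client("s3")

# Upload a local file into a bucket (bucket name and key are hypothetical)
s3.upload_file("cleaned_movies.csv", "my-data-bucket", "raw/cleaned_movies.csv")

# List what landed under that prefix
resp = s3.list_objects_v2(Bucket="my-data-bucket", Prefix="raw/")
for obj in resp.get("Contents", []):
    print(obj["Key"], obj["Size"])
```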
Resources:
● AWS Skill Builder
● Google Cloud Skills Boost
● Azure Fundamentals by Microsoft Learn
Stage 6 – Projects & Portfolio
💼 Project 1 – End-to-End E-Commerce ETL Pipeline
Build a pipeline for e-commerce data:
● Collect → Clean → Store → Automate → Report
● Tools: Python, SQL, Airflow, Docker, Databricks
⚙️ Project 2 – Real-Time Analytics System
Process streaming data for dashboards or recommendations
● Tools: Kafka, Spark Streaming, Python
📊 Project 3 – Big Data Warehouse with Dashboard
● Combine datasets, create a warehouse, and visualize KPIs
● Tools: Snowflake/BigQuery + Tableau/Power BI
✅ Tip: Document every project with a short blog or GitHub README — it increases your credibility for
interviews.
Make the most of the Roadmap
To fully understand how to use this roadmap effectively, you MUST watch my video:
👉 [Link]
In the video, I explain:
● Why I chose these specific skills
● What common mistakes beginners make
● How to actually succeed as a beginner
Share the Knowledge
If this guide helps, share it with friends who want to break into big data engineering. One shared roadmap could change someone’s career.