Starred repositories
Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
A high-performance observability data pipeline.
This is the central repository for all the materials related to Apache Kafka For Absolute Beginners Course by Prashant Pandey.
A unified framework for machine learning with time series
KSP Autopilot for Final Rendezvous and Docking Operations
Free open public domain football datasets for national & international football club leagues & cups from around the world - home of the leagues.db download/release
Apache Spark - A unified analytics engine for large-scale data processing
Apache Spark Course Material
High performance Kafka consumer for InfluxDB. Supports collectd message formats.
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
[EOL] This is a place for various components in the Kubernetes ecosystem that aren't part of the Kubernetes core.
REST job server for Apache Spark
A library that provides an embeddable, persistent key-value store for fast storage.
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Time Series Benchmark Suite, a tool for comparing and evaluating databases for time series data
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
A list of useful resources to learn Data Engineering from scratch
An Awesome List of Open-Source Data Engineering Projects
Protocol Buffers - Google's data interchange format
Apache Druid: a high performance real-time analytics database.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
The interactive graphing library for Python ✨ This project now includes Plotly Express!