Skip to content
View LucasRGoes's full-sized avatar
  • Agriness Edge
  • Campinas, São Paulo

Organizations

@AgrinessEdgeIoT @thedatasociety

Block or report LucasRGoes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....

Jinja 74 16 Updated Feb 2, 2025

A high-performance observability data pipeline.

Rust 18,714 1,659 Updated Feb 8, 2025

This is the central repository for all the materials related to Apache Kafka For Absolute Beginners Course by Prashant Pandey.

Java 83 142 Updated Oct 1, 2020

A unified framework for machine learning with time series

Python 8,180 1,453 Updated Feb 7, 2025

KSP Autopilot for Final Rendezvous and Docking Operations

Python 2 Updated Dec 19, 2018

Free open public domain football datasets for national & international football club leagues & cups from around the world - home of the leagues.db download/release

41 11 Updated Jan 7, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,480 28,481 Updated Feb 8, 2025

Apache Spark Course Material

Scala 87 157 Updated Apr 21, 2023

High performance Kafka consumer for InfluxDB. Supports collectd message formats.

Python 215 53 Updated Dec 8, 2022

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Go 2,856 1,393 Updated Feb 5, 2025

[EOL] This is a place for various components in the Kubernetes ecosystem that aren't part of the Kubernetes core.

Go 2,454 1,682 Updated Apr 17, 2019

Spark library for easy MongoDB access

Scala 308 96 Updated Aug 30, 2016

Scala toolchain for InfluxDB

Scala 27 11 Updated Jul 29, 2024

Big Data Ecosystem Docker

VBA 402 319 Updated Apr 29, 2023

REST job server for Apache Spark

Scala 2,836 993 Updated Jan 4, 2025

A library that provides an embeddable, persistent key-value store for fast storage.

C++ 29,088 6,398 Updated Feb 8, 2025

Data-Centric Pipelines and Data Versioning

Go 6,201 569 Updated Feb 3, 2025

A k8s operator for InfluxDB

Go 76 32 Updated Apr 1, 2022

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 40,483 5,336 Updated Feb 8, 2025

Time Series Benchmark Suite, a tool for comparing and evaluating databases for time series data

Go 1,323 310 Updated Aug 6, 2024

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 18,072 2,410 Updated Feb 1, 2025

A list of useful resources to learn Data Engineering from scratch

3,640 521 Updated Jun 19, 2024

An Awesome List of Open-Source Data Engineering Projects

2,270 381 Updated Oct 4, 2024

Protocol Buffers - Google's data interchange format

C++ 66,549 15,607 Updated Feb 8, 2025

Apache Druid: a high performance real-time analytics database.

Java 13,604 3,725 Updated Feb 7, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,247 5,986 Updated Feb 8, 2025

Context aware, pluggable and customizable data protection and de-identification SDK for text and images

Python 4,108 595 Updated Feb 5, 2025

Open Source AI/ML Platform

Python 8,514 791 Updated Feb 7, 2025

The interactive graphing library for Python ✨ This project now includes Plotly Express!

Python 16,686 2,588 Updated Feb 7, 2025
Next