Skip to content
View holdenk's full-sized avatar

Sponsors

@clstaudt

Organizations

@sparklingpandas @high-performance-spark @scalingpythonml @PigsCanFlyLabs

Block or report holdenk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Encrypt files uploaded to a Django application.

Python 7 1 Updated Jun 19, 2022

Let's RAG it RAW without fancy frameworks

Jupyter Notebook 26 2 Updated Sep 15, 2024

A collection of learning resources for curious software engineers

Python 47,270 3,749 Updated Jan 31, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,172 498 Updated May 3, 2024

pyspark methods to enhance developer productivity 📣 👯 🎉

Python 659 99 Updated Dec 6, 2024

Apache Spark Connect Client for Golang

Go 190 37 Updated Jan 31, 2025

A Python Library to support running data quality rules while the spark job is running⚡

Python 171 44 Updated Jan 24, 2025

A tool to validate data, built around Apache Spark.

Scala 101 34 Updated Feb 1, 2025

8-bit CUDA functions for PyTorch, modified to build on Jetson Xavier

C 14 11 Updated Apr 26, 2023

LLM finetuned for medical question answering

Python 504 59 Updated Sep 7, 2023

English SDK for Apache Spark

Python 850 131 Updated Jun 12, 2024

Python Stream Processing

Python 1,624 68 Updated Jan 31, 2025

A modular implementation of timely dataflow in Rust

Rust 3,358 276 Updated Feb 4, 2025

State of the Art Natural Language Processing

Scala 3,912 719 Updated Feb 8, 2025

Your self-hosted, globally interconnected microblogging community

Ruby 47,682 7,088 Updated Feb 7, 2025

A POC for multilingual UDFs in KSQL

Shell 3 Updated Mar 16, 2019

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Go 37,862 5,884 Updated Feb 8, 2025

Prototype implementation of Service-Level Fault Injection Testing in Python.

Python 69 2 Updated Nov 5, 2022

Replaces the factory firmware on the SwitchBot Plug Mini via OTA, enabling the use of Tasmota without disassembling the unit.

C 115 18 Updated Jul 21, 2024

A Label Printer Application

C 250 31 Updated Jan 23, 2025

lakeFS - Data version control for your data lake | Git for data

Go 4,538 367 Updated Feb 7, 2025
Scala 13 Updated Sep 20, 2023

Java imap nio client that is designed to scale well for thousands of connections per machine and reduce contention when using large number of threads and cpus.

Java 58 50 Updated Aug 23, 2023

Inofficial Qualcomm Firehose / Sahara / Streaming / Diag Tools :)

Python 1,758 406 Updated Jan 27, 2025

Reverse Engineering Furby Connect's Bluetooth Protocol and Update Format

JavaScript 485 84 Updated Jan 16, 2024

Open source version of Arrow Connect Platform developed by Arrow Electronics

Java 6 1 Updated Jan 12, 2023

A PowerDNS pipe dynamic backend to serve dnswall style A, AAAA and PTR DNS records for any given CIDR ranges.

Python 23 10 Updated Aug 5, 2024

Main repository for the Howlr application

JavaScript 47 15 Updated Feb 26, 2022
Kotlin 4 1 Updated Oct 29, 2020
Next