Skip to content
View gaodayue's full-sized avatar

Block or report gaodayue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 3,984 228 Updated Nov 26, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 53,303 7,784 Updated Nov 30, 2024

A launch point for your personal nvim configuration

Lua 20,173 24,637 Updated Nov 20, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 100,176 7,989 Updated Nov 30, 2024

Use your Neovim like using Cursor AI IDE!

Lua 7,472 278 Updated Nov 29, 2024

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

1,762 111 Updated Nov 29, 2024

Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and s…

45 1 Updated Sep 12, 2024

MLX: An array framework for Apple silicon

C++ 17,617 1,017 Updated Nov 29, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 11,986 1,510 Updated Aug 18, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,461 397 Updated Nov 27, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,809 1,124 Updated May 23, 2024

Pseudonymization with Cryptography

C++ 16 1 Updated Sep 10, 2024

Curated list of project-based tutorials

205,785 26,851 Updated Aug 15, 2024

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 9,986 218 Updated Nov 29, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,071 4,721 Updated Dec 1, 2024

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java 1,048 132 Updated Nov 30, 2024

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Rust 4,853 338 Updated Nov 29, 2024

The repo for SOSP23 paper: FIFO queues are all you need for cache evictions

C 96 9 Updated Jun 13, 2024

ZeroMQ core engine in C++, implements ZMTP/3.1

C++ 9,796 2,364 Updated Nov 24, 2024

BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)

C++ 228 20 Updated May 7, 2024

Fast Static Symbol Table (FSST): efficient random-access string compression

C++ 394 38 Updated Aug 10, 2024

Static reflection for enums (to string, from string, iteration) for modern C++, work with any enum type without any macro or boilerplate code

C++ 4,990 445 Updated Nov 23, 2024

Self-Driving Database Management System from Carnegie Mellon University

C++ 1,745 503 Updated Nov 8, 2022

Apply a coding style with clang-format only to new code added to an existing code base.

Python 203 55 Updated Sep 2, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 95,548 15,514 Updated Nov 29, 2024

Stable Diffusion web UI

Python 143,670 27,028 Updated Nov 28, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,605 4,057 Updated Jul 17, 2024

Inference code for Llama models

Python 56,610 9,586 Updated Aug 18, 2024

LLM inference in C/C++

C++ 68,533 9,845 Updated Nov 30, 2024

eBPF-based Networking, Security, and Observability

Go 20,354 2,981 Updated Dec 1, 2024
Next