hsgui's starred repositories

Google Research

Jupyter Notebook 35,022 8,020 Updated Mar 3, 2025

Extended pickling support for Python objects

Python 1,710 173 Updated Jan 14, 2025

Header-only C++/Python library for fast approximate nearest neighbors

C++ 4,561 680 Updated Aug 11, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,654 28,526 Updated Mar 4, 2025

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Jupyter Notebook 671 71 Updated Oct 30, 2024

An autoregressive character-level language model for making more things

Python 2,881 756 Updated Jun 4, 2024

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 901 65 Updated Feb 7, 2025

Fully open reproduction of DeepSeek-R1

Python 22,001 1,967 Updated Mar 3, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,748 502 Updated Mar 3, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,277 1,130 Updated Mar 4, 2025

aider is AI pair programming in your terminal

Python 28,530 2,588 Updated Mar 3, 2025

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 8,030 4,304 Updated Mar 4, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,797 168 Updated Mar 4, 2025

Inspirational Mapping

Vue 2,430 62 Updated Sep 25, 2024

Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 11,090 805 Updated Mar 1, 2025

Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data

C++ 1,289 119 Updated Feb 27, 2025

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,405 2,253 Updated Feb 1, 2025

Knowhere is a vector search engine, integrating FAISS, HNSW, DiskANN.

C++ 205 85 Updated Mar 4, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,776 6,061 Updated Mar 4, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 26,876 5,485 Updated Feb 22, 2025

bpftune uses BPF to auto-tune Linux systems

C 1,551 85 Updated Feb 27, 2025

eBPF Observability - Distributed Tracing and Profiling

Go 3,143 350 Updated Mar 4, 2025

cuVS - a library for vector search and clustering on the GPU

Cuda 317 87 Updated Mar 3, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 32,888 3,052 Updated Mar 4, 2025

Development repository for the Triton language and compiler

MLIR 14,690 1,831 Updated Mar 4, 2025

Computational geometry and spatial indexing on the sphere

C++ 2,400 317 Updated Mar 1, 2025

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 10,774 1,230 Updated Feb 26, 2025

Fast C++ logging library.

C++ 25,443 4,693 Updated Feb 11, 2025