- New York
-
03:03
(UTC -05:00) - ryanzhangofficial.github.io
- in/ryan-zhang-44557727b
Stars
An extremely fast Python linter and code formatter, written in Rust.
An extremely fast Python package and project manager, written in Rust.
A computer algebra system written in pure Python
OpenCE (Open Context Engineering): A community toolkit to implement, evaluate, and combine LLM context strategies (RAG, ACE, Compression). Evolved from the `ACE-open` reproduction.
Measure and optimize the energy consumption of your AI applications!
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthrβ¦
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
βποΈ The minimal, blazing-fast, and infinitely customizable prompt for any shell!
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"
π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A high-throughput and memory-efficient inference and serving engine for LLMs
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
The Organization for Transformative Works (OTW) - Archive Of Our Own (AO3) Project
SWE-bench: Can Language Models Resolve Real-world Github Issues?
The library for web and native user interfaces.
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs oβ¦
Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models
A curated list of awesome approaches to AI model routing
A Robotics Simulator for Autodesk Fusion CAD Designs
π€ smolagents: a barebones library for agents that think in code.


