LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python · 4,767 stars · 433 forks · Updated Dec 4, 2024

Header-only C++/Python library for fast approximate nearest neighbors

C++ · 4,417 stars · 655 forks · Updated Aug 11, 2024

Inference code for Llama models

Python · 56,656 stars · 9,594 forks · Updated Aug 18, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python · 2,141 stars · 211 forks · Updated Oct 8, 2024