Lists (2)
Sort Name ascending (A-Z)
Stars
The official implementation of the EMNLP 2023 paper LLM-FP4
An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).
An Open-Ended Embodied Agent with Large Language Models
A library for mechanistic interpretability of GPT-style language models