Stars
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
记录cv算法工程师的成长之路,分享计算机视觉和模型压缩部署技术栈笔记。https://harleyszhang.github.io/cv_note/
⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。
A self-learning tutorail for CUDA High Performance Programing.
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
这里是sonder的有点又没有太多用的笔记本 “一个人只有不停的写作,才不会被人海淹没” 你可以通过这个链接来访问网页版:https://space.keter.top
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
👀 Apply YOLOv8 exported with ONNX or TensorRT(FP16, INT8) to the Real-time camera
模型部署白皮书(CUDA|ONNX|TensorRT|C++)🚀🚀🚀
欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
A high-throughput and memory-efficient inference and serving engine for LLMs
[ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers
[AAAI-2022] Up to 100x Faster Data-free Knowledge Distillation
The repository of Expanding Small-Scale Datasets with Guided Imagination (NeurIPS 2023).
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。
Vitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.
An opinionated list of awesome Python frameworks, libraries, software and resources.
A high-performance, extensible Python AOT compiler.
Tutorials based on the ASL (American Sign Language) dataset