Skip to content
View lenghuixing0330's full-sized avatar

Block or report lenghuixing0330

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

Python 393 34 Updated Jan 14, 2025

记录cv算法工程师的成长之路,分享计算机视觉和模型压缩部署技术栈笔记。https://harleyszhang.github.io/cv_note/

Python 2,475 384 Updated Dec 30, 2024

⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。

Python 6,324 1,144 Updated Jan 16, 2025

A self-learning tutorail for CUDA High Performance Programing.

JavaScript 328 36 Updated Dec 17, 2024

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1,987 207 Updated Jan 17, 2025

这里是sonder的有点又没有太多用的笔记本 “一个人只有不停的写作,才不会被人海淹没” 你可以通过这个链接来访问网页版:https://space.keter.top

Shell 47 1 Updated Jan 15, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 13,056 1,464 Updated Jan 15, 2025

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 384 44 Updated Jan 17, 2025

learning how CUDA works

Cuda 189 24 Updated Aug 16, 2024

Material for gpu-mode lectures

Jupyter Notebook 3,505 353 Updated Jan 6, 2025

👀 Apply YOLOv8 exported with ONNX or TensorRT(FP16, INT8) to the Real-time camera

Python 43 3 Updated May 23, 2024

YOLOv8 implementation using PyTorch

Python 129 23 Updated Aug 22, 2023

模型部署白皮书(CUDA|ONNX|TensorRT|C++)🚀🚀🚀

166 44 Updated Sep 18, 2024

欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。

Jupyter Notebook 284 33 Updated Jul 21, 2024

📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

3,246 217 Updated Jan 16, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,954 5,214 Updated Jan 19, 2025

LLM Inference benchmark

Python 377 33 Updated Jul 23, 2024

[ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers

Python 23 1 Updated Jul 7, 2024

[AAAI-2022] Up to 100x Faster Data-free Knowledge Distillation

Python 67 11 Updated Oct 24, 2022

The repository of Expanding Small-Scale Datasets with Guided Imagination (NeurIPS 2023).

Jupyter Notebook 77 3 Updated Jan 15, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 42,216 4,531 Updated Jan 18, 2025

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Python 1,610 242 Updated Mar 28, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 11,956 1,733 Updated Jan 2, 2025

深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。

Python 413 58 Updated Jan 4, 2025

Vitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.

Python 1,526 640 Updated Sep 12, 2024

An opinionated list of awesome Python frameworks, libraries, software and resources.

Python 230,932 25,142 Updated Aug 11, 2024

Cherno C++课程个人笔记

C++ 129 32 Updated Feb 14, 2024

A high-performance, extensible Python AOT compiler.

C++ 417 40 Updated Sep 26, 2023

Tutorials based on the ASL (American Sign Language) dataset

Jupyter Notebook 8 8 Updated Apr 28, 2024
Next