Starred repositories (aCayF)

C/C++ Performance Profiler

C++ 4,280 353 Updated Jan 31, 2025

An extensible framework that instruments python programs at runtime

Python 8 Updated Mar 26, 2021

The book "Performance Analysis and Tuning on Modern CPUs"

TeX 3,114 209 Updated Feb 20, 2025

Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…

Rust 6,058 1,089 Updated Apr 30, 2025

Heat map generation tools

Perl 320 58 Updated Oct 6, 2021

Materials for learning SGLang

395 29 Updated Apr 25, 2025

Fast and memory-efficient exact attention

Python 17,170 1,653 Updated Apr 30, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 13,832 1,634 Updated Apr 30, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 46,241 7,171 Updated Apr 30, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 874 98 Updated Apr 30, 2025

NVIDIA driver packaging for RHEL

Shell 9 4 Updated Feb 24, 2025

Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysis on Intel(R) Processor Graphics easily

C++ 225 58 Updated Apr 4, 2025

Effing package management! Build packages for multiple platforms (deb, rpm, etc) with great ease and sanity.

Ruby 11,280 1,074 Updated Mar 6, 2025

⚡ Energy consumption metrology agent. Let "scaph" dive and bring back the metrics that will help you make your systems and applications more sustainable!

Rust 1,724 111 Updated Feb 11, 2025

Funding rate arbitrage on cryptocurrency.

Python 221 50 Updated Nov 5, 2023

Source Code for 'Foundations of Libvirt Development' by W. David Ashley

Python 4 4 Updated Jun 19, 2019

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …

C++ 313 55 Updated Apr 25, 2025

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 803 181 Updated Apr 30, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 47,305 6,716 Updated Apr 20, 2025

Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch

Python 344 20 Updated Apr 30, 2025

A playbook for systematically maximizing the performance of deep learning models.

28,609 2,346 Updated Jun 18, 2024

📚 A curated list of Awesome LLM/VLM Inference Papers with code: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism, etc.

Python 3,928 274 Updated Apr 27, 2025

A collection of compiler learning resources.

Python 2,371 346 Updated Mar 19, 2025

How to optimize some algorithms in CUDA.

Cuda 2,140 189 Updated Apr 30, 2025

A collection of metrics to profile a single deep learning model or compare two different deep learning models

Python 26 9 Updated Nov 7, 2023

DeepLearning Framework Performance Profiling Toolkit

Python 284 27 Updated Mar 28, 2022

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Python 1,040 126 Updated Apr 17, 2024

Open-source implementation of Google Vizier for hyperparameter tuning

Jupyter Notebook 1,556 257 Updated Nov 11, 2019