Starred repositories
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysis on Intel(R) Processor Graphics easily
Effing package management! Build packages for multiple platforms (deb, rpm, etc) with great ease and sanity.
⚡ Energy consumption metrology agent. Let "scaph" dive and bring back the metrics that will help you make your systems and applications more sustainable !
Funding rate arbitrage on cryptocurrency.
Source Code for 'Foundations of Libvirt Development' by W. David Ashley
Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
A playbook for systematically maximizing the performance of deep learning models.
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
how to optimize some algorithm in cuda.
A collection of metrics to profile a single deep learning model or compare two different deep learning models
DeepLearning Framework Performance Profiling Toolkit
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
Open-source implementation of Google Vizier for hyper parameters tuning
A curated list of awesome open source tools and commercial products for autoML hyperparameter tuning 🚀
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
A library to analyze PyTorch traces.
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Awesome utilities for performance profiling