A unified framework for scalable computing
A Pythonic framework to simplify AI service building
Everything you need to build state-of-the-art foundation models
A high-performance ML model serving framework that offers dynamic batching
An MLOps framework to package, deploy, monitor and manage models
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers
FlashInfer: Kernel Library for LLM Serving
Easy-to-use deep learning framework with 3 key features
A library for accelerating Transformer models on NVIDIA GPUs
Unified Model Serving Framework
Superduper: Integrate AI models and machine learning workflows
Trainable models and NN optimization tools
A set of Docker images for training and serving models in TensorFlow
A lightweight vision library for performing large object detection
Powering Amazon custom machine learning chips
OpenMMLab Model Deployment Framework
Framework that is dedicated to making neural data processing
LLMFlows - Simple, Explicit and Transparent LLM Apps
A computer vision framework to create and deploy apps in minutes
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Implementation of "Tree of Thoughts"
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation
OpenMMLab Video Perception Toolbox