[GuideLLM Refactor] mock server package creation#357
Merged
markurtz merged 2 commits into features/refactor/base (Sep 29, 2025)
Conversation
Contributor
Pull Request Overview
Introduces a comprehensive mock server implementation that simulates OpenAI and vLLM APIs with configurable timing characteristics and response patterns. This enables realistic performance testing and validation of GuideLLM benchmarking workflows without requiring actual model deployments.
- Modular architecture with configuration, handlers, models, server, and utilities components
- HTTP request handlers for OpenAI-compatible endpoints with streaming and non-streaming support
- High-performance Sanic-based server with CORS support and proper error handling
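The handler code itself isn't reproduced in this review summary, but a streaming chat-completions endpoint of this kind typically emits OpenAI-style `chat.completion.chunk` objects as server-sent events, terminated by a `[DONE]` sentinel. A minimal sketch (function names are illustrative, not taken from the PR):

```python
import json

def sse_chunk(delta_text, model="mock-model", finish_reason=None):
    """Format one OpenAI-style chat.completion.chunk as a server-sent event."""
    payload = {
        "object": "chat.completion.chunk",
        "model": model,
        "choices": [{
            "index": 0,
            "delta": {"content": delta_text} if delta_text else {},
            "finish_reason": finish_reason,
        }],
    }
    return f"data: {json.dumps(payload)}\n\n"

def stream_response(tokens, model="mock-model"):
    """Yield one SSE event per token, a final stop chunk, then [DONE]."""
    for tok in tokens:
        yield sse_chunk(tok, model)
    yield sse_chunk("", model, finish_reason="stop")
    yield "data: [DONE]\n\n"
```

In a Sanic handler the generator would be written to the response incrementally; the sketch above only shows the wire format.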
Reviewed Changes
Copilot reviewed 11 out of 11 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| src/guidellm/mock_server/__init__.py | Package initialization exposing main MockServer and MockServerConfig classes |
| src/guidellm/mock_server/config.py | Pydantic-based configuration management with environment variable support |
| src/guidellm/mock_server/handlers/__init__.py | Handler module initialization exposing request handlers |
| src/guidellm/mock_server/handlers/chat_completions.py | OpenAI chat completions endpoint implementation with streaming support |
| src/guidellm/mock_server/handlers/completions.py | Legacy OpenAI completions endpoint with timing simulation |
| src/guidellm/mock_server/handlers/tokenizer.py | vLLM-compatible tokenization and detokenization endpoints |
| src/guidellm/mock_server/models.py | Pydantic models for request/response validation and API compatibility |
| src/guidellm/mock_server/server.py | Sanic-based HTTP server with middleware, routes, and error handling |
| src/guidellm/mock_server/utils.py | Mock tokenizer and text generation utilities for testing |
| tests/unit/mock_server/__init__.py | Test package initialization |
| tests/unit/mock_server/test_server.py | Comprehensive integration tests using real HTTP server instances |
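The table describes config.py as Pydantic-based configuration with environment-variable support. The stdlib-only sketch below mimics that idea without the pydantic dependency; the field names and the `MOCK_SERVER_` prefix are assumptions for illustration, not the PR's actual schema:

```python
import os
from dataclasses import dataclass, field

@dataclass
class MockServerConfig:
    """Illustrative config: each field falls back to an env var, then a default."""
    host: str = field(default_factory=lambda: os.getenv("MOCK_SERVER_HOST", "127.0.0.1"))
    port: int = field(default_factory=lambda: int(os.getenv("MOCK_SERVER_PORT", "8000")))
    model: str = field(default_factory=lambda: os.getenv("MOCK_SERVER_MODEL", "mock-model"))
    # Latency knobs: time-to-first-token and inter-token latency, in milliseconds
    ttft_ms: float = field(default_factory=lambda: float(os.getenv("MOCK_SERVER_TTFT_MS", "50")))
    itl_ms: float = field(default_factory=lambda: float(os.getenv("MOCK_SERVER_ITL_MS", "10")))
```

A pydantic-settings `BaseSettings` subclass would give the same behavior with validation and an `env_prefix` for free.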
Force-pushed: 2515465 to 4834767, 841e82c to b1cce19, ca2be85 to bb98193
Signed-off-by: Mark Kurtz <mark.j.kurtz@gmail.com>
sjmonson approved these changes on Sep 23, 2025
sjmonson added a commit that referenced this pull request on Sep 23, 2025
…nto features/refactor/base-draft [GuideLLM Refactor] mock server package creation #357
Base automatically changed from features/refactor/benchmarker to features/refactor/base on September 29, 2025 14:19
Summary
Introduces a comprehensive mock server implementation that simulates OpenAI and vLLM APIs with configurable timing characteristics and response patterns. The mock server enables realistic performance testing and validation of GuideLLM benchmarking workflows without requiring actual model deployments, supporting both streaming and non-streaming endpoints with proper token counting, latency simulation (TTFT/ITL), and error handling.
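To make the TTFT/ITL latency simulation concrete: the first generated token is delayed by the time-to-first-token, and every subsequent token by the inter-token latency. A small sketch of that schedule (function and parameter names are hypothetical, not from the PR):

```python
def token_delays(n_tokens, ttft_s=0.05, itl_s=0.01):
    """Per-token sleep schedule: TTFT before the first token, ITL before the rest."""
    if n_tokens <= 0:
        return []
    return [ttft_s] + [itl_s] * (n_tokens - 1)

def total_latency(n_tokens, ttft_s=0.05, itl_s=0.01):
    """End-to-end generation time implied by the schedule."""
    return sum(token_delays(n_tokens, ttft_s, itl_s))
```

In a streaming handler, the server would `await asyncio.sleep(d)` for each delay before emitting the corresponding chunk, so benchmark clients observe realistic TTFT and ITL distributions.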
Details
- `mock_server` package with modular architecture including configuration, handlers, models, server, and utilities
- `MockServerConfig` with Pydantic settings for centralized configuration management supporting environment variables
- `ChatCompletionsHandler` for `/v1/chat/completions` with streaming support
- `CompletionsHandler` for the `/v1/completions` legacy endpoint
- `TokenizerHandler` for vLLM-compatible `/tokenize` and `/detokenize` endpoints

Test Plan
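The utilities module is described as providing a mock tokenizer for testing. A deterministic stand-in like the following (purely illustrative; the PR's actual tokenizer may differ) is enough to serve `/tokenize` and `/detokenize` requests and produce stable token counts:

```python
import re

class MockTokenizer:
    """Deterministic word/punctuation tokenizer for testing only."""
    _pattern = re.compile(r"\w+|[^\w\s]")

    def tokenize(self, text):
        """Split text into word and punctuation tokens."""
        return self._pattern.findall(text)

    def detokenize(self, tokens):
        """Naive space-join; a real detokenizer would fix punctuation spacing."""
        return " ".join(tokens)

    def count(self, text):
        """Token count, e.g. for usage fields in mock responses."""
        return len(self.tokenize(text))
```

Determinism matters here: usage counts in mock responses stay reproducible across benchmark runs without loading a real tokenizer.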
Related Issues
Use of AI
## WRITTEN BY AI ##)