Added rampup to constant rate type #549

Merged
jaredoconnell merged 2 commits into vllm-project:main from jaredoconnell:feature/constant-rampup
Jan 23, 2026

@jaredoconnell
Collaborator

Summary

Allows a linear rampup of the request rate for the constant rate profile.

Test Plan

The simplest test is to run a short constant-rate benchmark at 4 requests per second with a long rampup and confirm that the request rate ramps up as expected.
New tests are also included.
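
To make the expected behavior concrete, here is a minimal sketch of what a linear rampup to a constant rate looks like numerically. It is not the code from this PR: the function names `ramped_rate` and `request_time`, and the assumption that the rate starts at zero and rises linearly to the target over the rampup window, are illustrative only.

```python
import math


def ramped_rate(elapsed: float, target_rate: float, rampup: float) -> float:
    """Instantaneous request rate (req/s) after `elapsed` seconds.

    Hypothetical helper: assumes the rate grows linearly from 0 to
    `target_rate` over `rampup` seconds, then stays constant.
    """
    if rampup <= 0 or elapsed >= rampup:
        return target_rate
    return target_rate * max(elapsed, 0.0) / rampup


def request_time(n: int, target_rate: float, rampup: float) -> float:
    """Time (s) at which the n-th request (1-based) would be sent.

    Derived by integrating the ramped rate: during the ramp the cumulative
    request count is target_rate * t**2 / (2 * rampup).
    """
    if rampup <= 0:
        return n / target_rate
    # Requests sent during the ramp = area under the linear ramp.
    ramp_requests = target_rate * rampup / 2
    if n <= ramp_requests:
        return math.sqrt(2 * rampup * n / target_rate)
    # After the ramp, requests are evenly spaced at the target rate.
    return rampup + (n - ramp_requests) / target_rate


if __name__ == "__main__":
    # 4 requests per second with a 10 second rampup, as in the test plan.
    for second in (0, 5, 10, 15):
        print(second, ramped_rate(second, target_rate=4.0, rampup=10.0))
```

Under these assumptions, a 4 requests per second run with a 10 second rampup would be sending at half the target rate at the 5 second mark and at the full rate from 10 seconds onward.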

Related Issues

Fulfills part of the goals of #428


  • "I certify that all code in this PR is my own, except as noted below."

Use of AI

  • Includes AI-assisted code completion
  • Includes code generated by an AI application
  • Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes ## WRITTEN BY AI ##)

Generated-by: Cursor AI
Signed-off-by: Jared O'Connell <joconnel@redhat.com>
Signed-off-by: Jared O'Connell <joconnel@redhat.com>
@sjmonson force-pushed the feature/constant-rampup branch from d895b07 to eb6a803 on January 23, 2026 18:10
Collaborator

@sjmonson left a comment

LGTM

@jaredoconnell merged commit f9f1e31 into vllm-project:main on Jan 23, 2026
16 checks passed
@jaredoconnell deleted the feature/constant-rampup branch on January 23, 2026 18:27
openclaw-architect bot pushed a commit to tylertitsworth/skills that referenced this pull request Mar 3, 2026
…ring Configurations section

- --rampup applies only to constant rate-type benchmarks per v0.5.3 changelog
  and implementing PR (vllm-project/guidellm#549); remove incorrect claim
  that it works with throughput and concurrent profiles
- Restore 'Comparing Configurations' section deleted unintentionally in prior commit
- yaml output format confirmed in source (GenerativeBenchmarkerSerialized registers json+yaml)
- ghcr.io/vllm-project/guidellm container image confirmed in release.yml workflow
tylertitsworth pushed a commit to tylertitsworth/skills that referenced this pull request Mar 4, 2026
… flag (#416)

* feat(guidellm): v0.5.3 — multimodal benchmarking, mock server, rampup flag

- Add version header (0.5.3), update install to guidellm[recommended]
- Add container-based usage via ghcr.io/vllm-project/guidellm
- Add --rampup flag: linearly ramp up request rate before steady-state
- Add Multimodal Benchmarking section: audio transcription/translation
  and vision/image benchmarks via --request-type flag
- Add Mock Server section: guidellm mock-server for CI testing without
  a live GPU backend; configurable latency, workers, model
- Document v0.5.0 throughput mode breaking change (explicit --rate required)
- Add yaml to output formats

* fix(guidellm): scope --rampup to constant profile only; restore Comparing Configurations section

- --rampup applies only to constant rate-type benchmarks per v0.5.3 changelog
  and implementing PR (vllm-project/guidellm#549); remove incorrect claim
  that it works with throughput and concurrent profiles
- Restore 'Comparing Configurations' section deleted unintentionally in prior commit
- yaml output format confirmed in source (GenerativeBenchmarkerSerialized registers json+yaml)
- ghcr.io/vllm-project/guidellm container image confirmed in release.yml workflow

* fix(guidellm): confirm --outputs yaml and container image source; add registry link

---------

Co-authored-by: openclaw-architect[bot] <2819504+openclaw-architect[bot]@users.noreply.github.com>