[Security] Reject non-finite temperature and repetition_penalty values by jperezdealgaba · Pull Request #45116 · vllm-project/vllm

jperezdealgaba · 2026-06-10T07:20:08Z

Summary

Add math.isfinite() validation for temperature and repetition_penalty in SamplingParams._verify_args().
NaN and Infinity bypass Python's comparison operators (<, >) due to IEEE 754 float semantics, allowing them to propagate to GPU sampling kernels where they cause undefined behavior or CUDA crashes.
Addresses advisory GHSA-7h4p-rffg-7823.

Test plan

Added tests/samplers/test_non_finite_params.py with 12 parametrized tests covering NaN, +Inf, -Inf rejection and valid value acceptance for both parameters.
pytest tests/samplers/test_non_finite_params.py -v — all 12 tests pass.
pre-commit run --files vllm/sampling_params.py tests/samplers/test_non_finite_params.py — all hooks pass.

Add math.isfinite() validation for temperature and repetition_penalty in SamplingParams._verify_args(). NaN and Infinity bypass comparison operators (< , >) in Python's IEEE 754 semantics, allowing them to propagate to GPU sampling kernels where they cause undefined behavior or CUDA crashes. Signed-off-by: Juan Perez de Algaba Sierra <jperezde@redhat.com> Signed-off-by: jperezde <jperezde@redhat.com>

hmellor

Would it be faster to use <= float('inf')? These checks are going to run a lot so we should try to use the fastest method

jperezdealgaba · 2026-06-10T20:46:03Z

@hmellor The problem using float is that <= float('inf') would still let temperature=Infinity through to the GPU kernels, which is the vulnerability I am trying to fix here.

That's the reason why. Do you think of a better solution for it? I don't really know it

hmellor · 2026-06-10T22:52:28Z

Oh yeah of course, I should have suggested < not <=.

Anyway, a micro benchmark suggests that comparison is only faster if we write inf to a variable for reuse (not what I originally suggested)

Approach	ns/call
`x < inf` (prebound)	~10
`math.isfinite(x)`	~15
`-inf < x < inf`	~25
`x < float('inf')` (inline)	~50

Let's stick with what you have

vllm-project#45116) Signed-off-by: jperezde <jperezde@redhat.com>

vllm-project#45116) Signed-off-by: jperezde <jperezde@redhat.com> Signed-off-by: divineearthly <divineearthly@gmail.com>

vllm-project#45116) Signed-off-by: jperezde <jperezde@redhat.com>

jperezdealgaba requested review from NickLucche and njhill as code owners June 10, 2026 07:20

hmellor reviewed Jun 10, 2026

View reviewed changes

hmellor approved these changes Jun 10, 2026

View reviewed changes

hmellor enabled auto-merge (squash) June 10, 2026 22:54

github-actions Bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jun 10, 2026

vllm-bot merged commit d598d23 into vllm-project:main Jun 11, 2026
63 of 65 checks passed

Saddss pushed a commit to Saddss/vllm that referenced this pull request Jun 14, 2026

[Security] Reject non-finite temperature and repetition_penalty values (

a40d49d

vllm-project#45116) Signed-off-by: jperezde <jperezde@redhat.com>

DRL-NextGen mentioned this pull request Jun 18, 2026

feat(cli): Add nexus validate benchmarks command IBM/algorithm-nexus#136

Merged

vivek8123 pushed a commit to odh-on-pz/vllm-upstream that referenced this pull request Jun 18, 2026

[Security] Reject non-finite temperature and repetition_penalty values (

a60f953

vllm-project#45116) Signed-off-by: jperezde <jperezde@redhat.com>

This was referenced Jun 19, 2026

Cp benchmark test pr 2 IBM/algorithm-nexus#142

Open

build(deps): update vLLM to 0.23.0 in candidate and 0.21.0 in product IBM/algorithm-nexus#143

Merged

This was referenced Jun 22, 2026

build(deps): update dependencies IBM/algorithm-nexus#146

Closed

build(hooks): update pre-commit hooks IBM/algorithm-nexus#147

Merged

tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Jun 22, 2026

[Security] Reject non-finite temperature and repetition_penalty values (

3542e7f

vllm-project#45116) Signed-off-by: jperezde <jperezde@redhat.com>

nixpkgs-security-tracker Bot mentioned this pull request Jun 23, 2026

vLLM: security issues < 0.23.1rc0 NixOS/nixpkgs#534486

Open

nkzhenhua pushed a commit to nkzhenhua/vllm that referenced this pull request Jun 24, 2026

[Security] Reject non-finite temperature and repetition_penalty values (

e14f56a

vllm-project#45116) Signed-off-by: jperezde <jperezde@redhat.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Security] Reject non-finite temperature and repetition_penalty values#45116

[Security] Reject non-finite temperature and repetition_penalty values#45116
vllm-bot merged 1 commit into
vllm-project:mainfrom
jperezdealgaba:fix/reject-non-finite-temperature

jperezdealgaba commented Jun 10, 2026

hmellor left a comment

jperezdealgaba commented Jun 10, 2026

hmellor commented Jun 10, 2026

Uh oh!

Labels

4 participants

Uh oh!

Uh oh!

Conversation

jperezdealgaba commented Jun 10, 2026

Summary

Test plan

hmellor left a comment

Choose a reason for hiding this comment

jperezdealgaba commented Jun 10, 2026

hmellor commented Jun 10, 2026

Uh oh!

Labels

4 participants