[Rust Frontend] Support thinking_token_budget for chat and completions by ricky-chaoju · Pull Request #46137 · vllm-project/vllm

ricky-chaoju · 2026-06-19T08:07:26Z

Add support for the thinking_token_budget request parameter in the Rust frontend, for both /v1/chat/completions and /v1/completions, reaching parity with the Python frontend (tracked in #44280, "Request compatibility and validation").

Previously the Rust frontend parsed thinking_token_budget on the chat endpoint but explicitly rejected it ("thinking_token_budget is not supported."), and the completions endpoint did not expose it at all. The V1 engine has supported the parameter since #20859 and the Python frontend exposes it on both endpoints, so this was a pure frontend gap.

Normalization mirrors Python's validate_thinking_token_budget (None/-1 → unlimited; other negatives rejected; no upper bound) and happens once during lowering, so chat, completions, and /inference/v1/generate behave consistently.
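The normalization rule described above can be sketched roughly as follows. This is a minimal illustration and not the code from this PR: the function name `normalize_thinking_token_budget`, the `i64`/`u64` types, and the error string are all assumptions made for the example. Only the rule itself (None or -1 means unlimited, other negatives are rejected, no upper bound) comes from the description.

```rust
/// Hypothetical helper illustrating the rule from the PR description.
/// The name, signature, and error text are assumptions, not the PR's API.
///
/// Returns `Ok(None)` for "unlimited", `Ok(Some(n))` for an explicit budget,
/// and `Err(..)` for any negative value other than -1.
fn normalize_thinking_token_budget(raw: Option<i64>) -> Result<Option<u64>, String> {
    match raw {
        // A missing value and the -1 sentinel both mean "no budget".
        None | Some(-1) => Ok(None),
        // Any other negative value is invalid (illustrative error message).
        Some(n) if n < 0 => Err(format!(
            "thinking_token_budget must be -1 or non-negative, got {n}"
        )),
        // Non-negative values pass through unchanged; no upper bound is applied.
        // The cast is lossless because n >= 0 here.
        Some(n) => Ok(Some(n as u64)),
    }
}
```

Under these assumptions, `normalize_thinking_token_budget(Some(-1))` and `normalize_thinking_token_budget(None)` both yield `Ok(None)`, `Some(1024)` yields `Ok(Some(1024))`, and `Some(-2)` yields an error. Running the check once in a shared lowering step, rather than per endpoint, is what would keep the endpoints consistent.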

Signed-off-by: RickyChen / 陳昭儒 <ricky.chen@infinirc.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: dac87f740d

BugenZhao

LGTM

Signed-off-by: Bugen Zhao <i@bugenzhao.com>

vllm-project#46137) Co-authored-by: Bugen Zhao <i@bugenzhao.com> Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Signed-off-by: RickyChen / 陳昭儒 <ricky.chen@infinirc.com> Signed-off-by: Bugen Zhao <i@bugenzhao.com>

vllm-project#46137) Co-authored-by: Bugen Zhao <i@bugenzhao.com> Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Signed-off-by: RickyChen / 陳昭儒 <ricky.chen@infinirc.com> Signed-off-by: Bugen Zhao <i@bugenzhao.com> Signed-off-by: Qiang Li <qiang.li2@amd.com>

[Rust Frontend] Support thinking_token_budget for chat and completions

dac87f7

Signed-off-by: RickyChen / 陳昭儒 <ricky.chen@infinirc.com>

ricky-chaoju requested review from BugenZhao and njhill as code owners June 19, 2026 08:07

mergify Bot added the rust label Jun 19, 2026

chatgpt-codex-connector Bot reviewed Jun 19, 2026

Comment thread rust/src/text/src/lower.rs

BugenZhao added the ready label (ONLY add when PR is ready to merge/full CI is needed) Jun 22, 2026

BugenZhao approved these changes Jun 22, 2026

Comment thread rust/src/engine-core-client/src/protocol/mod.rs Outdated

BugenZhao and others added 2 commits June 22, 2026 15:04

Update rust/src/engine-core-client/src/protocol/mod.rs

f39593c

Signed-off-by: Bugen Zhao <i@bugenzhao.com>

Merge branch 'main' into feat/rust-thinking-token-budget

2427515

BugenZhao merged commit 80abe0d into vllm-project:main Jun 22, 2026
23 checks passed

ricky-chaoju deleted the feat/rust-thinking-token-budget branch June 22, 2026 09:12

BugenZhao mentioned this pull request Jun 22, 2026

[Rust Frontend] Correct --reasoning-parser semantics #46359

Merged

4 tasks

BugenZhao merged 3 commits into vllm-project:main from ricky-chaoju:feat/rust-thinking-token-budget