[Don't Merge] Update cli args qwen #946

Draft
zhentaocc wants to merge 3 commits into SemiAnalysisAI:main from zhentaocc:update_cli_args_qwen

@zhentaocc
No description provided.

@claude claude bot left a comment

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@zhentaocc zhentaocc force-pushed the update_cli_args_qwen branch from 746b52e to 7992757 on March 25, 2026 19:52
Chen, Todd added 3 commits on March 25, 2026 14:52
* Added CONTEXT_LENGTH and MAX_PREFILL_TOKENS variables for better configuration.
* Updated launch_server command with new options: --tokenizer-worker-num, --enable-aiter-allreduce-fusion, --cuda-graph-max-bs, --context-length, --disable-radix-cache, --max-prefill-tokens, and --scheduler-recv-interval.
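
As a rough illustration of the commit message above, the new launch_server options could be assembled as an argument list like the one below. This is only a sketch: the flag names are taken from the commit message, but which flags take a value, the helper name `build_server_args`, and every numeric value are assumptions made for illustration and are not taken from this PR's diff.

```python
import shlex


def build_server_args(context_length, max_prefill_tokens,
                      tokenizer_workers, cuda_graph_max_bs, recv_interval):
    """Return the extra launch_server flags named in the commit message.

    Hypothetical helper: the split between valued and boolean flags is an
    assumption, not something confirmed by the PR.
    """
    return [
        "--tokenizer-worker-num", str(tokenizer_workers),
        "--enable-aiter-allreduce-fusion",      # assumed boolean switch
        "--cuda-graph-max-bs", str(cuda_graph_max_bs),
        "--context-length", str(context_length),
        "--disable-radix-cache",                # assumed boolean switch
        "--max-prefill-tokens", str(max_prefill_tokens),
        "--scheduler-recv-interval", str(recv_interval),
    ]


# Placeholder values standing in for the CONTEXT_LENGTH and
# MAX_PREFILL_TOKENS variables mentioned in the commit; the real
# values are not shown in this conversation.
CONTEXT_LENGTH = 8192
MAX_PREFILL_TOKENS = 8192

args = build_server_args(
    context_length=CONTEXT_LENGTH,
    max_prefill_tokens=MAX_PREFILL_TOKENS,
    tokenizer_workers=2,     # placeholder
    cuda_graph_max_bs=64,    # placeholder
    recv_interval=10,        # placeholder
)

# Print the flags as a single shell-quoted string for inspection.
print(shlex.join(args))
```

Keeping the two sizes in named variables, as the commit describes, means a benchmark script can change them in one place without editing the command line itself.
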
… benchmark configurations for MI355X, enhancing performance with updated CLI arguments.
….yaml to v0.5.9, ensuring compatibility with recent changes.
@zhentaocc zhentaocc force-pushed the update_cli_args_qwen branch from 7992757 to a8cf15f on March 25, 2026 19:54
@zhentaocc zhentaocc marked this pull request as draft on March 26, 2026 06:26