-
Notifications
You must be signed in to change notification settings - Fork 672
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add timeout to OpenAI client instantiations in eval and benchmark
#4450
opened Mar 23, 2026 by
badhra-ajaz
Loading…
2 tasks
fix(async_engine): make safe_run cancellation cleanup reliable with shield and SafeRunException
Bug:P0
#4439
opened Mar 20, 2026 by
lvhan028
Loading…
Split/tool call args json for qwen3coder tool calls (Qwen3.5)
#4433
opened Mar 19, 2026 by
lapy
Loading…
[Feature] Support n parameter in /v1/chat/completions and /v1/completions
#4419
opened Mar 17, 2026 by
ziyangliu-666
Loading…
Assign sequential api_server ports when proxy_url is unset
improvement
#4416
opened Mar 16, 2026 by
lvhan028
Loading…
[Fix][Feat] Fix worker sorting with external pg bundles & Support persistent buffer for update_params
#4397
opened Mar 6, 2026 by
CyCle1024
Loading…
Support MiniMax-M2 in TurboMind engine
enhancement
New feature or request
#4343
opened Feb 10, 2026 by
zh-nj
Loading…
add preliminary support for EP(single-node) of turbomind backend
enhancement
New feature or request
#4332
opened Feb 6, 2026 by
irexyc
Loading…
change ascend paged attention from BSH format to TND format for better performace
#4295
opened Jan 27, 2026 by
jinminxi104
•
Draft
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.