InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 672
Star 7.7k

Code
Issues 517
Pull requests 59
Discussions
Actions
Projects
Security 1
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: InternLM/lmdeploy

Labels 34 Milestones 0

New pull request New

59 Open 2,076 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add timeout to OpenAI client instantiations in eval and benchmark

#4450 opened Mar 23, 2026 by badhra-ajaz

Loading…

2 tasks

[ascend] fix prefix caching

#4448 opened Mar 23, 2026 by yao-fengchen • Draft

fix security issues

#4447 opened Mar 23, 2026 by CUHKSZzxy

Loading…

docs: add gitcgr code graph badge

#4446 opened Mar 22, 2026 by vitali87

Loading…

feat(turbomind): linear-attention prefix caching for Gated Delta Net

#4445 opened Mar 22, 2026 by lapy • Draft

fix(async_engine): make safe_run cancellation cleanup reliable with shield and SafeRunException Bug:P0

#4439 opened Mar 20, 2026 by lvhan028

Loading…

[WIP]: qwen35 mtp WIP

#4437 opened Mar 20, 2026 by RunningLeon • Draft

Split/tool call args json for qwen3coder tool calls (Qwen3.5)

#4433 opened Mar 19, 2026 by lapy

Loading…

update h config and add glm4.7 mtp test

#4424 opened Mar 18, 2026 by littlegy

Loading…

lmdeploy support kernel block size

#4421 opened Mar 17, 2026 by Tsundoku958

Loading…

[Feature] Support n parameter in /v1/chat/completions and /v1/completions

#4419 opened Mar 17, 2026 by ziyangliu-666

Loading…

Assign sequential api_server ports when proxy_url is unset improvement

#4416 opened Mar 16, 2026 by lvhan028

Loading…

[WIP] Support qwen3-omni

#4411 opened Mar 13, 2026 by CUHKSZzxy • Draft

2 of 4 tasks

fix metrics Bug:P1

#4410 opened Mar 13, 2026 by CUHKSZzxy

Loading…

[ci] add nightly docker build workflow

#4406 opened Mar 12, 2026 by zhulinJulia24

Loading…

Add model deployment best practice section in user guide

#4399 opened Mar 9, 2026 by lvhan028 • Draft

[Fix][Feat] Fix worker sorting with external pg bundles & Support persistent buffer for update_params

#4397 opened Mar 6, 2026 by CyCle1024

Loading…

[Ascend] support qwen3.5 27B

#4395 opened Mar 4, 2026 by wanfengcxz • Draft

add tool and reasoning test

#4388 opened Mar 2, 2026 by littlegy

Loading…

Fix Structured Output for GPT-OSS Models

#4386 opened Mar 2, 2026 by windreamer

Loading…

Improve proxy server improvement

#4354 opened Feb 12, 2026 by lvhan028

Loading…

Support MiniMax-M2 in TurboMind engine enhancement

New feature or request

#4343 opened Feb 10, 2026 by zh-nj

Loading…

[WIP]Support torch compile

#4336 opened Feb 8, 2026 by grimoire • Draft

add preliminary support for EP(single-node) of turbomind backend enhancement

New feature or request

#4332 opened Feb 6, 2026 by irexyc

Loading…

change ascend paged attention from BSH format to TND format for better performace

#4295 opened Jan 27, 2026 by jinminxi104 • Draft

Previous 1 2 3 Next

Previous Next

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!