Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix client-disconnect session leaks in PyTorch MP engine
#4655 opened Jun 6, 2026 by grimoire Collaborator Loading…
1 task
fix cancel stopped seq
#4654 opened Jun 5, 2026 by RunningLeon Collaborator Draft
Improve kernel dispatch for dp>1
#4653 opened Jun 5, 2026 by RunningLeon Collaborator Loading…
Fix qwen3.5 mtp
#4652 opened Jun 5, 2026 by RunningLeon Collaborator Loading…
[ci] add mtp test config in pr_test
#4651 opened Jun 5, 2026 by zhulinJulia24 Collaborator Loading…
dispatch to decoding internally even dp force to prefill
#4650 opened Jun 5, 2026 by RunningLeon Collaborator Loading…
Improve engine health monitoring and wakeup scheduling Bug:P0
#4645 opened Jun 4, 2026 by lvhan028 Collaborator Loading…
refactor: unify interleaved MRoPE rotary embedding
#4644 opened Jun 3, 2026 by CUHKSZzxy Collaborator Draft
Add multimodal preprocessing metrics
#4640 opened Jun 1, 2026 by CUHKSZzxy Collaborator Loading…
support disaggregated weight update planned feature
#4638 opened May 29, 2026 by irexyc Collaborator Loading…
update gated delta rule state layout improvement
#4636 opened May 28, 2026 by grimoire Collaborator Loading…
TEST: Improve tool test
#4632 opened May 28, 2026 by littlegy Contributor Loading…
[WIP] Interleave long-context prefill chunks with decode
#4631 opened May 28, 2026 by grimoire Collaborator Draft
1 task
modify save model in lite module improvement
#4624 opened May 26, 2026 by 43758726 Contributor Loading…
Refactor prefix caching improvement
#4618 opened May 24, 2026 by grimoire Collaborator Loading…
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy Loading…
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Contributor Loading…
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
update anthropic endpoint test
#4594 opened May 18, 2026 by littlegy Contributor Loading…
docs(advance): add Add a New Speculative Decoding Method guide documentation Improvements or additions to documentation
#4589 opened May 17, 2026 by SuperMarioYL Loading…
4 tasks done
refactor ascend multinode
#4588 opened May 15, 2026 by yao-fengchen Collaborator Draft
ProTip! Updated in the last three days: updated:>2026-06-04.