Pull requests: InternLM/lmdeploy
Fix client-disconnect session leaks in PyTorch MP engine
#4655
opened Jun 6, 2026 by
grimoire
Collaborator
1 task
dispatch to decoding internally even dp force to prefill
#4650
opened Jun 5, 2026 by
RunningLeon
Collaborator
Fix unit test by removing latest-transformers-unsupported models
Bug:P1
#4649
opened Jun 5, 2026 by
lvhan028
Collaborator
Improve engine health monitoring and wakeup scheduling
Bug:P0
#4645
opened Jun 4, 2026 by
lvhan028
Collaborator
Extend v1/messages by introducing token-in/out and returning routed experts
improvement
#4642
opened Jun 1, 2026 by
lvhan028
Collaborator
support disaggregated weight update
planned feature
#4638
opened May 29, 2026 by
irexyc
Collaborator
update gated delta rule state layout
improvement
#4636
opened May 28, 2026 by
grimoire
Collaborator
modify save model in lite module
improvement
#4624
opened May 26, 2026 by
43758726
Contributor
feat(turbomind): support priority schedule policy
#4614
opened May 22, 2026 by
4mengy
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605
opened May 21, 2026 by
windreamer
Collaborator
1 of 4 tasks
[WIP]: Support reuse routed experts on eviction
#4599
opened May 19, 2026 by
RunningLeon
Collaborator
docs(advance): add Add a New Speculative Decoding Method guide
documentation
#4589
opened May 17, 2026 by
SuperMarioYL
4 tasks done