
Pull requests: InternLM/lmdeploy


Pull requests list

Moe Reduce kernel
#4228 opened Dec 21, 2025 by grimoire
Refactor turbomind engine (label: improvement)
#4223 opened Dec 19, 2025 by lzhangzz
bump version to v0.11.1
#4221 opened Dec 18, 2025 by lvhan028
Improve aborting all sessions (label: improvement)
#4215 opened Dec 16, 2025 by lvhan028
Update benchmark serving script for proxy_server
#4173 opened Dec 1, 2025 by lvhan028
Update installation.md
#4095 opened Nov 3, 2025 by krescent
Add step_map to track token decoding order in DLLM
#4057 opened Oct 21, 2025 by Auraithm
4 tasks done
[POC] Encoder Disaggregation
#4047 opened Oct 17, 2025 by CUHKSZzxy Draft
2 of 7 tasks
quant blocked fp8 (label: enhancement)
#4018 opened Sep 29, 2025 by CUHKSZzxy
4 of 5 tasks
Add reasoning parser for GPT-OSS style channels.
#3998 opened Sep 21, 2025 by GY19A
[PD Disaggregation] remote recomputation preemption
#3854 opened Aug 18, 2025 by JimyMa
add ppu quick start doc (label: documentation)
#3841 opened Aug 14, 2025 by guozixu2001
support pp in turbomind
#3768 opened Jul 24, 2025 by irexyc Draft
1 task
fix: make project PEP 517 compliant.
#3738 opened Jul 17, 2025 by windreamer
5 tasks done
Add dp rank into proxy node status
#3720 opened Jul 8, 2025 by RunningLeon
[ascend] support lora (label: enhancement)
#3715 opened Jul 7, 2025 by tangzhiyi11 Draft
expert distributions
#3709 opened Jul 4, 2025 by CUHKSZzxy