RTP-LLM

community

https://github.com/alibaba/rtp-llm

AI & ML interests

None defined yet.

Recent Activity

Met4physics submitted a paper about 3 hours ago

Rethinking Cross-Layer Information Routing in Diffusion Transformers

zykRichard submitted a paper 3 days ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

zykRichard authored a paper 5 days ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

View all activity

Papers

Rethinking Cross-Layer Information Routing in Diffusion Transformers

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

View all Papers

models 1

RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo

31B • Updated Dec 29, 2025 • 1 • 2

datasets 0

None public yet