Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning Paper • 2402.10110 • Published Feb 15, 2024 • 3
What Matters in Transformers? Not All Attention is Needed Paper • 2406.15786 • Published Jun 22, 2024 • 33
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts Paper • 2503.05066 • Published Mar 7, 2025 • 4
SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning Paper • 2504.10369 • Published Apr 14, 2025 • 2
CogniPair: From LLM Chatbots to Conscious AI Agents -- GNWT-Based Multi-Agent Digital Twins for Social Pairing -- Dating & Hiring Applications Paper • 2506.03543 • Published Jun 4, 2025 • 1
CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs Paper • 2505.13778 • Published May 19, 2025 • 5
Dense Video Understanding with Gated Residual Tokenization Paper • 2509.14199 • Published Sep 17, 2025 • 3
Understanding and Harnessing Sparsity in Unified Multimodal Models Paper • 2512.02351 • Published Dec 2, 2025 • 4
Making Large Language Models Efficient Dense Retrievers Paper • 2512.20612 • Published Dec 23, 2025 • 4
ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model Paper • 2603.22281 • Published 22 days ago • 17
Demystifying When Pruning Works via Representation Hierarchies Paper • 2603.24652 • Published 9 days ago • 19
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers Paper • 2410.13184 • Published Oct 17, 2024 • 3
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning Paper • 2310.11716 • Published Oct 18, 2023 • 6
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts Paper • 2310.09832 • Published Oct 15, 2023 • 1
Vega-MT: The JD Explore Academy Translation System for WMT22 Paper • 2209.09444 • Published Sep 20, 2022
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning Paper • 2402.00530 • Published Feb 1, 2024 • 2