-
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 230 -
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Paper • 2410.20672 • Published • 7 -
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus
Paper • 2603.20105 • Published • 37
J C
dark-pen
AI & ML interests
None yet
Recent Activity
liked a model about 2 hours ago
McGill-NLP/A3-Qwen3.5-4B liked a dataset about 2 hours ago
McGill-NLP/A3-Synth upvoted a paper about 2 hours ago
Structured Distillation of Web Agent Capabilities Enables Generalization