Edilmo Palencia
edilmo
AI & ML interests
None yet
Organizations
None yet
Agentic RL
-
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Paper • 2508.13167 • Published • 129 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 236 -
PRewrite: Prompt Rewriting with Reinforcement Learning
Paper • 2401.08189 • Published -
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Paper • 2509.11543 • Published • 50
CoreML
Context Engineering
Agentic RL
-
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Paper • 2508.13167 • Published • 129 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 236 -
PRewrite: Prompt Rewriting with Reinforcement Learning
Paper • 2401.08189 • Published -
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Paper • 2509.11543 • Published • 50