🔄 In a Training Loop

Joel Wang

joelhenwang

3 339 72

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Appleroll-Research/PromptForest-Llama-Prompt-Guard-2-86M

upvoted a paper 1 day ago

SkillRise: Agentic Reinforcement Learning for Cross-Task Skill Evolution

liked a model 2 days ago

XYZAILab/XYZ-Aquila-mini

Organizations

liked a model 1 day ago

Appleroll-Research/PromptForest-Llama-Prompt-Guard-2-86M

0.3B • Updated Feb 14 • 17 • 1

upvoted a paper 1 day ago

SkillRise: Agentic Reinforcement Learning for Cross-Task Skill Evolution

Paper • 2607.26784 • Published 2 days ago • 24

liked 3 models 2 days ago

upvoted 9 papers 3 days ago

Multi-Head Latent Control: A Unified Interface for LLM Agent Decision Making

Paper • 2607.14277 • Published 16 days ago • 10

A Frozen 12B Beats Frontier Models on Verified Work: 100% Accuracy, 0 Tokens, Bit-Exact, Forever

Paper • 2607.23806 • Published 5 days ago • 7

Sol-Attn: Accelerating Video Generation Inference via On-the-Fly Attention Sparsification

Paper • 2607.24027 • Published 4 days ago • 34

Kimi K3: Open Frontier Intelligence

Paper • 2607.24653 • Published 4 days ago • 403

IDEAgent: Agentic Quality-Diversity Search for Research Idea Generation

Paper • 2607.22375 • Published 7 days ago • 9

Interactive Training 2: Auditable Control Plane for Live Model Training

Paper • 2607.18314 • Published 14 days ago • 20

Agentic Context Management: Solving Agent Memory and Cost by Treating Them as Lifecycle and Architecture Problems

Paper • 2607.21503 • Published 8 days ago • 24

Molt: A Scalable PyTorch-Native Training Framework for Agentic Reinforcement Learning

Paper • 2607.21653 • Published 9 days ago • 30

Skill Self-Play: Pushing the Frontier of LLM Capability with Co-Evolving Skills

Paper • 2607.22529 • Published 7 days ago • 47

upvoted 4 papers 4 days ago

Stepwise Reasoning Enhancement for LLMs via External Subgraph Generation

Paper • 2606.04454 • Published Jun 3 • 1

DeLIVeR: Decomposed Learning for Information-grounded Veracity Recognition via Reinforced Knowledge Graph Exploration

Paper • 2607.17935 • Published 11 days ago • 1

Efficient Retrieval-Augmented Generation via Token Co-occurrence Graphs

Paper • 2606.30093 • Published Jun 29 • 1

All Relations Lead to Rome: Automated Knowledge Graph Creation and Question Generation

Paper • 2606.22645 • Published Jun 21 • 1

liked 2 models 4 days ago

BAAI/bge-reranker-v2-m3

Text Classification • 0.6B • Updated Jun 24, 2024 • 19.2M • 1.12k

Querit/Querit

Text Ranking • 64.5k • Updated Jun 18 • 173 • 2

