arxiv:2605.20552
🤝 Open to Collab
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
liked a dataset 18 days ago: ulamai/verified-research-reasoning-trajectories
authored a paper about 2 months ago: Spectral bandits for smooth graph functions with applications in recommender systems
updated a dataset about 2 months ago: misovalko/my-research-papers