Running on CPU Upgrade 21 BigCodeBench Evaluator 🥇 21 Evaluate code samples using specified parameters
Running Agents 230 BigCodeBench Leaderboard 🥇 230 Explore code-generation model leaderboards and task details