Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
OpenEvals
community
Activity Feed
Follow
161
AI & ML interests
LLM evaluation
Recent Activity
SaylorTwift
updated
a dataset
about 19 hours ago
OpenEvals/leaderboard-data
nielsr
submitted
a paper
about 19 hours ago
Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders
nielsr
submitted
a paper
4 days ago
V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning
View all activity
Team members
9
OpenEvals
's datasets
5
Sort: Recently updated
OpenEvals/leaderboard-data
Viewer
•
Updated
about 18 hours ago
•
93
•
301
•
1
OpenEvals/IMO-AnswerBench
Viewer
•
Updated
Jan 23
•
400
•
210
•
1
OpenEvals/MuSR
Viewer
•
Updated
Dec 12, 2025
•
756
•
33
OpenEvals/aime_24
Viewer
•
Updated
Dec 12, 2025
•
30
•
81
•
1
OpenEvals/SimpleQA
Viewer
•
Updated
Dec 12, 2025
•
4.33k
•
1.38k
•
4