53
22
69
Ryan Marten
ryanmarten
41 followers
·
97 following
https://ryanmarten.com
ryanmart3n
ryanmarten
ryan-marten
AI & ML interests
None yet
Recent Activity
new
activity
about 7 hours ago
harborframework/parity-experiments:
SpreadsheetBench adapter parity (claude-code + Haiku 4.5, 400 tasks × 3 trials)
new
activity
5 days ago
harborframework/terminal-bench-2.0:
Define 'harbor' as eval framework 🎉
updated
a dataset
6 days ago
harborframework/terminal-bench-2.0
Organizations
ryanmarten's activity
New activity in
harborframework/parity-experiments
about 7 hours ago
SpreadsheetBench adapter parity (claude-code + Haiku 4.5, 400 tasks × 3 trials)
2
#106 opened about 7 hours ago by
ryanmarten
New activity in
harborframework/terminal-bench-2.0
5 days ago
Define 'harbor' as eval framework 🎉
#3 opened 6 days ago by
burtenshaw
updated
a dataset
6 days ago
harborframework/terminal-bench-2.0
Benchmark
•
Updated
5 days ago
•
179
•
2
New activity in
harborframework/terminal-bench-2.0
6 days ago
Add an eval yaml to integrate this benchmark into Community Evals.
#1 opened 7 days ago by
burtenshaw
published
a dataset
9 days ago
harborframework/terminal-bench-2.0
Benchmark
•
Updated
5 days ago
•
179
•
2
liked
a dataset
10 days ago
zai-org/terminal-bench-2-verified
Updated
4 days ago
•
5.63k
•
51
liked
a dataset
2 months ago
open-thoughts/OpenThoughts-Agent-v1-SFT
Viewer
•
Updated
26 days ago
•
15.2k
•
1.99k
•
79
updated
a Space
3 months ago
Running
README
🦀
liked
a dataset
4 months ago
jupyter-agent/jupyter-agent-dataset
Viewer
•
Updated
Sep 10, 2025
•
95.8k
•
10.6k
•
156
updated
2 datasets
6 months ago
ryanmarten/OpenThoughts-1k-sample
Viewer
•
Updated
Aug 31, 2025
•
2k
•
188k
open-thoughts/OpenThoughts-114k
Viewer
•
Updated
Aug 31, 2025
•
228k
•
67.2k
•
807
published
a dataset
6 months ago
ryanmarten/OpenThoughts-1k-sample
Viewer
•
Updated
Aug 31, 2025
•
2k
•
188k
liked
a dataset
6 months ago
SWE-bench/SWE-smith-trajectories
Viewer
•
Updated
Jul 19, 2025
•
76k
•
3.27k
•
47
liked
a Space
8 months ago
Running
6
OpenThoughts Benchmark Explorer
📊
6
Explore benchmark correlations and model performance
liked
a model
9 months ago
open-thoughts/OpenThinker3-7B
Text Generation
•
8B
•
Updated
Jun 9, 2025
•
4.76k
•
•
134
updated
2 collections
9 months ago
Reasoning Models
Collection
53 items
•
Updated
Jun 8, 2025
•
1
Reasoning Datasets
Collection
50 items
•
Updated
Jun 8, 2025
•
11
liked
a dataset
9 months ago
open-thoughts/OpenThoughts3-1.2M
Viewer
•
Updated
Jun 9, 2025
•
1.2M
•
8.56k
•
209
authored
a paper
9 months ago
OpenThoughts: Data Recipes for Reasoning Models
Paper
•
2506.04178
•
Published
Jun 4, 2025
•
52
updated
a collection
9 months ago
OpenThinker3
Collection
4 items
•
Updated
Jul 24, 2025
•
4