Jin Chen
VanZieks
AI & ML interests
Large language model
Recent Activity
upvoted a paper about 1 month ago
BABE: Biology Arena BEnchmark
upvoted a paper about 1 month ago
Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities
upvoted a paper 6 months ago
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
Organizations
None yet