AI & ML interests
None defined yet.
Papers
Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution
DR$^{3}$-Eval: Towards Realistic and Reproducible Deep Research Evaluation
Models 0
None public yet
Datasets 9
NJU-LINK/WebCompass
Viewer • Updated • 933 • 18.3k • 6
NJU-LINK/ViDiC-1K
Updated • 395 • 5
NJU-LINK/DR3-Eval
Viewer • Updated • 100 • 2.56k • 2
NJU-LINK/CodeTraceBench
Viewer • Updated • 4.32k • 3.12k • 2
NJU-LINK/OmniVideoBench
Viewer • Updated • 1k • 1.9k • 5
NJU-LINK/camerabench_binary
Viewer • Updated • 7.83k • 19
NJU-LINK/MT-Video-Bench
Updated • 70 • 4
NJU-LINK/T2AV-Compass
Viewer • Updated • 500 • 129 • 4
NJU-LINK/IF-VidCap
Viewer • Updated • 1.4k • 1.03k • 2