SwingBench

community

https://github.com/menik1126/Swing-Bench/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

kirailol authored a paper 2 minutes ago

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

kirailol authored a paper 3 minutes ago

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models

kirailol authored a paper 3 minutes ago

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

View all activity

authored a paper 2 minutes ago

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Paper • 2505.15929 • Published May 21, 2025 • 49

authored 3 papers 3 minutes ago

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models

Paper • 2508.03332 • Published Aug 5, 2025

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Paper • 2509.07403 • Published Sep 9, 2025 • 35

PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models

Paper • 2509.16989 • Published Sep 21, 2025 • 1

authored a paper 4 minutes ago

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Paper • 2505.23932 • Published May 29, 2025

authored a paper 5 minutes ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 200

submitted a paper to Daily Papers 27 days ago

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

Paper • 2602.19895 • Published 28 days ago • 13

submitted a paper to Daily Papers about 2 months ago

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Paper • 2602.06034 • Published Feb 5 • 8

updated a dataset about 2 months ago

SwingBench/SwingBench

Viewer • Updated Feb 6 • 34.8k • 206

submitted a paper to Daily Papers about 2 months ago

OVD: On-policy Verbal Distillation

Paper • 2601.21968 • Published Jan 29 • 4

published a dataset about 2 months ago

SwingBench/SwingBench

Viewer • Updated Feb 6 • 34.8k • 206

authored 6 papers 2 months ago

ATTS: Asynchronous Test-Time Scaling via Conformal Prediction

Paper • 2509.15148 • Published Sep 18, 2025 • 1

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98

Efficient Diffusion Models: A Survey

Paper • 2502.06805 • Published Feb 3, 2025

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Paper • 2505.23932 • Published May 29, 2025

SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression

Paper • 2503.12340 • Published Mar 16, 2025

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

submitted a paper to Daily Papers 2 months ago

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published Jan 6 • 106

authored 2 papers 5 months ago

ATTS: Asynchronous Test-Time Scaling via Conformal Prediction

Paper • 2509.15148 • Published Sep 18, 2025 • 1

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models

Paper • 2310.10180 • Published Oct 16, 2023 • 1