ColBERT-Zero 🐶 Collection First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 9 days ago • 17
OpenDataArena/MMFineReason-SFT-123K-Qwen3-VL-235B-Thinking Viewer • Updated 25 days ago • 123k • 1.2k • 76
Kanade: A Simple Disentangled Tokenizer for Spoken Language Modeling Paper • 2602.00594 • Published 29 days ago • 1