google/timesfm-2.5-200m-transformers Time Series Forecasting • 0.2B • Updated 3 days ago • 581 • 10
knowledgator/gliclass-instruct-large-v1.0 Text Classification • 0.4B • Updated 6 days ago • 135 • 15
CoPE Collection CoPE is a drop-in enhancement of RoPE that delivers consistent gains within the training context and during long-context extrapoaltion. • 9 items • Updated 17 days ago • 2
CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs Paper • 2602.05258 • Published 18 days ago • 7
Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention Paper • 2602.03338 • Published 20 days ago • 26
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published 26 days ago • 13
Shaping capabilities with token-level data filtering Paper • 2601.21571 • Published 25 days ago • 27
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published 24 days ago • 56
EEG Foundation Models: Progresses, Benchmarking, and Open Problems Paper • 2601.17883 • Published 29 days ago • 20
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published 25 days ago • 50
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published 25 days ago • 17