GLiNER-X Collection The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type. β’ 6 items β’ Updated Jan 29 β’ 22
view article Article Taxonomy Completion with Embedding Quantization and an LLM-based Pipeline: A Case Study in Computational Linguistics dcarpintero β’ Jul 22, 2024 β’ 7
Emerging Properties in Unified Multimodal Pretraining Paper β’ 2505.14683 β’ Published May 20, 2025 β’ 136
view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito β’ May 12, 2025 β’ 614
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs davidberenstein1957 β’ May 7, 2025 β’ 42
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper β’ 2505.02567 β’ Published May 5, 2025 β’ 82
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper β’ 2504.20571 β’ Published Apr 29, 2025 β’ 99
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper β’ 2504.20734 β’ Published Apr 29, 2025 β’ 62
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? Kseniase β’ Mar 17, 2025 β’ 360
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Paper β’ 2503.16365 β’ Published Mar 20, 2025 β’ 41
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper β’ 2503.11576 β’ Published Mar 14, 2025 β’ 163
view article Article Trace & Evaluate your Agent with Arize Phoenix +1 schavalii, jgilhuly16, m-ric β’ Feb 28, 2025 β’ 41
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google +1 merve, ariG23498, andsteing β’ Feb 19, 2025 β’ 75
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π manu β’ Jul 5, 2024 β’ 320
view article Article Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios pratikbhavsar β’ Feb 12, 2025 β’ 28
view article Article Open-source DeepResearch β Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier β’ Feb 4, 2025 β’ 1.32k