view article Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? Kseniase • Mar 17, 2025 • 357
view article Article We now support VLMs in smolagents! +1 m-ric, merve, albertvillanova • Jan 24, 2025 • 113
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Paper • 2412.04424 • Published Dec 5, 2024 • 62
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model Paper • 2408.16767 • Published Aug 29, 2024 • 32
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models +1 andito, merve, SkalskiP • Jun 24, 2024 • 207
view article Article Diffusers welcomes Stable Diffusion 3 +4 dn6, YiYiXu, sayakpaul, OzzyGT, kashif, multimodalart • Jun 12, 2024 • 99