Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
syddharth
's Collections
LLM
Audio
Video
Image
Vision
Vision
updated
Sep 23, 2024
Upvote
-
01-ai/Yi-VL-34B
Image-Text-to-Text
•
Updated
Jun 26, 2024
•
271
•
265
01-ai/Yi-VL-6B
Image-Text-to-Text
•
Updated
Jun 26, 2024
•
202
•
124
NousResearch/Nous-Hermes-2-Vision-Alpha
Text Generation
•
Updated
Dec 3, 2023
•
1.46k
•
305
liuhaotian/llava-v1.5-13b
Image-Text-to-Text
•
Updated
May 9, 2024
•
40.7k
•
528
fancyfeast/joytag
Image Classification
•
Updated
Mar 9, 2024
•
652
•
115
internlm/internlm-xcomposer2-7b
Text Generation
•
Updated
Feb 27, 2024
•
12.4k
•
31
internlm/internlm-xcomposer2-4khd-7b
Visual Question Answering
•
Updated
Apr 18, 2024
•
1.02k
•
73
SmilingWolf/wd-vit-large-tagger-v3
0.3B
•
Updated
Jul 26, 2024
•
808
•
92
Aryn/deformable-detr-DocLayNet
Object Detection
•
41.1M
•
Updated
Aug 8, 2025
•
12.2k
•
51
abetlen/Phi-3.5-vision-instruct-gguf
4B
•
Updated
Oct 1, 2024
•
1.3k
•
30
MiaoshouAI/Florence-2-base-PromptGen-v1.5
0.3B
•
Updated
Oct 9, 2024
•
1.11k
•
102
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
0.7B
•
Updated
Feb 4, 2025
•
149k
•
1.54k
Upvote
-
Share collection
View history
Collection guide
Browse collections