tencent/Youtu-VL-4B-Instruct
Tags: Image-Text-to-Text, Transformers, Safetensors, youtu_vl, text-generation, conversational, custom_code
arXiv: 2601.19798, 2512.24618
License: youtu-vl
Discussions
cascaded norm
#11 opened about 2 months ago by J22
How to process multi-images?
#8 opened 2 months ago by Yiyiyi
[Urgent Suggestion] Complete Deployment Guide: SDPA Patch (2x speed), 4-bit Fix for 8GB GPUs & Visual Examples
#6 opened 2 months ago by NodeLinker