Rationale-aided, efficient 7B-size Large Language and Vision Models. Let's enjoy them!
Byung-Kwan Lee
BK-Lee
AI & ML interests
Vision-Language Models
Recent Activity
upvoted a paper about 17 hours ago
AnyGroundBench: A Specialized-Domain Benchmark for Video Grounding in Vision-Language Models
upvoted a paper 3 days ago
Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks
upvoted a paper 7 days ago
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation