A GPT-4V Level Multimodal LLM on Your Phone
chongyi
yuzaa
AI & ML interests
multimodal large language models
Recent Activity
updated a model about 11 hours ago
openbmb/MiniCPM-V-4.6 authored a paper 2 days ago
LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs? updated a collection 3 days ago
MiniCPM-V