H-EmbodVis

university
https://github.com/H-EmbodVis

AI & ML interests

None defined yet.

Recent Activity

dkliang  authored a paper about 3 hours ago
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
dkliang  submitted a paper about 8 hours ago
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
HENGFANG  published a model 11 days ago
H-EmbodVis/PUMA

Papers

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models


Dingkang Liang, Xin Zhou, Cheng, Cheng Zhang, Xianjin-Wu, HENG FANG, Ellery Kant
H-EmbodVis's Papers (5)
Submitted by Dingkang Liang
98

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

H-EmbodVis
15 1
Submitted by Dingkang Liang
153

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

H-EmbodVis
220 4
Submitted by Dingkang Liang
95

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

H-EmbodVis
328 5
Submitted by Dingkang Liang
3

Towards Generalizable Robotic Manipulation in Dynamic Environments

H-EmbodVis
136 2
Submitted by Dingkang Liang
7

Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution

H-EmbodVis
359 2