Gengze Zhou

ZGZzz
https://gengzezhou.github.io/
  • GengzeZhou
  • GengzeZhou
  • gengze-zhou-159095203

AI & ML interests

Embodied AI, Vision-and-Language Navigation, Computer Vision, Multimodality Learning, LLM

Organizations

None yet

authored a paper 2 months ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 119
authored a paper 3 months ago

Rethinking Training Dynamics in Scale-wise Autoregressive Generation

Paper • 2512.06421 • Published Dec 6, 2025 • 7
authored a paper about 1 year ago

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

Paper • 2412.05552 • Published Dec 7, 2024 • 6
authored 4 papers over 1 year ago

WebVLN: Vision-and-Language Navigation on Websites

Paper • 2312.15820 • Published Dec 25, 2023

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Paper • 2402.15852 • Published Feb 24, 2024

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Paper • 2305.16986 • Published May 26, 2023

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Paper • 2407.12366 • Published Jul 17, 2024 • 4