arxiv:2603.09079
Md Selim Sarowar
selim-sarowar
AI & ML interests
Vision Language Action Models, World Models, 5D Robot Manipulation, 3D Computer Vision
Recent Activity
authored a paper 1 day ago
Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models
authored a paper 4 days ago
GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models
Organizations
None yet