| --- |
| license: apache-2.0 |
| --- |
| # ποΈ GLaMM-RefSeg |
|
|
| --- |
| ## π Description |
| GLaMM-RegCap-VG is the model specific to referring expression segmentation. "RefSeg" denotes its focus on segmentation tasks related to referring expressions. |
|
|
|
|
| ## π» Download |
| To get started with GLaMM-RefSeg, follow these steps: |
| ``` |
| git lfs install |
| git clone https://huggingface.co/MBZUAI/GLaMM-RefSeg |
| ``` |
|
|
| ## π Additional Resources |
| - **Paper:** [ArXiv](https://arxiv.org/abs/2311.03356). |
| - **GitHub Repository:** For training and updates: [GitHub - GLaMM](https://github.com/mbzuai-oryx/groundingLMM). |
| - **Project Page:** For a detailed overview and insights into the project, visit our [Project Page - GLaMM](https://mbzuai-oryx.github.io/groundingLMM/). |
|
|
| ## π Citations and Acknowledgments |
|
|
| ```bibtex |
| @article{hanoona2023GLaMM, |
| title={GLaMM: Pixel Grounding Large Multimodal Model}, |
| author={Rasheed, Hanoona and Maaz, Muhammad and Shaji, Sahal and Shaker, Abdelrahman and Khan, Salman and Cholakkal, Hisham and Anwer, Rao M. and Xing, Eric and Yang, Ming-Hsuan and Khan, Fahad S.}, |
| journal={ArXiv 2311.03356}, |
| year={2023} |
| } |
| |
| |