VIMA

Model (active)

Multimodal robot manipulation agent from Stanford and NVIDIA, conditioned on prompts that interleave text and images.

Details

Updated: 6/21/2026
Open source: true
Release date: 2022-10-06
GitHub URL: https://github.com/vimalabs/VIMA
Paper URL: https://arxiv.org/abs/2210.03094
Model family: VIMA

Tags

multimodal-prompting, few-shot-learning, manipulation, stanford, nvidia, ICML-2023, object-manipulation

Sources

Website: https://vimalabs.github.io/
Paper: https://arxiv.org/abs/2210.03094
GitHub: https://github.com/vimalabs/VIMA
