Back to Search
R

ROSA

Project

ROSA (Robotics Oriented Serving Architecture) is a serving system for Robotics Foundation Models (RFMs) designed for robot factories. It is built on three principles: shared GPU-pool serving where a fleet of robots accesses server-class GPUs over the network, a robotics-aware programming abstraction supporting multi-model pipelines with per-task SLOs and failure handling, and factory-objective-driven scheduling. Implemented on Ray Serve with vLLM, PyTorch, and JAX backends.

Details

Updated:7/3/2026
languagePython

Tags

RFM servingrobot factoryGPU poolNVIDIA ResearchvLLM

Relationships

No relationships found.

Sources

ROSA paper
paper
Visit

Appears In

No related landscapes.