What is OpenReward’s Architecture?
OpenReward provides infrastructure for hosting and running AI agent environments. The platform has two main components:The Two Components
Environments: Persistent Evaluation Servers
An environment is a long-running server that hosts your evaluation: What it provides:- Evaluation server that agents connect to
- Tasks for agents to solve
- Tools agents can call
- Session management for multiple agents
- Automatic scaling based concurrent sessions
- Isolated storage for datasets and artifacts
Sandboxes: Temporary Execution Containers
A sandbox is an isolated container for running code: What it provides:- Isolated execution environment
- Configurable resources (CPU, memory)
- Network isolation options
- Automatic cleanup after use
How They Work Together
Common Patterns
Pattern 1: Environment-Only For environments that don’t need code execution:Data Storage
Environment Storage
Each environment includes isolated cloud storage that is shared between the environment and sandboxes. Access:- Environment server mounts storage at:
/orwd_data/ - Sandboxes can choose the path to mount the storage at
- Data persists across restarts and sandbox runs

