Prometheus Metrics
Every daemon exposes a/metrics endpoint:
Available Metrics
| Metric | Type | Description |
|---|---|---|
orpheus_queue_depth | Gauge | Pending requests per agent |
orpheus_queue_pending | Gauge | Requests waiting in queue |
orpheus_queue_processing | Gauge | Requests currently executing |
orpheus_queue_max_size | Gauge | Maximum queue capacity |
orpheus_pool_workers_total | Gauge | Total workers per agent |
orpheus_pool_workers_idle | Gauge | Idle workers available |
orpheus_pool_workers_busy | Gauge | Workers currently executing |
orpheus_pool_desired_size | Gauge | Target worker count |
orpheus_execlog_requests_total | Counter | Total requests processed |
orpheus_execlog_success_rate | Gauge | Success percentage |
orpheus_service_up | Gauge | Model server status |
orpheus_service_uptime_seconds | Gauge | Model server uptime |
Prometheus Scrape Config
Grafana Queries
Execution Logs
Every request is tracked in the exec log with full lifecycle:Request States
| State | Meaning |
|---|---|
QUEUED | Request received, waiting for worker |
STARTED | Worker picked up the request |
COMPLETED | Execution finished successfully |
FAILED | Handler returned an error |
CRASHED | Daemon died mid-execution |
Source Tracking
Exec logs distinguish between HTTP and MCP requests:- http = Blue text
- mcp = Magenta text
Debugging Agent Chains
When agent-router calls calculator-python via MCP:API Endpoints
Get Metrics
Query Exec Logs
Custom Labels
Add labels to metrics via agent.yaml:Autoscaling
Learn how queue-depth drives scaling →

