CHAPTER 04
Placement — workloads on the right silicon.
Heavy CNN/ViT inference, MoE expert layers, and agentic pipeline stages don't all need an H200. AnaROS routes each stage to the cheapest GPU class that still meets its SLO: heterogeneous by design.
One inspection pipeline, four stages, four different GPU classes — and one governance plane on top.
INSPECTION PIPELINE · UNIT OF WORK
amsf:inspection-b6-amsf · production-team
STAGE 1
Image Ingest
L4 · PCIe
NIC + storage bound
host · anavec1-0
STAGE 2
Tile / Pre-process
L40S · PCIe
normalize, augment, batch
host · anavec1-0
STAGE 3
Defect Inference
H200 · SXM
heavy CNN / ViT
host · anavec2-0
STAGE 4
Classify / Dispose
T4 · PCIe
small classifier, final decision
host · anavec3-0
AnaROS™ — Pipeline Governance Layer — SLOs · Isolation · Routing · Chargeback
PIPELINE-AWARE
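The routing rule described above (send each stage to the cheapest GPU class that still meets its SLO) can be sketched as follows. This is a minimal illustration, not AnaROS code: the `GpuClass` and `Stage` types, the relative cost weights, the latency budgets, and the per-class latency estimates are all hypothetical values chosen so that the result matches the four-stage placement shown in the pipeline.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class GpuClass:
    name: str
    relative_cost: float  # hypothetical cost weight, not a real price


@dataclass
class Stage:
    name: str
    slo_ms: float                     # hypothetical latency budget
    est_latency_ms: Dict[str, float]  # hypothetical estimate per GPU class


def cheapest_class_meeting_slo(stage: Stage,
                               gpu_classes: List[GpuClass]) -> Optional[GpuClass]:
    """Return the lowest-cost class whose estimated latency fits the SLO."""
    feasible = [
        g for g in gpu_classes
        if stage.est_latency_ms.get(g.name, float("inf")) <= stage.slo_ms
    ]
    # None means no class in the pool can meet this stage's SLO.
    return min(feasible, key=lambda g: g.relative_cost, default=None)


# Illustrative numbers only (assumptions, not measurements).
GPU_CLASSES = [
    GpuClass("T4", 1.0),
    GpuClass("L4", 2.0),
    GpuClass("L40S", 4.0),
    GpuClass("H200", 10.0),
]

STAGES = [
    Stage("Image Ingest", 20, {"T4": 30, "L4": 15, "L40S": 10, "H200": 8}),
    Stage("Tile / Pre-process", 25, {"T4": 60, "L4": 40, "L40S": 20, "H200": 12}),
    Stage("Defect Inference", 50, {"T4": 400, "L4": 220, "L40S": 90, "H200": 35}),
    Stage("Classify / Dispose", 10, {"T4": 6, "L4": 5, "L40S": 4, "H200": 3}),
]

plan = {}
for stage in STAGES:
    choice = cheapest_class_meeting_slo(stage, GPU_CLASSES)
    plan[stage.name] = choice.name if choice else None

print(plan)
```

With these assumed inputs, only the inference stage lands on the H200; every other stage fits on a cheaper class, which is the point of heterogeneous placement.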
WORKFLOW PLACEMENT JOURNEY
workflow: amsf:inspection-b6-amsf
01 · DISCOVER
5 containers declared · source workflow_intent.yaml
02 · RECONCILE
declared shape bound · strategy per_container
03 · PLACE
stages routed to L4 · L40S · H200 · T4 · advisory plan ready
04 · DEPLOY
deploy state pending · Terraform walkthrough on apply
05 · GOVERN
SLOs, isolation, and chargeback are wired up after deploy
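The five-step journey can be read as a small state machine in which the plan stays advisory at the Place step until someone applies it. The sketch below assumes that reading; the `Phase` enum and the `advance` function are hypothetical names for illustration and are not part of any AnaROS API.

```python
from enum import IntEnum


class Phase(IntEnum):
    DISCOVER = 1
    RECONCILE = 2
    PLACE = 3
    DEPLOY = 4
    GOVERN = 5


def advance(current: Phase, applied: bool = False) -> Phase:
    """Move to the next phase; PLACE only moves on once the plan is applied."""
    if current is Phase.GOVERN:
        return current  # final phase, nothing further to advance to
    if current is Phase.PLACE and not applied:
        return current  # advisory plan only: no commit until Apply
    return Phase(current + 1)


phase = Phase.DISCOVER
phase = advance(phase)                 # RECONCILE
phase = advance(phase)                 # PLACE
phase = advance(phase)                 # still PLACE, plan not applied
phase = advance(phase, applied=True)   # DEPLOY
print(phase.name)
```

Gating the transition on an explicit flag keeps the placement result inspectable before anything is deployed.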
RESOURCE MAP · LIVE SNAPSHOT
advisory only — no commit until Apply
gpu-anavec1-0
cap 128 · used 16.9 (87% free)
8 models loaded
gpu-anavec2-0
cap 128 · used 21.1 (84% free)
2 models loaded
gpu-anavec3-0
cap 128 · used 4.7 (96% free)
2 models loaded
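The "free" percentages in the snapshot follow from capacity and usage. A minimal sketch of that arithmetic is shown here, assuming the figure is rounded to the nearest whole percent; the unit of `cap` and `used` is not stated in the snapshot, and `percent_free` is a hypothetical helper name.

```python
def percent_free(cap: float, used: float) -> int:
    """Share of capacity still free, rounded to a whole percent."""
    return round(100 * (cap - used) / cap)


# Values taken from the resource map above.
snapshot = {
    "gpu-anavec1-0": (128, 16.9),
    "gpu-anavec2-0": (128, 21.1),
    "gpu-anavec3-0": (128, 4.7),
}

for host, (cap, used) in snapshot.items():
    print(f"{host}: {percent_free(cap, used)}% free")
```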
⚡ Apply this plan · Terraform Deploy walkthrough