Kubernetes controller that gates distributed training Jobs and vLLM inference Deployments on an OSS preflight. Training checks: DCGM, nccl-tests, nvbandwidth, perftest, fio, torch.distributed. Inference checks: KV-cache, model-load, sharding-lint, OpenAI-surface, tokenizer-parity. No webhook, no CRDs, no node agents.
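
To make the gating idea concrete, here is a minimal sketch of the decision step only. The check names come from the description above. Everything else is an assumption for illustration: the `PREFLIGHT_CHECKS` grouping keys, the `gate_decision` helper, and the rule that every check must pass before a workload is admitted. It does not show how the controller actually holds or releases a Job or Deployment.

```python
from typing import Dict, List

# Check names are taken from the project description; the grouping keys
# ("training", "inference") and the all-must-pass rule are assumptions.
PREFLIGHT_CHECKS: Dict[str, List[str]] = {
    "training": [
        "dcgm", "nccl-tests", "nvbandwidth",
        "perftest", "fio", "torch.distributed",
    ],
    "inference": [
        "kv-cache", "model-load", "sharding-lint",
        "openai-surface", "tokenizer-parity",
    ],
}


def gate_decision(kind: str, results: Dict[str, bool]) -> dict:
    """Return whether a workload may start, plus the checks blocking it.

    `results` maps a check name to True (passed) or False (failed).
    """
    required = PREFLIGHT_CHECKS[kind]
    # A check that has not reported yet is treated as blocking.
    blocking = [name for name in required if not results.get(name, False)]
    return {"admit": not blocking, "blocking": blocking}


if __name__ == "__main__":
    all_green = {name: True for name in PREFLIGHT_CHECKS["training"]}
    print(gate_decision("training", all_green))
    print(gate_decision("inference", {"kv-cache": True, "model-load": False}))
```

Treating a missing result as blocking keeps the sketch fail-closed: a workload is admitted only after every check for its kind has explicitly reported a pass.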