nd2 LLM

GX10 Queued LLM Gateway

An authenticated API that queues long-running inference jobs against the GX10 box (vLLM), handles model switching, and returns results by polling or webhook callback.

Admin Portal Health

Full API reference ships with this service as API.md — drop it into an AI agent and it can drive the API end-to-end.