Model as a Service for Red Hat Demo Platform
No results found.
Model-as-a-Service for Red Hat Demo Platform — managed LLM access with API key management, usage tracking, rate limiting, and budget controls. OpenAI-compatible endpoint, zero infrastructure overhead.
Full component diagram — all LiteMaaS layers, PostgreSQL, Redis, model servers, GitOps flow, and sequence diagrams.
What LiteMaaS is, the problem it solves, key features, role hierarchy, and the end-to-end user flow.
Model capability types (Chat, Embeddings, Document Conversion), API endpoints, and curl/SDK examples.
Prerequisites, Ansible collection variables reference, step-by-step deployment, post-install checklist.
Upgrading, scaling, key cleanup cronjob, adding models, LITELLM_AUTO_SYNC, DB sync, admin operations.
Grafana dashboards, vLLM and GPU metrics, ServiceMonitors, LiteMaaS usage analytics, and key cleanup cronjob.
10 common issues with exact fix commands — model sync, OAuth, migrations, cache, OOMKills, timeouts.
GPU nodes (L40S, L4, T4), model placement, OpenShift AI, KServe InferenceServices, and model deployment.
rhpds/rhpds.litemaas on GitHub — Ansible roles, playbooks, templates and scripts for RHDP deployment.