← Back to API Documentation Home

vllm-medgemma

vLLM always-on MedGemma 27B Text IT FP8 service with 32K context for medical text comprehension, clinical reasoning, and biomedical QA. Text-only (no vision). Shared Delta GPU.