ZeroGPU provides a REST API for running AI model inference. Send text inputs to models and receive structured responses.
Base URL
https://api.zerogpu.ai/v1
Authentication
All requests require three headers:
| Header | Description |
|---|---|
| x-api-key | Your ZeroGPU API key (zgpu-...) |
| x-project-id | Your project identifier (UUID) |
| content-type | Must be application/json |
See Authentication for details.
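As a minimal sketch, the headers in the table above can be assembled in Python as shown here. The helper name `build_headers` and the environment variable names `ZEROGPU_API_KEY` and `ZEROGPU_PROJECT_ID` are illustrative assumptions, not part of the ZeroGPU API.

```python
import os


def build_headers(api_key: str, project_id: str) -> dict:
    """Return the request headers listed in the Authentication table."""
    return {
        "x-api-key": api_key,                # ZeroGPU API key (zgpu-...)
        "x-project-id": project_id,          # project identifier (UUID)
        "content-type": "application/json",  # must be application/json
    }


# Hypothetical environment variable names; use whatever your deployment provides.
headers = build_headers(
    os.environ.get("ZEROGPU_API_KEY", "zgpu-..."),
    os.environ.get("ZEROGPU_PROJECT_ID", "<your-project-uuid>"),
)
```

Reading credentials from the environment keeps the API key out of source control.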
Available endpoints
| Endpoint | Route | Description |
|---|---|---|
| Responses | POST /v1/responses | Primary inference route; accepts input as a string or as messages. |
| Chat completions | POST /v1/chat/completions | For models that use messages (chat-style). |
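The sketch below shows one way a call to the Responses route might be built with Python's standard library. The body field name `input` is an assumption inferred from the route description, and `build_responses_request` is a hypothetical helper; check the Responses page for the exact request schema.

```python
import json
import urllib.request

BASE_URL = "https://api.zerogpu.ai/v1"


def build_responses_request(model: str, text: str, api_key: str, project_id: str):
    """Build (but do not send) a POST request to the Responses route."""
    # Assumption: the body carries the model identifier and a string "input".
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/responses",
        data=body,
        headers={
            "x-api-key": api_key,
            "x-project-id": project_id,
            "content-type": "application/json",
        },
        method="POST",
    )


req = build_responses_request(
    "zlm-v1-summary-cloud", "Text to summarize.", "zgpu-...", "<your-project-uuid>"
)
# To send it: urllib.request.urlopen(req) returns the HTTP response object.
```

Building the request separately from sending it makes the URL, headers, and body easy to inspect before any network call is made.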
Available models
| Model | Use case |
|---|---|
| zlm-v1-summary-cloud | Text summarization |
| zlm-v1-iab-classify-cloud | IAB content classification |
Select the model in your dashboard and pass its identifier in the model field of your request.
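As an illustration of passing the identifier in the model field, the sketch below maps each use case to its model and builds a request body. The use-case keys, the helper `build_payload`, and the `input` field name are assumptions made for this example; only the `model` field and the two identifiers come from this page.

```python
# Model identifiers copied from the Available models table.
MODEL_BY_USE_CASE = {
    "summarization": "zlm-v1-summary-cloud",
    "iab-classification": "zlm-v1-iab-classify-cloud",
}


def build_payload(use_case: str, text: str) -> dict:
    """Return a request body with the model field set for the given use case."""
    try:
        model = MODEL_BY_USE_CASE[use_case]
    except KeyError:
        raise ValueError(f"unknown use case: {use_case!r}") from None
    # Assumption: the text goes in an "input" field.
    return {"model": model, "input": text}


payload = build_payload("iab-classification", "Article text to classify.")
```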
Error codes
| Status | Meaning |
|---|---|
| 200 | Success |
| 400 | Bad request; check your request body |
| 401 | Unauthorized; invalid or missing API key |
| 403 | Forbidden; invalid project ID or insufficient permissions |
| 420 | Input exceeds the model’s token limit; see Responses |
| 500 | Internal server error; retry with exponential backoff |
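The retry advice for status 500 could be implemented along the following lines. This is a sketch, not an official client: `call_with_backoff` is a hypothetical helper, `send` stands for any function of yours that performs one request and returns `(status, body)`, and the attempt count and delays are arbitrary defaults.

```python
import time

RETRYABLE_STATUSES = {500}  # only 500 is documented as worth retrying


def call_with_backoff(send, max_attempts=4, base_delay=0.5, sleep=time.sleep):
    """Call send() until it succeeds, doubling the wait after each 500."""
    for attempt in range(max_attempts):
        status, body = send()
        if status == 200:
            return body
        last_attempt = attempt == max_attempts - 1
        if status not in RETRYABLE_STATUSES or last_attempt:
            # 400, 401, 403, and 420 indicate a problem with the request itself,
            # so retrying without changes would not help.
            raise RuntimeError(f"request failed with status {status}")
        sleep(base_delay * 2 ** attempt)  # 0.5 s, 1 s, 2 s, ...
```

In production, adding random jitter to each delay helps avoid many clients retrying at the same moment.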
SDK examples
Integration examples are available for multiple languages.