Skip to main content
ZeroGPU provides a REST API for running AI model inference. Send text inputs to models and receive structured responses.

Base URL

https://api.zerogpu.ai/v1

Authentication

All requests require two headers:
HeaderDescription
x-api-keyYour ZeroGPU API key (zgpu-...)
x-project-idYour project identifier (UUID)
content-typeMust be application/json
See Authentication for details.

Available endpoints

Responses

POST /v1/responses — Send input to an AI model and receive a response.

Available models

ModelUse case
zlm-v1-summary-cloudText summarization
zlm-v1-iab-classify-cloudIAB content classification
Select the model in your dashboard and pass its identifier in the model field of your request.

Error codes

StatusMeaning
200Success
400Bad request — check your request body
401Unauthorized — invalid or missing API key
403Forbidden — invalid project ID or insufficient permissions
420Input exceeds the model’s token limit — see Responses
500Internal server error — retry with exponential backoff

SDK examples

Integration examples are available for multiple languages: