zlm-v1-followup-questions-edge

zlm-v1-followup-questions-edge: Responses

curl --request POST \
  --url https://api.zerogpu.ai/v1/responses \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --header 'x-project-id: <api-key>' \
  --data '
{
  "input": "Electric vehicles are gaining popularity worldwide as battery costs drop and charging infrastructure expands. Most modern EVs offer a range of 250 to 350 miles on a single charge, and fast-charging stations can replenish 80% of battery capacity in under 30 minutes. However, concerns around battery disposal, rare earth mineral sourcing, and grid capacity continue to shape the debate around large-scale EV adoption.",
  "model": "zlm-v1-followup-questions-edge"
}
'

{}

POST

responses

zlm-v1-followup-questions-edge: Responses

curl --request POST \
  --url https://api.zerogpu.ai/v1/responses \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --header 'x-project-id: <api-key>' \
  --data '
{
  "input": "Electric vehicles are gaining popularity worldwide as battery costs drop and charging infrastructure expands. Most modern EVs offer a range of 250 to 350 miles on a single charge, and fast-charging stations can replenish 80% of battery capacity in under 30 minutes. However, concerns around battery disposal, rare earth mineral sourcing, and grid capacity continue to shape the debate around large-scale EV adoption.",
  "model": "zlm-v1-followup-questions-edge"
}
'

{}

ZeroGPU’s zlm-v1-followup-edge is a 60M-parameter model fine-tuned specifically to generate contextual follow-up responses that drive deeper user engagement. Built for edge deployment on the ONNX runtime, it’s optimized for the kind of low-latency, conversational interactions that keep users active, think smart nudges, re-engagement prompts, and contextual suggestions. Unlike general-purpose generators, this model is laser-focused on conversational depth, making every response feel intentional rather than generic. Purpose-built for the ZeroGPU distributed inference network.

References: Model docs • Terms • Privacy

Specifications

Property	Value
Model ID	`zlm-v1-followup-questions-edge`
Task	Text Generation
Type	`t5`
Parameters	120M
Version	1
Max Tokens	400
Provider	ZeroGPU
Input Price	$0.25 / 1M
Output Price	$0.40 / 1M
Total Price	$0.65 / 1M

Try it

Send a live request with your x-api-key and x-project-id. Model is fixed to zlm-v1-followup-questions-edge. Use request examples below to switch use cases (JSON extraction, NER, PII, and so on).

Authorizations

x-api-key

string

header

required

x-project-id

string

header

required

Body

application/json

input

string<textarea>

required

Multi-line text or document content to send to the model.

Required string length: 1 - 131072

instructions

string

metadata

object

Response

Success

The response is of type object.

t5-small zlm-v1-iab-classify-edge

Overview

Endpoints

SDK Examples

zlm-v1-followup-questions-edge

Specifications

Try it

Authorizations

Body

Response

Overview

Endpoints

SDK Examples

Documentation Index

​Specifications

​Try it

Authorizations

Body

Response

Specifications

Try it