POST /responses
LFM2.5-1.2B-Instruct: Responses
curl --request POST \
  --url https://api.zerogpu.ai/v1/responses \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --header 'x-project-id: <project-id>' \
  --data '
{
  "input": "I just completed a 5K run in 28 minutes. Give me a short motivational follow-up message.",
  "model": "LFM2.5-1.2B-Instruct"
}
'
{}

Liquid AI’s LFM2.5-1.2B-Instruct is a hybrid-architecture model purpose-built for on-device deployment, trained on 28 trillion tokens with multi-stage reinforcement learning. It delivers best-in-class instruction following and native tool calling at the 1B scale, something very few models of this size do reliably. Its hybrid design gives it significantly faster inference and lower memory usage than transformer-only models of the same size, which makes it a fit for conversational workloads on devices without a GPU. It supports 8 languages, including English, Chinese, Japanese, and Spanish.
References: Model docs, Terms, Privacy

Specifications

| Property | Value |
| --- | --- |
| Model ID | LFM2.5-1.2B-Instruct |
| Task | Text Generation |
| Type | lfm2.5 |
| Parameters | 1.2B |
| Version | 2.5 |
| Max Tokens | 800 |
| Provider | Liquid AI |
| Input Price | $0.20 / 1M |
| Output Price | $0.40 / 1M |
| Total Price | $0.60 / 1M |
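
To make the pricing rows concrete, here is a minimal sketch of a per-request cost estimate. It assumes the listed rates are US dollars per one million tokens and that the Total Price row is simply the sum of the input and output rates. Neither assumption is stated on this page, and the function name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Rates copied from the Specifications table.
# Assumption: both are USD per 1,000,000 tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.20")
OUTPUT_PRICE_PER_MILLION = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_MILLION
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION
    )
    return total / TOKENS_PER_MILLION
```

Under these assumptions, one million input tokens plus one million output tokens comes to $0.60, which matches the Total Price row. `Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise.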

Try it

Send a live request with your x-api-key and x-project-id headers. The model is fixed to LFM2.5-1.2B-Instruct. Use the request examples below to switch between use cases (JSON extraction, NER, PII, and so on).

Authorizations

x-api-key (string, header, required)
x-project-id (string, header, required)
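
As a rough Python counterpart to the curl example, the sketch below builds a request that carries both required headers. It uses only the standard library. The function names `build_request` and `send_request` are hypothetical, and the request body mirrors the curl example rather than a confirmed schema.

```python
import json
import urllib.request

API_URL = "https://api.zerogpu.ai/v1/responses"
MODEL_ID = "LFM2.5-1.2B-Instruct"


def build_request(api_key: str, project_id: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request with both auth headers set."""
    body = json.dumps({"input": text, "model": MODEL_ID}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "x-api-key": api_key,        # required header
            "x-project-id": project_id,  # required header
        },
    )


def send_request(req: urllib.request.Request, timeout: float = 30.0) -> bytes:
    """Send the request and return the raw response body."""
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read()
```

A caller would typically read the two credentials from its own configuration or environment, pass them to `build_request`, and hand the result to `send_request`. Keeping construction separate from sending makes the headers easy to inspect without a network call.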

Body

application/json
input (string, required)

Multi-line text or document content to send to the model. Required string length: 1 - 131072.

instructions (string)
metadata (object)
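
The body fields above can be assembled and checked client-side before sending. The following is a sketch only: it assumes the 1 - 131072 limit counts characters, includes the `model` field because the curl example does, and uses a hypothetical helper name `build_payload`.

```python
from typing import Any, Dict, Optional

MODEL_ID = "LFM2.5-1.2B-Instruct"
MIN_INPUT_LENGTH = 1
MAX_INPUT_LENGTH = 131072  # assumption: measured in characters


def build_payload(
    text: str,
    instructions: Optional[str] = None,
    metadata: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Return a request body, rejecting input outside the documented length range."""
    if not isinstance(text, str):
        raise TypeError("input must be a string")
    if not MIN_INPUT_LENGTH <= len(text) <= MAX_INPUT_LENGTH:
        raise ValueError(
            f"input length must be between {MIN_INPUT_LENGTH} and {MAX_INPUT_LENGTH}"
        )
    payload: Dict[str, Any] = {"input": text, "model": MODEL_ID}
    # Optional fields are omitted entirely when not supplied.
    if instructions is not None:
        payload["instructions"] = instructions
    if metadata is not None:
        payload["metadata"] = metadata
    return payload
```

Validating locally avoids a round trip for inputs the API would presumably reject. The server remains the authority on what it accepts.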

Response

Success

The response is of type object.
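
Since the page documents only that a successful response is an object, a cautious client can check that shape before reading any fields. This is a minimal sketch with a hypothetical helper name `parse_response`. It deliberately assumes nothing about the keys inside the object.

```python
import json
from typing import Any, Dict


def parse_response(raw: bytes) -> Dict[str, Any]:
    """Decode a response body and confirm the top-level JSON value is an object."""
    decoded = json.loads(raw.decode("utf-8"))
    if not isinstance(decoded, dict):
        raise ValueError(
            f"expected a JSON object, got {type(decoded).__name__}"
        )
    return decoded
```

Malformed JSON raises `json.JSONDecodeError`, which is a subclass of `ValueError`, so a single `except ValueError` covers both failure modes.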