llama-3.1-8b-instruct-fast

curl --request POST \ --url https://api.zerogpu.ai/v1/responses \ --header 'Content-Type: application/json' \ --header 'x-api-key: <api-key>' \ --header 'x-project-id: <api-key>' \ --data ' { "input": "NASA announced that its Artemis III mission is now scheduled for late 2026, marking the first time astronauts will land on the lunar surface since Apollo 17 in 1972. The mission will send a crew of four to the Moon aboard the Orion spacecraft, with two astronauts descending to the south pole using SpaceX Starship as a lunar lander. Scientists are particularly excited about exploring permanently shadowed craters that may contain water ice, which could be critical for sustaining long-term human presence on the Moon.", "model": "llama-3.1-8b-instruct-fast" } '

Property	Value
Model ID	`llama-3.1-8b-instruct-fast`
Task	Summarization
Type	`llama-3.1`
Parameters	3B
Version	1
Max Tokens	131072
Provider	Hugging Face
Input Price	$0.05 / 1M
Output Price	$0.40 / 1M
Total Price	$0.45 / 1M

Property

Value

Model ID

llama-3.1-8b-instruct-fast

Task

Summarization

Type

llama-3.1

Parameters

Version

Max Tokens

131072

Provider

Hugging Face

Input Price

$0.05 / 1M

Output Price

$0.40 / 1M

Total Price

$0.45 / 1M

Try it

Send a live request with your x-api-key and x-project-id. Model is fixed to llama-3.1-8b-instruct-fast. Use request examples below to switch use cases (JSON extraction, NER, PII, and so on).

Authorizations

x-api-key

string

header

required

x-project-id

string

header

required

Body

application/json

model

string

default:llama-3.1-8b-instruct-fast

required

Model identifier (fixed for this playground). Use request examples to change use cases.

Allowed value: "llama-3.1-8b-instruct-fast"

Example:

"llama-3.1-8b-instruct-fast"

input

string<textarea>

required

Multi-line text or document content to send to the model.

Required string length: 1 - 131072

instructions

string

metadata

object

Response

Success

The response is of type object.

Documentation Index

​Specifications

​Try it

Authorizations

Body

Response

Specifications

Try it