POST /responses
llama-3.1-8b-instruct-fast: Responses
curl --request POST \
  --url https://api.zerogpu.ai/v1/responses \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --header 'x-project-id: <project-id>' \
  --data '
{
  "input": "NASA announced that its Artemis III mission is now scheduled for late 2026, marking the first time astronauts will land on the lunar surface since Apollo 17 in 1972. The mission will send a crew of four to the Moon aboard the Orion spacecraft, with two astronauts descending to the south pole using SpaceX Starship as a lunar lander. Scientists are particularly excited about exploring permanently shadowed craters that may contain water ice, which could be critical for sustaining long-term human presence on the Moon.",
  "model": "llama-3.1-8b-instruct-fast"
}
'
{}

Documentation Index

Fetch the complete documentation index at: https://docs.zerogpu.ai/llms.txt

Use this file to discover all available pages before exploring further.

The Meta Llama 3.1 collection comprises pretrained and instruction-tuned multilingual generative large language models (LLMs) in 8B, 70B, and 405B sizes (text in, text out). The instruction-tuned, text-only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open-source and closed chat models on common industry benchmarks.
References: Model docs, Terms, Privacy

Specifications

Property        Value
Model ID        llama-3.1-8b-instruct-fast
Task            Summarization
Type            llama-3.1
Parameters      8B
Version         1
Max Tokens      131072
Provider        Hugging Face
Input Price     $0.05 / 1M
Output Price    $0.40 / 1M
Total Price     $0.45 / 1M
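
The listed prices can be turned into a rough per-request cost estimate. The sketch below assumes that "/ 1M" means per one million tokens, that input and output tokens are billed separately at the listed rates, and that billing is linear; none of this is confirmed by the page. The function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Prices copied from the specifications table.
# Assumption: the unit is USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.40")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# One million tokens in and one million tokens out should match
# the "Total Price" row of the table.
print(estimate_cost(1_000_000, 1_000_000))
```

Decimal is used instead of float so that small per-request amounts are not distorted by binary rounding.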

Try it

Send a live request with your x-api-key and x-project-id. The model is fixed to llama-3.1-8b-instruct-fast. Use the request examples below to switch use cases (JSON extraction, NER, PII, and so on).

Authorizations

x-api-key: string, header, required
x-project-id: string, header, required
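
As a minimal sketch, the two required headers can be assembled in one place so that a missing credential fails before any request is sent. The helper names and the environment variable names `ZEROGPU_API_KEY` and `ZEROGPU_PROJECT_ID` are assumptions made for illustration and do not come from the documentation.

```python
import os


def build_headers(api_key: str, project_id: str) -> dict[str, str]:
    """Return the headers the endpoint lists as required (hypothetical helper)."""
    if not api_key or not project_id:
        raise ValueError("both x-api-key and x-project-id are required")
    return {
        "Content-Type": "application/json",
        "x-api-key": api_key,
        "x-project-id": project_id,
    }


def headers_from_env() -> dict[str, str]:
    """Read credentials from the environment.

    The variable names are an assumption, not an official convention.
    """
    return build_headers(
        os.environ.get("ZEROGPU_API_KEY", ""),
        os.environ.get("ZEROGPU_PROJECT_ID", ""),
    )
```

Keeping credentials in environment variables rather than in source files is a common choice; any other secret store would work equally well here.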

Body

Content type: application/json

model: string, required, default: llama-3.1-8b-instruct-fast

Model identifier (fixed for this playground). Use request examples to change use cases.

Allowed value: "llama-3.1-8b-instruct-fast"
Example: "llama-3.1-8b-instruct-fast"

input: string, required

Multi-line text or document content to send to the model.

Required string length: 1 - 131072

instructions: string

metadata: object
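
The body fields above can be checked on the client before sending. The following is a sketch under stated assumptions: the length limit is applied to the character count of the string (the server may count differently), and the optional `instructions` and `metadata` fields are simply omitted when not provided. The function name `build_body` is hypothetical.

```python
import json
from typing import Optional

MODEL_ID = "llama-3.1-8b-instruct-fast"  # the only allowed value for "model"
MAX_INPUT_LENGTH = 131072  # upper bound from "Required string length"


def build_body(
    input_text: str,
    instructions: Optional[str] = None,
    metadata: Optional[dict] = None,
) -> str:
    """Serialize a request body as JSON (hypothetical helper)."""
    # Assumption: the documented limit applies to the number of characters.
    if not 1 <= len(input_text) <= MAX_INPUT_LENGTH:
        raise ValueError(
            f"input must be between 1 and {MAX_INPUT_LENGTH} characters"
        )
    body: dict = {"model": MODEL_ID, "input": input_text}
    # Optional fields are left out entirely when not set.
    if instructions is not None:
        body["instructions"] = instructions
    if metadata is not None:
        body["metadata"] = metadata
    return json.dumps(body)


print(build_body("Text to summarize goes here."))
```

Validating locally avoids a round trip for inputs that the documented length constraint would reject anyway.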

Response

Success

The response is of type object.
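
The page states only that a successful response is a JSON object, so a client can verify that much and nothing more. The sketch below uses Python's standard library to send the request and check the top-level type. The helper names `parse_response` and `send` are hypothetical, and no fields of the response are assumed.

```python
import json
import urllib.request

URL = "https://api.zerogpu.ai/v1/responses"


def parse_response(raw: bytes) -> dict:
    """Decode a response body and confirm it is a JSON object."""
    parsed = json.loads(raw.decode("utf-8"))
    if not isinstance(parsed, dict):
        raise TypeError("expected a JSON object at the top level")
    return parsed


def send(payload: dict, api_key: str, project_id: str,
         timeout: float = 30.0) -> dict:
    """POST the payload and return the parsed response (hypothetical helper)."""
    request = urllib.request.Request(
        URL,
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            "x-api-key": api_key,
            "x-project-id": project_id,
        },
    )
    # urlopen raises urllib.error.HTTPError for 4xx and 5xx status codes.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_response(response.read())
```

A call would look like `send({"model": "llama-3.1-8b-instruct-fast", "input": "..."}, api_key, project_id)`, with the caller supplying real credentials.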