A full request - here the llama-3.1-8b-instruct-fast model summarizing a passage:
curl https://api.zerogpu.ai/v1/responses \ -H "content-type: application/json" \ -H "x-api-key: $ZEROGPU_API_KEY" \ -H "x-project-id: $ZEROGPU_PROJECT_ID" \ -d '{ "model": "llama-3.1-8b-instruct-fast", "input": "NASA announced that its Artemis III mission is now scheduled for late 2026, marking the first time astronauts will land on the lunar surface since Apollo 17 in 1972. The mission will send a crew of four to the Moon aboard the Orion spacecraft, with two astronauts descending to the south pole using SpaceX Starship as a lunar lander. Scientists are particularly excited about exploring permanently shadowed craters that may contain water ice, which could be critical for sustaining long-term human presence on the Moon." }'
🎉 The model returns the summary:
NASA's Artemis III mission is now scheduled for late 2026, marking the first lunar landing since Apollo 17 in 1972, with a crew of four astronauts aboard the Orion spacecraft, including two descending to the Moon's south pole using SpaceX Starship as a lunar lander to explore permanently shadowed craters potentially containing water ice crucial for sustaining a long-term human presence on the Moon.