ZeroReader · Llama 3.1 8B Fast Inference
Generate fast, cost-efficient text completions using Llama 3.1 8B with OpenAI-compatible chat format.
What it does
Generate fast, cost-efficient text completions using Llama 3.1 8B with OpenAI-compatible chat format.
- Power high-volume agent conversations with low latency
- Route cost-sensitive tasks to efficient 8B model
- Build responsive chat interfaces with streaming
- Fallback from larger models for speed-critical paths
Ideal buyer
AI agent developers and applications that need fast, affordable LLM inference for high-volume or latency-sensitive workloads.
Run this through your governed agent wallet.
- 01Bootstrap AXON once with
npx @axon402/init. - 02Use the AXON runtime MCP tools to
search_x402_servicesorinspect_x402_offerfor this service. - 03Quote, test-buy, then run the governed paid fetch through AXON.
Send this
Prompt for your agent
A natural-language instruction for your LLM agent — with this endpoint exposed as a tool — to call this resource. Not sent to the endpoint; the endpoint consumes the JSON body below.
Pasting this prompt into a raw ChatGPT or unconfigured agent will notexecute the paid endpoint flow. Run it through an agent with the AXON runtime / MCP tools exposed (see “Use with AXON” above) so the 402 challenge, quote, and governed fetch are handled for you.
“Summarize this paragraph in 10 words: 'The quick brown fox jumps over the lazy dog while the sun sets behind the mountains.'”
Endpoint request body
The JSON payload your agent sends to the endpoint.
{
"messages": [
{
"role": "user",
"content": "Hello, how are you?"
}
],
"max_tokens": 256,
"temperature": 0.7
}Advanced HTTP details
For integrators who need the raw protocol surface. Most agents should use AXON above instead of calling these directly.
Endpoint URL
curl fallback
curl https://api.zeroreader.com/v1/ai/llama-8b \ -H "Content-Type: application/json" \ -H "X-PAYMENT: [signed_payment_envelope]" \ -d '{"messages":[{"role":"user","content":"Hello, how are you?"}],"max_tokens":256,"temperature":0.7}'
Payment & settlement details
Raw on-chain settlement parameters. AXON above handles these automatically through quote / test-buy / governed fetch.
Price & network
Trust & risk
More in AI / ML
Browse all →Other resources in this category
Category proxy — we don't track live co-purchase signals yet.
X402 · Fe71d056 7d94 4b74 Ae8e 047e0bae87e2 Chat Completions Gpt Audio Mini
x402.slinkylayer.ai
$0.500/callAgent1-gateway · 6a5de11c 2cca 4eb0 848a E11d78dd44cf Chat
agent1-gateway.aurracloud.com
$0.010/callProtocol · Agent Bc
protocol.origindao.ai
$0.0010/callPay · Agent 41tn Psh8wh Dp6skb Jfoi Dt1tqhs3tk Y9x51j Ycnga4na
pay.x402monopoly.com
$0.100/callApi · Agentic Chat
api.ai-pin.io
$0.010/call