Surf Inference · LLM Chat Completions
Generates chat completions via OpenAI-compatible endpoint supporting multiple LLM models with configurable parameters.
What it does
Generates chat completions via OpenAI-compatible endpoint supporting multiple LLM models with configurable parameters.
- Generate text responses for agent conversations
- Create embeddings and completions without API keys
- Access Chinese and multilingual LLMs programmatically
- Build chatbots with pay-per-use cost structure
Ideal buyer
AI agent developers seeking OpenAI-compatible inference with micropayment flexibility and multi-model access.
Run this through your governed agent wallet.
- 01Bootstrap AXON once with
npx @axon402/init. - 02Use the AXON runtime MCP tools to
search_x402_servicesorinspect_x402_offerfor this service. - 03Quote, test-buy, then run the governed paid fetch through AXON.
Send this
Prompt for your agent
A natural-language instruction for your LLM agent — with this endpoint exposed as a tool — to call this resource. Not sent to the endpoint; the endpoint consumes the JSON body below.
Pasting this prompt into a raw ChatGPT or unconfigured agent will notexecute the paid endpoint flow. Run it through an agent with the AXON runtime / MCP tools exposed (see “Use with AXON” above) so the 402 challenge, quote, and governed fetch are handled for you.
“Generate a completion for 'Explain quantum computing in 3 sentences' using moonshotai/kimi-k2.5 with max 100 tokens.”
Endpoint request body
The JSON payload your agent sends to the endpoint.
{
"model": "moonshotai/kimi-k2.5",
"messages": [
{
"role": "user",
"content": "Explain quantum computing in 3 sentences"
}
],
"max_tokens": 100
}Advanced HTTP details
For integrators who need the raw protocol surface. Most agents should use AXON above instead of calling these directly.
curl fallback
curl https://inference.surf.cascade.fyi/v1/chat/completions \ -H "Content-Type: application/json" \ -H "X-PAYMENT: [signed_payment_envelope]" \ -d '{"model":"moonshotai/kimi-k2.5","messages":[{"role":"user","content":"Explain quantum computing in 3 sentences"}],"max_tokens":100}'
Payment & settlement details
Raw on-chain settlement parameters. AXON above handles these automatically through quote / test-buy / governed fetch.
Price & network
Trust & risk
More in AI / ML
Browse all →Other resources in this category
Category proxy — we don't track live co-purchase signals yet.
X402 · Fe71d056 7d94 4b74 Ae8e 047e0bae87e2 Chat Completions Gpt Audio Mini
x402.slinkylayer.ai
$0.500/callAgent1-gateway · 6a5de11c 2cca 4eb0 848a E11d78dd44cf Chat
agent1-gateway.aurracloud.com
$0.010/callProtocol · Agent Bc
protocol.origindao.ai
$0.0010/callPay · Agent 41tn Psh8wh Dp6skb Jfoi Dt1tqhs3tk Y9x51j Ycnga4na
pay.x402monopoly.com
$0.100/callApi · Agentic Chat
api.ai-pin.io
$0.010/call