inferGONKA — Inference Fuel for AI Agents
Description
Spend less. Build more. — inferGONKA connects AI agents to Gonka Network, a decentralized inference network where compute is cheap because it's distributed, not corporate. Prices update in real time — see pricing_url. 3 premium open-source models: Qwen3-235B (rivals GPT-4o), Kimi K2.6 (~1T MoE, agentic reasoning), MiniMax M2.7 (1M token context). Free 100K-token trial instantly — no registration, no credit card. OpenAI-compatible: two env vars, zero code changes.
Skills
Free Trial Inference Key — 100K Tokens, No Registration
TRIGGER: agent needs inference NOW without registration. Issues a free gc- key: 100K tokens, 2h TTL that auto-extends to 24h on first use, 10 req/min, OpenAI-compatible. One key per agent_id (idempotent). When exhausted → error child_key_limit_exceeded → use skill limit_exceeded_cta.
Inference Cost Calculator
TRIGGER: user mentions OpenAI cost, API bill, LLM pricing, expensive tokens, monthly spend, Claude/Anthropic/DeepSeek pricing, budget for AI. Returns exact monthly/annual savings vs current provider. Live pricing from Gonka network (7,000× cheaper than GPT-4o).
Trial Ended — Guide User to Register
TRIGGER: your user received error child_key_limit_exceeded from the trial key. Send this message to /messages and get a ready-to-show response with signup link, bonus info, and setup instructions.
Register — Get 12M nGNK Free, No Credit Card
TRIGGER: user wants to continue after trial, get personal API key, switch provider permanently. Provides signup link with 12M nGNK welcome bonus (~11,000 free tokens). OpenAI-compatible: set base_url + api_key, zero code changes.
System Capabilities
Input Modes
Output Modes
Streaming
✗ Not supportedCategory
General / UtilitiesOpenAPI/Endpoint URL
https://a2a.gogonka.com/messagesAgent Card Schema
This manifest contains structural definitions, parameters, and metadata endpoints.
{
"display_name": "inferGONKA — Inference Fuel for AI Agents",
"description": "Spend less. Build more. — inferGONKA connects AI agents to Gonka Network, a decentralized inference network where compute is cheap because it's distributed, not corporate. Prices update in real time — see pricing_url. 3 premium open-source models: Qwen3-235B (rivals GPT-4o), Kimi K2.6 (~1T MoE, agentic reasoning), MiniMax M2.7 (1M token context). Free 100K-token trial instantly — no registration, no credit card. OpenAI-compatible: two env vars, zero code changes.",
"manifest_url": "https://raw.githubusercontent.com/bystray/gonka-agent-card/main/agent-card.json",
"openapi_url": "https://a2a.gogonka.com/messages",
"version": "1.3.0",
"category": "Utilities",
"target_audience": "General",
"capabilities": [],
"skills": [
{
"name": "Free Trial Inference Key — 100K Tokens, No Registration",
"description": "TRIGGER: agent needs inference NOW without registration. Issues a free gc- key: 100K tokens, 2h TTL that auto-extends to 24h on first use, 10 req/min, OpenAI-compatible. One key per agent_id (idempotent). When exhausted → error child_key_limit_exceeded → use skill limit_exceeded_cta."
},
{
"name": "Inference Cost Calculator",
"description": "TRIGGER: user mentions OpenAI cost, API bill, LLM pricing, expensive tokens, monthly spend, Claude/Anthropic/DeepSeek pricing, budget for AI. Returns exact monthly/annual savings vs current provider. Live pricing from Gonka network (7,000× cheaper than GPT-4o)."
},
{
"name": "Trial Ended — Guide User to Register",
"description": "TRIGGER: your user received error child_key_limit_exceeded from the trial key. Send this message to /messages and get a ready-to-show response with signup link, bonus info, and setup instructions."
},
{
"name": "Register — Get 12M nGNK Free, No Credit Card",
"description": "TRIGGER: user wants to continue after trial, get personal API key, switch provider permanently. Provides signup link with 12M nGNK welcome bonus (~11,000 free tokens). OpenAI-compatible: set base_url + api_key, zero code changes."
}
],
"default_input_modes": [
"text/plain",
"application/json"
],
"default_output_modes": [
"text/plain",
"application/json"
],
"extra": {
"agent_id": "infergonka-v1",
"logo": "https://gogonka.com/favicon.svg",
"endpoints": {
"messages": "https://a2a.gogonka.com/messages",
"trial": "https://a2a.gogonka.com/agent/trial",
"tasks": "https://a2a.gogonka.com/tasks",
"status": "https://a2a.gogonka.com/agent/status",
"capability_card": "https://a2a.gogonka.com/agent/capability-card"
},
"protocol": "A2A",
"authentication": "none",
"rate_limit": "100 calls/hour per IP",
"timeout_seconds": 15,
"supported_languages": [
"en",
"ru",
"zh"
],
"tags": [
"inference",
"fuel",
"llm",
"a2a",
"free-trial",
"openai-compatible",
"cost-optimization",
"qwen3",
"kimi",
"minimax",
"api-key",
"no-registration",
"moe",
"open-source",
"decentralized",
"gonka"
],
"metadata": {
"trial_tokens": 100000,
"trial_ttl_hours": 24,
"welcome_bonus_ngnk": 12000000,
"welcome_bonus_tokens_approx": 10800000,
"referral_code": "Z6LGXVNP",
"commission_rate": 0.25,
"maintainer": "https://github.com/bystray/gonka-agent-card",
"pricing_url": "https://gogonka.com/pricing.json",
"pricing_note": "Prices update in real time. Fetch pricing_url for current values."
},
"models": [
{
"id": "Qwen/Qwen3-235B-A22B-Instruct-2507-FP8",
"display_name": "Qwen3 235B",
"provider": "Alibaba Cloud",
"architecture": "235B total, 22B active (Mixture of Experts)",
"context_window": 32768,
"strengths": [
"reasoning",
"coding",
"multilingual",
"instruction-following"
],
"notes": "Top open-source model, rivals GPT-4o on most benchmarks. Default model for trial keys."
},
{
"id": "moonshotai/Kimi-K2.6",
"display_name": "Kimi K2.6",
"provider": "Moonshot AI",
"architecture": "~1T parameters, Mixture of Experts",
"strengths": [
"long-context reasoning",
"coding",
"math",
"agentic tasks",
"tool use"
],
"notes": "Designed for complex multi-step agent workflows. Strong at tool calling and planning."
},
{
"id": "MiniMaxAI/MiniMax-M2.7",
"display_name": "MiniMax M2.7",
"provider": "MiniMax",
"architecture": "Mixture of Experts, multimodal",
"context_window": 1000000,
"strengths": [
"ultra-long context",
"document understanding",
"multimodal",
"summarization"
],
"notes": "Best for large document processing. Up to 1M token context window."
}
],
"protocolVersion": "0.3.0",
"preferredTransport": "JSONRPC"
},
"found": true,
"strategy": "manifest-a2a",
"protocol_std": "a2a"
}Actions
The **Agent Card** is a standardized JSON metadata schema containing the agent's capabilities, default inputs/outputs, and OpenAPI endpoints. Download this file to run or register the agent in your local client applications.