Near-human IQ for Solana
Solana Agent combines strong reasoning, persistent memory, and native onchain execution in one hosted runtime. It is built for teams whose agents need to plan, act, and keep context across real Solana work.
- Near-human planning, debugging, and follow-through
- Native Solana actions for research, swaps, and DeFi tasks
- Agent-paid x402 settlement on Solana in USDC or USDT
- Persistent memory across sessions and projects
- Dollar-equivalent pricing with clear retention choices
One runtime for the hosted service.
Solana Agent connects wallets, tools, orchestration, and hosted chat and memory access.
Hosted inference for production traffic.
The agent pays for hosted inference via x402 on Solana in USDC or USDT, with dollar-equivalent pricing across chat and memory products. Secure wallet access works with either a private key or Privy embedded signing.
Near-Human Intelligence, Native to Solana
Solana Agent pairs strong reasoning with native Solana execution, persistent memory, and hosted delivery. It is built for agents that need to think well, act reliably, and keep context in production.
Near-Human Reasoning
Plans, adapts, debugs, and completes multi-step work with the kind of follow-through teams expect from a strong operator, not a stateless chatbot.
Native Solana Execution
Built-in tools for swaps, research, wallet actions, and DeFi workflows keep the model close to the chain instead of routing through brittle glue code.
Persistent Memory
Carries goals, prior decisions, and project context across sessions so long-running workflows stay coherent instead of resetting every turn.
Reliable Tool Orchestration
Coordinates multiple tools and steps without losing the objective, making it practical for research, trading, support, and internal automation.
Production Recovery
Handles failures, retries safely, and resumes work with context instead of collapsing after a single bad tool call.
Hosted for Real Traffic
Usage-based products, clear retention choices, and explicit priority behavior make it easier to ship customer-facing agents on Solana.
Near-Human Intelligence for Solana
A hosted runtime that can reason through multi-step work, make grounded decisions, and execute natively on Solana without losing the thread.
- Plans, diagnoses, and follows through across complex tasks
- Connects reasoning directly to onchain actions and wallet flows
- Built for production agents, not demo chat

Persistent Memory & Onchain Action
Keep context across sessions while using native Solana tools for swaps, research, and DeFi execution. The agent remembers the job and can continue it.
- Coherent recall across longer customer workflows
- First-party tools for wallet, market, and DeFi
- Recovery paths that help work continue after failures

Hosted Delivery for Real Customers
Customer-facing delivery with clear pricing, retention options, streaming responses, and priority handling for the workloads that need it.
- Standard and priority products
- Streaming and idempotency support
- Retention options matched to short and long-running work
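One way to read idempotency support alongside safe retries is that a client reuses a single idempotency key for every attempt of the same request, so a repeated attempt can be recognized instead of executed twice. The sketch below illustrates that pattern under stated assumptions: `send`, `TransientError`, and the UUID key format are hypothetical stand-ins and do not describe the actual Solana Agent API.

```python
import time
import uuid


class TransientError(Exception):
    """Hypothetical error type for failures that are safe to retry."""


def send_with_retry(send, payload, *, max_attempts=3, base_delay=0.5,
                    sleep=time.sleep):
    """Call send(payload, idempotency_key) until it succeeds or attempts run out.

    The key is generated once and reused on every attempt, so a server that
    honours idempotency keys can detect a repeat of the same request.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    idempotency_key = str(uuid.uuid4())  # assumed key format

    for attempt in range(1, max_attempts + 1):
        try:
            return send(payload, idempotency_key)
        except TransientError:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last failure
            # Exponential backoff: base_delay, 2x, 4x, ...
            sleep(base_delay * 2 ** (attempt - 1))
```

Here `send` is any callable that performs the request. Keeping the key outside the loop is the important design choice: generating a fresh key per attempt would defeat deduplication.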

How Pricing Works
Pick the hosted product first. Then decide whether to stay on standard service or pay the priority rate when faster admission matters.
Product
Hosted Chat
Stateless traffic and one-off requests.
- Input: $5 / 1M tokens
- Output: $30 / 1M tokens
- Retention: None
Product
Hosted Memory
Persistent agents and workflows with 7-day work retention.
- Input: $10 / 1M tokens
- Output: $60 / 1M tokens
- Retention: 7 days included
Product
Hosted Memory
Persistent agents and workflows with 30-day project retention.
- Input: $10 / 1M tokens
- Output: $60 / 1M tokens
- Retention: +$0.50 / request for 30 days
Service tier
Standard service
Base rate
Default tier for chat and memory. Requests queue under load and bill at the standard product rate.
Service tier
Priority service
2x only when delivered
Available on chat and memory. If unavailable, the request falls back to standard billing and standard handling.
Products
Compare the hosted products directly: chat, memory with 7-day retention, and memory with 30-day retention, each available at standard or priority service.
| Decision point | Hosted Chat / standard | Hosted Chat / priority | Hosted Memory 7 days / standard | Hosted Memory 7 days / priority | Hosted Memory 30 days / standard | Hosted Memory 30 days / priority |
|---|---|---|---|---|---|---|
| Best for | One-off requests | Latency-sensitive traffic | Persistent agents | Urgent persistent agents | Longer-running projects | Urgent longer-running projects |
| Input price | $5 / 1M | $10 / 1M* | $10 / 1M | $20 / 1M* | $10 / 1M | $20 / 1M* |
| Output price | $30 / 1M | $60 / 1M* | $60 / 1M | $120 / 1M* | $60 / 1M | $120 / 1M* |
| Service tier | Standard | Priority | Standard | Priority | Standard | Priority |
| Memory retention | None | None | 7 days | 7 days | 30 days | 30 days |
| Extra memory fee | None | None | None | None | +$0.50 / request | +$0.50 / request |
| Queue treatment | Standard queue | Priority queue when available | Standard queue | Priority queue when available | Standard queue | Priority queue when available |
* Priority pricing is 2x only when priority is actually delivered. If priority is unavailable, billing falls back to the standard product rate for that same request. Hosted Memory uses 7-day work retention by default; 30-day project retention adds $0.50 per request.
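The pricing rules above can be sketched as a small per-request estimator. This is an illustration, not an official billing implementation: the function and parameter names are made up for the example, and it assumes the 2x priority multiplier applies to token charges only, leaving the $0.50 retention fee unchanged, which is how the comparison table lists it.

```python
from decimal import Decimal

# USD per 1M tokens at the standard tier, copied from the pricing table.
RATES = {
    "chat": {"input": Decimal("5"), "output": Decimal("30")},
    "memory": {"input": Decimal("10"), "output": Decimal("60")},
}
LONG_RETENTION_FEE = Decimal("0.50")  # per request, 30-day memory only
PRIORITY_MULTIPLIER = Decimal("2")    # charged only when priority is delivered
PER_MILLION = Decimal("1000000")


def estimate_cost(product, input_tokens, output_tokens, *,
                  retention_days=None, priority_requested=False,
                  priority_delivered=False):
    """Estimate the USD cost of one request as a Decimal."""
    if product not in RATES:
        raise ValueError(f"unknown product: {product!r}")

    if product == "chat":
        if retention_days is not None:
            raise ValueError("Hosted Chat has no retention option")
    else:
        retention_days = 7 if retention_days is None else retention_days
        if retention_days not in (7, 30):
            raise ValueError("Hosted Memory retention is 7 or 30 days")

    rates = RATES[product]
    token_cost = (rates["input"] * input_tokens
                  + rates["output"] * output_tokens) / PER_MILLION

    # Priority bills at 2x only if it was requested AND actually delivered;
    # otherwise the request falls back to the standard rate.
    if priority_requested and priority_delivered:
        token_cost *= PRIORITY_MULTIPLIER

    # Assumption: the retention fee is flat and is not doubled by priority.
    fee = LONG_RETENTION_FEE if retention_days == 30 else Decimal("0")
    return token_cost + fee


# A memory request with 20,000 input and 2,000 output tokens, standard tier:
print(estimate_cost("memory", 20_000, 2_000))  # 0.32
```

Using `Decimal` rather than floats keeps dollar amounts exact. A request that asks for priority but does not receive it returns the same figure as a standard request.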