Complete guide to the stealth 100B AI model on OpenRouter. Free, fast (~65 tok/s), 256K context. Compare costs, get API setup code, and learn the best use cases.
See how much you save switching to Elephant Alpha from paid models.
| Model | Params | Context | Speed | Input $/M | Output $/M | Best For |
|---|---|---|---|---|---|---|
| Elephant Alpha FREE | 100B | 256K | ~65 tok/s FAST | $0 | $0 | Coding, agents, batch |
| Claude Opus 4.7 | ~2T | 200K | ~30 tok/s | $15 | $75 | Complex reasoning |
| GPT-5.4 | ~1.8T | 128K | ~40 tok/s | $10 | $30 | General purpose |
| Claude Sonnet 4.5 | ~350B | 200K | ~80 tok/s | $3 | $15 | Fast + smart balance |
| Gemini 3.1 Pro | ~500B | 2M | ~55 tok/s | $3.5 | $10.5 | Long context |
| Qwen 3.6-35B | 35B | 128K | ~90 tok/s | $0.15 | $0.60 | Local agents |
| DeepSeek V3 | 671B MoE | 128K | ~50 tok/s | $0.27 | $1.10 | Cost-efficient |
openrouter/elephant-alpha as the model identifier in your API calls.https://openrouter.ai/api/v1. Compatible with OpenAI SDK format.Use Claude or GPT for planning and task decomposition, then hand off individual steps to Elephant Alpha for execution. This "planner + executor" pattern cuts API costs by 80-90% while maintaining quality.
Elephant Alpha handles code generation, refactoring, and bug fixing with minimal token overhead. It stays focused on the task without unnecessary explanations — perfect for IDE integrations and CI/CD pipelines.
Summarize contracts, convert meeting transcripts to structured tables, extract data from PDFs. The 256K context window handles long documents, and the efficient output means faster processing at zero cost.
Process thousands of items cheaply: classify support tickets, generate product descriptions, tag content, extract entities. At $0/token, batch size is only limited by rate limits.
Powers tools like OpenClaw for web scraping, form filling, and automated testing. The fast response time keeps agent loops snappy, and the free pricing makes long-running automation affordable.
| Model ID | openrouter/elephant-alpha |
| Parameters | ~100 billion |
| Context Window | 262,144 tokens (256K) |
| Max Output | 32,768 tokens (32K) |
| Speed | ~65 tokens/second |
| Pricing | $0 / million tokens (input & output) |
| Release Date | ~April 13, 2026 |
| Provider | Anonymous (via OpenRouter stealth program) |
| Features | Function calling, structured outputs, prompt caching |
| Modality | Text only |
| Access | OpenRouter API (OpenAI-compatible) |
Elephant Alpha is part of OpenRouter's stealth program, where anonymous providers test models through blind evaluation. Other stealth models that have appeared include:
These models typically appear during testing phases and may change, improve, or disappear without notice. Take advantage of the free access while it lasts.