Venice AI is a drop-in replacement for OpenAI. Same SDK, same code — just change two lines. Get privacy-first inference, uncensored models, and competitive pricing.Documentation Index
Fetch the complete documentation index at: https://veniceai-experiment-guides-top-level-tab.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
The 2-Line Migration
Python
Node.js
cURL
Environment Variables
Model Mapping
| OpenAI Model | Venice Equivalent | Type | Pricing (Input/Output per 1M) |
|---|---|---|---|
| gpt-4o | zai-org-glm-4.7 (Private) | Text | 2.65 |
| gpt-4o | openai-gpt-52 (Anonymized) | Text | 17.50 |
| gpt-4o-mini | qwen3-4b | Text | 0.15 |
| gpt-4-turbo | mistral-31-24b | Text | 2.00 |
| o1 / o3 | qwen3-235b-a22b-thinking-2507 (Private) | Reasoning | 3.50 |
| o1 / o3 | grok-41-fast (Anonymized) | Reasoning | 1.25 |
| gpt-4-vision | mistral-31-24b or qwen3-vl-235b-a22b | Vision | 2.00 |
| text-embedding-3-small | text-embedding-bge-m3 | Embeddings | 0.60 |
| dall-e-3 | qwen-image (Private, $0.01) or flux-2-pro | Image | From $0.01 |
| whisper | nvidia/parakeet-tdt-0.6b-v3 | STT | $0.0001/sec |
| tts-1 | tts-kokoro | TTS | $3.50/1M chars |
Feature Compatibility
| Feature | OpenAI | Venice | Notes |
|---|---|---|---|
| Chat Completions | ✅ | ✅ | Fully compatible |
| Streaming | ✅ | ✅ | SSE format identical |
| Function Calling | ✅ | ✅ | Same tools parameter |
| Structured Output | ✅ | ✅ | Same response_format |
| Vision | ✅ | ✅ | Same content array format |
| Embeddings | ✅ | ✅ | Same API |
| Image Generation | ✅ | ✅ | OpenAI-compatible via /images/generations* |
| TTS | ✅ | ✅ | Compatible |
| STT | ✅ | ✅ | Compatible |
| Assistants API | ✅ | ❌ | Use Characters or Minds instead |
| Batch API | ✅ | ❌ | Not yet available |
| Fine-tuning | ✅ | ❌ | Not available |
POST /images/generations for easier migration from DALL-E. For Venice’s native image API with additional options, see Image Generate.
Venice-Only Features
Venice offers capabilities OpenAI doesn’t:1. Built-in Web Search
2. Web Scraping
3. Characters (AI Personas)
4. Uncensored Models
Venice’s private models have no content filtering, making them suitable for:- Creative writing without guardrails
- Security research and red teaming
- Honest analysis without refusal patterns
- Medical/legal information without disclaimers
5. Video Generation
Why Migrate?
Privacy
- Zero data retention on private models — your prompts are never stored
- No training on your data — ever
- OpenAI retains data for 30 days and may use it for safety research
Cost
- Private models are often cheaper than OpenAI equivalents
qwen3-4bat $0.05/1M input is 10x cheaper than gpt-4o-minivenice-uncensoredat 2.50/1M
Freedom
- No content filtering on uncensored models
- No account suspensions for controversial use cases
- Web3-native with crypto payment options
- DIEM staking for daily credits
Model Diversity
- Access to models from multiple providers (Qwen, Llama, Mistral, Gemma, Claude, GPT, Grok, etc.)
- Switch between private and anonymized models per request
- New models added regularly
Framework Migration
Most AI frameworks work with Venice by changing the base URL:| Framework | Change Required |
|---|---|
| LangChain | base_url in ChatOpenAI |
| Vercel AI SDK | baseURL in createOpenAI |
| CrewAI | OPENAI_API_BASE env var |
| LlamaIndex | api_base in OpenAI |
| AutoGen | base_url in config |
| Haystack | api_base_url in OpenAIGenerator |
| Claude Code | --api-base flag or env var |
| Cursor | Custom API endpoint in settings |
| Continue.dev | apiBase in config.json |
Get Your API Key
Generate a Venice API key and start migrating in minutes