Documentation Index
Fetch the complete documentation index at: https://docs.enconvo.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
EnConvo supports 25+ AI providers and hundreds of models. This guide helps you choose the right model for your specific needs, balancing quality, speed, cost, and privacy.Quick Recommendation
Not sure where to start? Here are our top picks for common scenarios:| Scenario | Recommended Model | Why |
|---|---|---|
| General daily use | Claude Haiku 4.5 | Fast, affordable, handles most tasks well |
| Complex reasoning | Claude Sonnet 4.5 or GPT-5 | Deep analysis, multi-step problem solving |
| Coding tasks | Claude Sonnet 4.5 | Best code understanding and generation |
| Creative writing | GPT-5 or Claude Opus 4 | Natural language, creative expression |
| Long documents | Gemini 2.5 Flash | 1M token context window |
| Math and science | DeepSeek V3 | Specialized reasoning capabilities |
| Privacy-first | Ollama (Llama 3) | Runs entirely on your Mac |
| Budget-friendly | Gemini 2.5 Flash Lite | Excellent quality at lowest cost |
Model Capability Comparison
Flagship Models
| Model | Provider | Context | Multimodal | Tool Use | Strengths |
|---|---|---|---|---|---|
| GPT-5 | OpenAI | 128K | Yes | Yes | Broad knowledge, creative tasks, vision |
| Claude Sonnet 4.5 | Anthropic | 200K | Yes | Yes | Coding, analysis, instruction following |
| Claude Opus 4 | Anthropic | 200K | Yes | Yes | Complex reasoning, agentic tasks |
| Gemini 2.5 Pro | 1M | Yes | Yes | Long context, research, multimodal | |
| Gemini 2.5 Flash | 1M | Yes | Yes | Fast, efficient, long context | |
| DeepSeek V3 | DeepSeek | 128K | Yes | Yes | Math, coding, reasoning at low cost |
| Grok | xAI | 128K | Yes | Yes | Real-time knowledge, reasoning |
Cost-Efficient Models
| Model | Provider | Context | Best For | Relative Cost |
|---|---|---|---|---|
| GPT-5 Mini | OpenAI | 128K | General tasks, fast responses | Low |
| Claude Haiku 4.5 | Anthropic | 200K | Quick tasks, high volume | Low |
| Gemini 2.5 Flash Lite | 1M | Budget-friendly, long context | Lowest | |
| DeepSeek V3 | DeepSeek | 128K | Coding, math at great value | Low |
| Groq (Llama 3) | Groq | 128K | Ultra-fast inference | Low |
Local Models (Free, Private)
| Model | Platform | RAM Needed | Best For |
|---|---|---|---|
| Llama 3.3 70B | Ollama | 40+ GB | Most capable local model |
| Llama 3.1 8B | Ollama | 8 GB | General tasks, fast |
| Qwen 2.5 72B | Ollama | 40+ GB | Multilingual, coding |
| Mistral Small | Ollama | 8 GB | European languages |
| CodeLlama 34B | Ollama | 20+ GB | Code-focused tasks |
| DeepSeek Coder V2 | Ollama | 16+ GB | Coding at moderate size |
| Any GGUF model | LM Studio | Varies | Custom model loading |
Choosing by Use Case
Coding and Development
- Best Overall
- Best Value
- Best Local
Claude Sonnet 4.5 is the top choice for coding tasks:
- Excellent code understanding across 20+ languages
- Generates clean, well-structured code
- Strong at debugging and refactoring
- Understands complex codebases and architectures
- 200K context for large code files
Writing and Content Creation
- Creative Writing
- Technical Writing
- Quick Drafts
GPT-5 excels at creative tasks:
- Natural, engaging prose
- Strong narrative structure
- Good at maintaining voice and tone
- Excellent vocabulary and style variety
Research and Analysis
- Long Documents
- Deep Analysis
- Quick Lookups
Gemini 2.5 Pro or Gemini 2.5 Flash with their 1M token context:
- Process entire books, research papers, or codebases
- Cross-reference information across hundreds of pages
- Enable Google Search grounding for real-time information
- URL Context tool for analyzing web pages directly
Math, Science, and Reasoning
| Task | Recommended Model | Notes |
|---|---|---|
| Advanced math | DeepSeek V3 | Strong mathematical reasoning |
| Physics/Chemistry | Claude Sonnet 4.5 or GPT-5 | Good scientific knowledge |
| Data analysis | Gemini 2.5 Pro | Handles large datasets with long context |
| Logic puzzles | Claude Opus 4 | Excels at complex reasoning chains |
| Statistics | DeepSeek V3 | Cost-effective for quantitative tasks |
Multilingual Tasks
| Language Group | Best Cloud Model | Best Local Model |
|---|---|---|
| English | Any top-tier model | Llama 3.1 8B |
| Chinese | DeepSeek V3, Qwen | Qwen 2.5 |
| Japanese/Korean | GPT-5, Claude | Qwen 2.5 |
| European | Claude, GPT-5 | Mistral |
| Multilingual | Gemini 2.5 | Qwen 2.5 72B |
Cloud vs Local: Decision Guide
Choose Cloud Models When:
- You need the highest accuracy and capability
- You work with many different tasks throughout the day
- You have reliable internet access
- Speed of response matters more than privacy
- You need the latest knowledge and capabilities
Choose Local Models When:
- Privacy is critical (sensitive data, proprietary code)
- You work offline frequently
- You want zero ongoing API costs
- You have an Apple Silicon Mac with sufficient RAM
- Your tasks are well-suited to smaller models
Hybrid Approach (Recommended)
Most users benefit from combining both:| Task Type | Use |
|---|---|
| Sensitive code review | Local (Ollama) |
| Creative writing | Cloud (GPT-5 / Claude) |
| Quick translations | Cloud (fast model) |
| Confidential documents | Local (Ollama) |
| Complex research | Cloud (Gemini with long context) |
| Daily chat | Cloud (Haiku / GPT-5 Mini) |
Multimodal Capabilities
Models that can understand images, audio, and other media:| Model | Images | Audio | Video | Documents |
|---|---|---|---|---|
| GPT-5 | Yes | Yes | No | Yes |
| Claude Sonnet 4.5 | Yes | No | No | Yes |
| Gemini 2.5 Pro | Yes | Yes | Yes | Yes |
| Gemini 2.5 Flash | Yes | Yes | Yes | Yes |
| Grok | Yes | No | No | Yes |
Cost Optimization Strategies
Use tiered models
Use tiered models
Route simple tasks to cheap models and complex tasks to powerful ones:
- Quick questions: Claude Haiku 4.5 or GPT-5 Mini
- Standard work: Claude Sonnet 4.5 or GPT-5
- Complex analysis: Claude Opus 4 or Gemini 2.5 Pro
Leverage Enconvo Cloud Plan
Leverage Enconvo Cloud Plan
The Enconvo Cloud Plan provides access to all major providers through a single subscription with points-based pricing. This avoids managing multiple API keys and lets you switch models freely.
Use local models for high-volume tasks
Use local models for high-volume tasks
If you run hundreds of queries daily, local models via Ollama eliminate per-query costs entirely. A one-time hardware investment pays for itself quickly.
Optimize context length
Optimize context length
Sending unnecessary context increases costs. Use EnConvo’s context awareness features to include only relevant information, and leverage knowledge base RAG instead of stuffing entire documents into the prompt.
Choose the right model size
Choose the right model size
Bigger is not always better. For simple tasks (formatting, extraction, classification), a small fast model delivers identical results at a fraction of the cost of a flagship model.
Configuring Models in EnConvo
Select AI Model Provider
Find the AI Model Provider setting and click to change it. You will see all configured providers.
Choose Provider and Model
Select the provider (OpenAI, Anthropic, Google, etc.) and then choose the specific model from the Model Name dropdown.
Adjust Temperature
Set the temperature based on your needs:
- 0 (none): Deterministic, consistent responses — best for coding, math, extraction
- 0.5 (low): Slightly varied — good for general tasks
- 1.0 (medium): Balanced creativity — default for most use cases
- 1.5 (high): More creative — good for brainstorming, writing
- 2.0 (maximum): Most varied — experimental, creative exploration
Provider Setup Quick Reference
| Provider | Setup | API Key Source |
|---|---|---|
| Enconvo Cloud | Built-in, no setup | Enconvo subscription |
| OpenAI | Add API key | platform.openai.com |
| Anthropic | Add API key | console.anthropic.com |
| Add API key | ai.google.dev | |
| DeepSeek | Add API key | platform.deepseek.com |
| Groq | Add API key | console.groq.com |
| Ollama | Install Ollama app | Free, no key needed |
| LM Studio | Install LM Studio app | Free, no key needed |
| OpenRouter | Add API key | openrouter.ai |
| Mistral | Add API key | console.mistral.ai |
| Perplexity | Add API key | perplexity.ai |
| xAI | Add API key | console.x.ai |
For detailed setup instructions for each provider, see the Providers section of the documentation.
Switching Models On-the-Fly
In the AI Chat interface, you can switch models at any time:- Click the model name in the chat header
- Select a different model from the dropdown
- Continue your conversation with the new model
- Starting with a fast model for quick back-and-forth, then switching to a powerful model for the final task
- Trying the same prompt with different models to compare results
- Using a vision model when you need to share an image
Related Features
Provider Setup
Configure AI providers and API keys
Local LLM
Set up Ollama and local models
AI Chat
Use models in conversation
Enconvo Cloud Plan
Access all models with one subscription