Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.enconvo.ai/llms.txt

Use this file to discover all available pages before exploring further.

Overview

Together AI is an inference platform specializing in open-source models. It provides fast, cost-effective access to the best open-weight models including Llama, Mixtral, Qwen, and many more, with competitive pricing and high throughput.

Supported Models

ModelDescriptionBest For
Llama 3.1 405BLargest open modelComplex reasoning
Llama 3.1 70BStrong open modelGeneral tasks, coding
Llama 3.1 8BFast open modelQuick responses
Mixtral 8x22BMistral MoE modelBalanced quality/speed
Qwen 2.5 72BAlibaba’s open modelMultilingual, coding
DeepSeek V3DeepSeek open modelReasoning, analysis
Together AI frequently adds new open-source models as they are released. Check the model dropdown in EnConvo for the latest available models.

Setup

1

Get API Key

  1. Go to Together AI
  2. Sign in or create an account
  3. Navigate to SettingsAPI Keys
  4. Create a new API key
2

Configure in EnConvo

  1. Open SettingsAI Provider
  2. Select Together AI
  3. Go to Credentials module
  4. Enter your API key
3

Select Model

Choose your preferred model from the dropdown

Configuration

SettingDescriptionDefault
CredentialsAPI key configurationRequired
Model NameModel to useLlama 3.1 70B
TemperatureCreativity (0-2)Medium (1)

Pricing

Together AI offers competitive pricing for open-source models. Check Together AI Pricing for current rates.
ModelInputOutput
Llama 3.1 405B$3.50/1M$3.50/1M
Llama 3.1 70B$0.88/1M$0.88/1M
Llama 3.1 8B$0.18/1M$0.18/1M
Mixtral 8x22B$1.20/1M$1.20/1M
New accounts receive free credits to get started. Together AI pricing is often significantly lower than commercial model providers.

Best Practices

  • Llama 3.1 405B: When you need maximum open-source quality
  • Llama 3.1 70B: Best balance of quality and cost for most tasks
  • Llama 3.1 8B: High-volume or latency-sensitive workloads
  • Mixtral 8x22B: Good general-purpose alternative
  • Qwen 2.5 72B: Excellent for multilingual and coding tasks
  • Start with smaller models and scale up only if needed
  • Use 8B models for simple tasks to save costs
  • Monitor usage in the Together AI dashboard

Troubleshooting

  • Verify the key is copied correctly from api.together.xyz
  • Check if your account has sufficient credits
  • Ensure the key has not been revoked
  • Some models may be temporarily offline for maintenance
  • Try a different model from the same family
  • Check Together AI status page for service updates
  • Larger models (405B) take longer to respond
  • Switch to a smaller model for faster inference
  • Check if Together AI is experiencing high load