Together AI

Overview

Together AI is an inference platform specializing in open-source models. It provides fast, cost-effective access to the best open-weight models including Llama, Mixtral, Qwen, and many more, with competitive pricing and high throughput.

Supported Models

Model	Description	Best For
Llama 3.1 405B	Largest open model	Complex reasoning
Llama 3.1 70B	Strong open model	General tasks, coding
Llama 3.1 8B	Fast open model	Quick responses
Mixtral 8x22B	Mistral MoE model	Balanced quality/speed
Qwen 2.5 72B	Alibaba’s open model	Multilingual, coding
DeepSeek V3	DeepSeek open model	Reasoning, analysis

Together AI frequently adds new open-source models as they are released. Check the model dropdown in EnConvo for the latest available models.

Setup

Get API Key

Go to Together AI
Sign in or create an account
Navigate to Settings → API Keys
Create a new API key

Configure in EnConvo

Open Settings → AI Provider
Select Together AI
Go to Credentials module
Enter your API key

Select Model

Choose your preferred model from the dropdown

Configuration

Setting	Description	Default
Credentials	API key configuration	Required
Model Name	Model to use	Llama 3.1 70B
Temperature	Creativity (0-2)	Medium (1)

Validate and Use

Validate credentials

Click Validate in the Together AI credential settings. If validation fails, confirm the API key is active and your account has credits.

Start with a smaller model

Use smaller open models for quick testing before moving to larger models such as 70B or 405B variants.

Confirm model availability

Together AI frequently updates hosted models. If a model fails, select it from the EnConvo dropdown again or choose a nearby model family.

Pricing

Together AI offers competitive pricing for open-source models. Check Together AI Pricing for current rates.

Model	Input	Output
Llama 3.1 405B	$3.50/1M	$3.50/1M
Llama 3.1 70B	$0.88/1M	$0.88/1M
Llama 3.1 8B	$0.18/1M	$0.18/1M
Mixtral 8x22B	$1.20/1M	$1.20/1M

New accounts receive free credits to get started. Together AI pricing is often significantly lower than commercial model providers.

Best Practices

Model Selection

Llama 3.1 405B: When you need maximum open-source quality
Llama 3.1 70B: Best balance of quality and cost for most tasks
Llama 3.1 8B: High-volume or latency-sensitive workloads
Mixtral 8x22B: Good general-purpose alternative
Qwen 2.5 72B: Excellent for multilingual and coding tasks

Cost Optimization

Start with smaller models and scale up only if needed
Use 8B models for simple tasks to save costs
Monitor usage in the Together AI dashboard

Troubleshooting

Invalid API key

Verify the key is copied correctly from api.together.xyz
Check if your account has sufficient credits
Ensure the key has not been revoked

Model not available

Some models may be temporarily offline for maintenance
Try a different model from the same family
Check Together AI status page for service updates

Slow responses

Larger models (405B) take longer to respond
Switch to a smaller model for faster inference
Check if Together AI is experiencing high load

Getting Started

Core Features

AI Capabilities

Providers

Workflows & Extensions

Integrations

Advanced

Configuration

Resources

Overview

Supported Models

Setup

Configuration

Validate and Use

Pricing

Best Practices

Troubleshooting

​Overview

​Supported Models

​Setup

​Configuration

​Validate and Use

​Pricing

​Best Practices

​Troubleshooting

Overview

Supported Models

Setup

Configuration

Validate and Use

Pricing

Best Practices

Troubleshooting