AI Models
Learn about the different AI models available on Vectly and how to choose the right one for your needs.
Last Updated: 5/27/2025AI Models on Vectly
Vectly provides access to multiple state-of-the-art AI models, each with unique strengths and optimal use cases. This guide helps you choose the right model for your needs.
Available Models
Claude Models (Anthropic)
Claude 4 Opus
- Best for: Complex reasoning, detailed analysis, creative writing
- Strengths:
- Exceptional reasoning capabilities
- Nuanced understanding of context
- Strong safety features
- Great for technical documentation
 
- Context Window: 200K tokens
- Credit Cost: Higher (see pricing)
Claude 4 Sonnet
- Best for: Balanced performance and cost
- Strengths:
- Good reasoning abilities
- Faster than Opus
- Cost-effective for most tasks
- Reliable for code generation
 
- Context Window: 200K tokens
- Credit Cost: Medium
GPT Models (OpenAI)
GPT-4.1 Turbo
- Best for: General-purpose tasks, code generation, analysis
- Strengths:
- Excellent code understanding
- Strong general knowledge
- Fast response times
- Good at following instructions
 
- Context Window: 128K tokens
- Credit Cost: Medium-High
GPT-4o
- Best for: Multimodal tasks, vision understanding
- Strengths:
- Can analyze images
- Native multimodal understanding
- Efficient processing
- Good for visual content
 
- Context Window: 128K tokens
- Credit Cost: Medium
GPT-4o Mini
- Best for: Quick queries, simple tasks, testing
- Strengths:
- Very fast responses
- Low credit usage
- Good for simple questions
- Ideal for development/testing
 
- Context Window: 128K tokens
- Credit Cost: Low
O1 Models (OpenAI)
O1 Preview
- Best for: Complex reasoning, mathematical problems, coding challenges
- Strengths:
- Advanced reasoning capabilities
- Step-by-step problem solving
- Excellent for STEM tasks
- Self-correcting reasoning
 
- Context Window: 128K tokens
- Credit Cost: Very High
O1 Mini
- Best for: Reasoning tasks with budget constraints
- Strengths:
- Good reasoning abilities
- More affordable than O1 Preview
- Faster processing
- Suitable for moderate complexity
 
- Context Window: 128K tokens
- Credit Cost: High
Choosing the Right Model
By Use Case
📝 Writing & Content Creation
- Best: Claude 4 Opus
- Alternative: GPT-4.1 Turbo
- Budget: Claude 4 Sonnet
💻 Code Generation & Debugging
- Best: GPT-4.1 Turbo
- Alternative: Claude 4 Sonnet
- Budget: GPT-4o Mini
🧮 Complex Problem Solving
- Best: O1 Preview
- Alternative: Claude 4 Opus
- Budget: O1 Mini
🔍 Research & Analysis
- Best: Claude 4 Opus
- Alternative: GPT-4.1 Turbo
- Budget: Claude 4 Sonnet
🖼️ Image Analysis
- Best: GPT-4o
- Alternative: GPT-4.1 Turbo (text only)
⚡ Quick Questions
- Best: GPT-4o Mini
- Alternative: Claude 4 Sonnet
Model Features
Reasoning Mode
Available for Claude and O1 models:
- Visible Reasoning (Claude): See the AI's thought process
- Hidden Reasoning (O1): Advanced reasoning without visible steps
Web Search
Compatible with all models:
- Search current information
- Verify facts and data
- Find recent events
- Additional credit cost applies
Prompt Caching
Reduces costs for repetitive contexts:
- Available for Claude models
- Automatically caches repeated prompts
- Significant savings for long contexts
Credit Usage
Credit consumption varies by:
- Model Selected: Premium models cost more
- Input Length: Longer prompts use more credits
- Output Length: Longer responses cost more
- Features Used: Web search, reasoning add costs
Approximate Credit Costs (per 1K tokens)
| Model | Input | Output | 
|---|---|---|
| GPT-4o Mini | 0.1 | 0.3 | 
| Claude 4 Sonnet | 0.3 | 1.5 | 
| GPT-4.1 Turbo | 0.5 | 1.5 | 
| GPT-4o | 0.5 | 1.5 | 
| Claude 4 Opus | 1.5 | 7.5 | 
| O1 Mini | 3.0 | 12.0 | 
| O1 Preview | 15.0 | 60.0 | 
Best Practices
1. Start Small
- Test with GPT-4o Mini first
- Upgrade to premium models if needed
- Use prompt caching for repeated contexts
2. Match Model to Task
- Don't use O1 Preview for simple questions
- Don't use Mini models for complex analysis
- Consider context window limits
3. Optimize for Credits
- Be concise in your prompts
- Use system prompts effectively
- Enable caching when applicable
- Monitor usage in real-time
4. Leverage Strengths
- Use Claude for nuanced writing
- Use GPT-4 for code and general tasks
- Use O1 for complex reasoning
- Use Mini models for iteration
Model Comparison Table
| Feature | Claude 4 Opus | Claude 4 Sonnet | GPT-4.1 Turbo | GPT-4o | GPT-4o Mini | O1 Preview | O1 Mini | 
|---|---|---|---|---|---|---|---|
| Best For | Complex tasks | Balanced use | General purpose | Multimodal | Quick queries | Reasoning | Budget reasoning | 
| Speed | Slow | Medium | Fast | Fast | Very Fast | Slow | Medium | 
| Context | 200K | 200K | 128K | 128K | 128K | 128K | 128K | 
| Reasoning | Excellent | Good | Good | Good | Basic | Exceptional | Very Good | 
| Code | Very Good | Good | Excellent | Very Good | Good | Excellent | Very Good | 
| Cost | High | Medium | Medium | Medium | Low | Very High | High | 
Coming Soon
We're constantly adding new models:
- Gemini Pro models
- Specialized fine-tuned models
- Custom model deployment
- Model mixing capabilities
Stay tuned for updates!