Advanced Features
Explore Vectly's advanced features including web search, reasoning modes, prompt caching, and more.
Last Updated: 5/27/2025Advanced Features
Vectly offers powerful advanced features that enhance AI capabilities beyond basic chat. These features enable current information access, transparent reasoning, cost optimization, and more sophisticated interactions.
Web Search
Overview
Web search allows AI models to access current information from the internet, overcoming knowledge cutoff limitations and providing up-to-date responses.
How It Works
- Query Analysis: AI determines if web search would help
- Search Execution: Relevant searches performed automatically
- Result Processing: Information extracted and verified
- Response Integration: Findings incorporated into answer
- Source Citation: Links provided for verification
Enabling Web Search
Per Message
- Click the search icon in input area
- Toggle on for current message
- Automatically used when relevant
Per Chat
- Enable in chat settings
- Applies to all messages
- Can override per message
Use Cases
Current Events
"What happened in tech news today?"
"Latest updates on AI regulations"
"Current stock market status"
Fact Checking
"Verify the latest statistics on climate change"
"What's the current population of Tokyo?"
"Latest COVID-19 guidelines"
Product Research
"Compare the latest iPhone models"
"Best laptops for programming in 2024"
"Current pricing for cloud services"
Best Practices
- Be specific about timeframe needed
- Ask for recent/current information
- Verify critical information
- Check source reliability
- Note search adds 2 credits
Reasoning Mode
What is Reasoning Mode?
Advanced problem-solving features that show or enhance the AI's thinking process, available for specific models.
Types of Reasoning
Visible Reasoning (Claude)
- See AI's complete thought process
- Understand decision making
- Follow logical steps
- Identify assumptions
- Build trust through transparency
Enhanced Reasoning (O1 Models)
- Advanced problem-solving
- Multi-step thinking
- Self-correction
- Complex analysis
- Hidden by default
Enabling Reasoning
Claude Models
- Click reasoning icon in chat
- Enable "Show thinking process"
- See thoughts in special blocks
- Thoughts don't count toward output
O1 Models
- Select O1 model
- Choose reasoning effort level
- Higher effort = better reasoning
- More credits for higher effort
Use Cases
Complex Problem Solving
"Design a scalable microservices architecture"
"Solve this algorithmic challenge"
"Debug this complex issue"
Analysis and Research
"Analyze these competing theories"
"Evaluate business strategies"
"Compare technical approaches"
Learning and Understanding
"Explain quantum computing step by step"
"Break down this mathematical proof"
"Trace through this algorithm"
Reasoning Best Practices
- Use for complex tasks only
- Review reasoning for insights
- Higher effort for critical tasks
- Balance cost vs. benefit
- Save reasoning for reference
Prompt Caching
Overview
Prompt caching reduces costs by storing and reusing common prompt prefixes, particularly beneficial for long contexts used repeatedly.
How It Works
- Automatic Detection: System identifies repeated patterns
- Cache Creation: Common prefixes stored
- Cache Hits: Reused prompts cost less
- Smart Invalidation: Updates when context changes
- Transparent Savings: See cache savings in real-time
Benefits
- Cost Reduction: Up to 90% savings on cached portions
- Faster Responses: Cached content processes quicker
- Consistent Context: Same context across messages
- Automatic Management: No manual configuration needed
Supported Models
- Claude 4 Opus ✅
- Claude 4 Sonnet ✅
- Other models coming soon
Optimization Strategies
System Prompts
"You are an expert Python developer. 
Follow PEP 8 standards.
Use type hints.
Write comprehensive docstrings."
Reuse across conversations for consistency and savings.
Document Context
- Upload documents to project
- Reference in multiple chats
- Automatic caching applies
- Significant savings
Code Context
- Connect GitHub repositories
- Frequently accessed files cached
- Cross-chat benefits
- Version-aware caching
Monitoring Cache Performance
- See cache indicators in chat
- View savings per message
- Track total cache benefits
- Optimize based on patterns
Multi-Modal Capabilities
Image Understanding (GPT-4V)
Supported Operations
- Image analysis and description
- Text extraction (OCR)
- Diagram understanding
- Chart and graph reading
- Visual question answering
How to Use
- Upload image with message
- Ask questions about image
- AI analyzes and responds
- Combine with text context
Use Cases
"What's in this screenshot?"
"Extract text from this image"
"Explain this diagram"
"Analyze this chart's trends"
Coming Soon: Image Generation
- Text-to-image creation
- Style control
- Variations and editing
- Integration with chat context
Function Calling
Overview
Function calling allows AI to interact with external tools and APIs, enabling actions beyond text generation.
Current Capabilities
- Structured data extraction
- API request formatting
- Tool use planning
- Parameter validation
Use Cases
"Extract all email addresses from this text"
"Format this data as JSON"
"Create API request for user creation"
"Parse this log file for errors"
Coming Soon
- Direct API integration
- Custom function definitions
- Workflow automation
- Third-party tool connections
Context Management
Smart Context Window
Automatic Prioritization
- Most relevant content first
- Recent messages prioritized
- Important context preserved
- Automatic truncation
Manual Control
- Pin important messages
- Clear unnecessary context
- Reorder information
- Optimize token usage
Cross-Chat Context
Project-Level Sharing
- Context available across chats
- Consistent information
- Reduced duplication
- Better organization
Global Context (Coming Soon)
- Personal knowledge base
- Cross-project access
- Persistent memory
- Advanced retrieval
Performance Features
Streaming Responses
- Real-time text generation
- See thoughts as they form
- Cancel mid-response
- Immediate feedback
Parallel Processing
- Multiple file analysis
- Concurrent searches
- Batch operations
- Efficient workflows
Response Control
- Stop generation anytime
- Continue incomplete responses
- Regenerate with variations
- Edit and retry
Customization Options
Model Parameters
Temperature Control (Coming Soon)
- Adjust creativity level
- 0 = Focused, deterministic
- 1 = Creative, varied
- Balance for task needs
Response Length
- Set maximum tokens
- Ensure complete answers
- Control costs
- Prevent cutoffs
Output Formatting
Structured Responses
Request specific formats:
"Respond in JSON format"
"Use markdown tables"
"Provide bullet points"
"Include code blocks"
Language Support
- Multi-language responses
- Code syntax highlighting
- Mathematical notation
- Technical diagrams
Integration Features
API Access (Coming Soon)
Programmatic Usage
- RESTful API endpoints
- Authentication tokens
- Rate limiting
- Usage tracking
Webhook Support
- Event notifications
- Response callbacks
- Status updates
- Error handling
Export Capabilities
Current Features
- Copy responses
- Download chat history
- Export code blocks
- Save conversations
Coming Soon
- Batch export
- Format conversion
- API integration
- Automated backups
Best Practices
Feature Selection
- Match feature to task: Don't overuse expensive features
- Start simple: Add features as needed
- Monitor costs: Track feature usage
- Optimize combinations: Some features work well together
Cost Optimization
- Enable caching for repeated tasks
- Use web search selectively
- Choose appropriate reasoning levels
- Monitor credit consumption
Quality Optimization
- Use reasoning for complex problems
- Enable search for current info
- Leverage caching for consistency
- Combine features strategically
Troubleshooting
Feature Not Working
Web Search Issues
- Check internet connection
- Verify feature enabled
- Try rephrasing query
- Check credit balance
Reasoning Problems
- Ensure compatible model
- Check effort settings
- Verify prompt clarity
- Monitor timeout issues
Cache Misses
- Check for context changes
- Verify model support
- Review usage patterns
- Contact support if persistent
Future Roadmap
Upcoming Features
- Voice input/output
- Real-time collaboration
- Custom model fine-tuning
- Plugin ecosystem
- Workflow automation
- Enhanced integrations
Beta Features
Join our beta program to test:
- Experimental models
- New capabilities
- Early access features
- Provide feedback
Advanced features make Vectly more than just a chat interface—they transform it into a powerful AI workbench for any task.