Advanced Features

Explore Vectly's advanced features including web search, reasoning modes, prompt caching, and more.

Last Updated: 5/27/2025

Vectly's advanced features extend AI capabilities beyond basic chat. They provide access to current information, transparent reasoning, cost optimization, and more sophisticated interactions.

Web Search

Overview

Web search allows AI models to access current information from the internet, overcoming knowledge cutoff limitations and providing up-to-date responses.

How It Works

Query Analysis: AI determines if web search would help
Search Execution: Relevant searches performed automatically
Result Processing: Information extracted and verified
Response Integration: Findings incorporated into answer
Source Citation: Links provided for verification

Enabling Web Search

Per Message

Click the search icon in the input area
Toggle it on for the current message
Search then runs automatically when the AI judges it relevant

Per Chat

Enable in chat settings
Applies to all messages
Can be overridden per message

Use Cases

Current Events

"What happened in tech news today?"
"Latest updates on AI regulations"
"Current stock market status"

Fact Checking

"Verify the latest statistics on climate change"
"What's the current population of Tokyo?"
"Latest COVID-19 guidelines"

Product Research

"Compare the latest iPhone models"
"Best laptops for programming in 2024"
"Current pricing for cloud services"

Best Practices

Be specific about timeframe needed
Ask for recent/current information
Verify critical information
Check source reliability
Note that web search adds 2 credits
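
As a rough illustration of the credit note above, the sketch below estimates the cost of a batch of messages when only some of them use web search. The 2-credit surcharge comes from this page. The per-message base cost (`base_credits`) is a hypothetical value, and the sketch assumes the surcharge applies once per message that uses search.

```python
SEARCH_SURCHARGE = 2  # credits added by web search, as stated on this page


def estimate_credits(messages: int, searched: int, base_credits: int) -> int:
    """Estimate total credits for `messages` messages, `searched` of which use web search.

    `base_credits` is a hypothetical per-message cost; the real rate depends on the model.
    """
    if not 0 <= searched <= messages:
        raise ValueError("searched must be between 0 and messages")
    return messages * base_credits + searched * SEARCH_SURCHARGE


# 10 messages at an assumed 1 credit each, 3 of them with search enabled
print(estimate_credits(messages=10, searched=3, base_credits=1))  # 16
```

Turning search on only for the messages that need current information keeps the surcharge proportional to actual use.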

Reasoning Mode

What is Reasoning Mode?

Reasoning mode is a set of advanced problem-solving features that show or enhance the AI's thinking process. It is available only for specific models.

Types of Reasoning

Visible Reasoning (Claude)

See AI's complete thought process
Understand decision making
Follow logical steps
Identify assumptions
Build trust through transparency

Enhanced Reasoning (o1 Models)

Advanced problem-solving
Multi-step thinking
Self-correction
Complex analysis
Hidden by default

Enabling Reasoning

Claude Models

Click the reasoning icon in the chat
Enable "Show thinking process"
See the AI's thoughts in special blocks
Thoughts don't count toward output

o1 Models

Select an o1 model
Choose a reasoning effort level
Higher effort generally produces more thorough reasoning
Higher effort costs more credits

Use Cases

Complex Problem Solving

"Design a scalable microservices architecture"
"Solve this algorithmic challenge"
"Debug this complex issue"

Analysis and Research

"Analyze these competing theories"
"Evaluate business strategies"
"Compare technical approaches"

Learning and Understanding

"Explain quantum computing step by step"
"Break down this mathematical proof"
"Trace through this algorithm"

Reasoning Best Practices

Use for complex tasks only
Review reasoning for insights
Higher effort for critical tasks
Balance cost vs. benefit
Save reasoning for reference

Prompt Caching

Overview

Prompt caching reduces costs by storing and reusing common prompt prefixes, particularly beneficial for long contexts used repeatedly.

How It Works

Automatic Detection: System identifies repeated patterns
Cache Creation: Common prefixes stored
Cache Hits: Reused prompts cost less
Smart Invalidation: Updates when context changes
Transparent Savings: See cache savings in real-time
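
The steps above can be pictured with a toy prefix cache. This is a minimal sketch of the general technique, not Vectly's implementation: the `PrefixCache` class and its hash-based lookup are assumptions made for illustration.

```python
import hashlib


class PrefixCache:
    """Toy prompt-prefix cache keyed by a hash of the prefix text (illustrative only)."""

    def __init__(self) -> None:
        self._seen: set[str] = set()

    def lookup(self, prefix: str) -> bool:
        """Return True on a cache hit; on a miss, store the prefix and return False."""
        key = hashlib.sha256(prefix.encode("utf-8")).hexdigest()
        if key in self._seen:
            return True
        self._seen.add(key)
        return False


cache = PrefixCache()
system_prompt = "You are an expert Python developer. Follow PEP 8 standards."

print(cache.lookup(system_prompt))  # False: first use creates the cache entry
print(cache.lookup(system_prompt))  # True: the identical prefix is reused
print(cache.lookup(system_prompt + " Use type hints."))  # False: a changed prefix misses
```

The last call shows why stable prefixes matter: any edit to the shared context produces a different key, so the cached entry no longer applies.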

Benefits

Cost Reduction: Up to 90% savings on cached portions
Faster Responses: Cached content is processed more quickly
Consistent Context: Same context across messages
Automatic Management: No manual configuration needed
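
To make the savings figure concrete, here is a small worked calculation. It assumes the best case quoted above (a 90% discount on cached tokens) and expresses cost in "uncached-token equivalents"; the function name and the token counts are hypothetical.

```python
def effective_cost(prompt_tokens: int, cached_tokens: int, discount: float = 0.9) -> float:
    """Return the prompt cost in uncached-token equivalents.

    Cached tokens are charged at (1 - discount) of the normal rate.
    The 0.9 default is the "up to 90%" best case, not a guaranteed rate.
    """
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    uncached = prompt_tokens - cached_tokens
    return uncached + cached_tokens * (1 - discount)


# A 10,000-token prompt where 8,000 tokens are a cached prefix
cost = effective_cost(prompt_tokens=10_000, cached_tokens=8_000)
print(round(cost))  # 2800
```

In this example the whole prompt costs about 72% less, not 90%, because the discount applies only to the cached portion.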

Supported Models

Claude Opus 4 ✅
Claude Sonnet 4 ✅
Other models coming soon

Optimization Strategies

System Prompts

"You are an expert Python developer. 
Follow PEP 8 standards.
Use type hints.
Write comprehensive docstrings."

Reuse across conversations for consistency and savings.

Document Context

Upload documents to project
Reference in multiple chats
Automatic caching applies
Significant savings

Code Context

Connect GitHub repositories
Frequently accessed files cached
Cross-chat benefits
Version-aware caching

Monitoring Cache Performance

See cache indicators in chat
View savings per message
Track total cache benefits
Optimize based on patterns

Image Understanding (GPT-4V)

Supported Operations

Image analysis and description
Text extraction (OCR)
Diagram understanding
Chart and graph reading
Visual question answering

How to Use

Upload image with message
Ask questions about image
AI analyzes and responds
Combine with text context

Use Cases

"What's in this screenshot?"
"Extract text from this image"
"Explain this diagram"
"Analyze this chart's trends"

Coming Soon: Image Generation

Text-to-image creation
Style control
Variations and editing
Integration with chat context

Function Calling

Overview

Function calling allows AI to interact with external tools and APIs, enabling actions beyond text generation.

Current Capabilities

Structured data extraction
API request formatting
Tool use planning
Parameter validation
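
The sketch below illustrates what parameter validation can look like once a model has produced arguments for a tool call. The `create_user` tool, its fields, and the `validate_call` helper are all hypothetical; custom function definitions are not yet available in Vectly, so this only shows the general idea.

```python
import json

# Hypothetical tool definition; the name and fields are illustrative only.
CREATE_USER_TOOL = {
    "name": "create_user",
    "parameters": {
        "email": {"type": str, "required": True},
        "name": {"type": str, "required": True},
        "age": {"type": int, "required": False},
    },
}


def validate_call(tool: dict, raw_arguments: str) -> dict:
    """Parse model-produced JSON arguments and check them against the tool definition."""
    args = json.loads(raw_arguments)
    if not isinstance(args, dict):
        raise ValueError("arguments must be a JSON object")
    spec = tool["parameters"]
    unknown = set(args) - set(spec)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    for name, rule in spec.items():
        if name not in args:
            if rule["required"]:
                raise ValueError(f"missing required parameter: {name}")
            continue
        if not isinstance(args[name], rule["type"]):
            raise ValueError(f"{name} must be of type {rule['type'].__name__}")
    return args


# Arguments as a model might emit them for a user-creation request
call = validate_call(CREATE_USER_TOOL, '{"email": "ada@example.com", "name": "Ada"}')
print(call)  # {'email': 'ada@example.com', 'name': 'Ada'}
```

Validating before acting means a malformed or incomplete call is rejected with a clear error instead of reaching an external API.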

Use Cases

"Extract all email addresses from this text"
"Format this data as JSON"
"Create API request for user creation"
"Parse this log file for errors"

Coming Soon

Direct API integration
Custom function definitions
Workflow automation
Third-party tool connections

Context Management

Smart Context Window

Automatic Prioritization

Most relevant content first
Recent messages prioritized
Important context preserved
Automatic truncation

Manual Control

Pin important messages
Clear unnecessary context
Reorder information
Optimize token usage
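
One way to picture how prioritization, pinning, and truncation interact is the sketch below. It is an assumption-laden illustration, not Vectly's algorithm: the `Message` type, the `fit_context` function, and the token counts are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Message:
    text: str
    tokens: int
    pinned: bool = False


def fit_context(messages: list[Message], budget: int) -> list[Message]:
    """Keep pinned messages, then the most recent others, within a token budget.

    Pinned messages are always kept, even if they alone exceed the budget.
    """
    keep = {i for i, m in enumerate(messages) if m.pinned}
    used = sum(messages[i].tokens for i in keep)
    # Walk backwards so the most recent messages are considered first.
    for i in range(len(messages) - 1, -1, -1):
        if i in keep:
            continue
        if used + messages[i].tokens > budget:
            break  # this message and everything older is truncated
        keep.add(i)
        used += messages[i].tokens
    return [messages[i] for i in sorted(keep)]  # restore chronological order


history = [
    Message("Project uses Python 3.12", tokens=50, pinned=True),
    Message("Old small talk", tokens=400),
    Message("Earlier question", tokens=300),
    Message("Latest question", tokens=200),
]
print([m.text for m in fit_context(history, budget=600)])
# ['Project uses Python 3.12', 'Earlier question', 'Latest question']
```

The pinned message survives even though it is the oldest, while the unpinned message that would overflow the budget is dropped.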

Cross-Chat Context

Context available across chats
Consistent information
Reduced duplication
Better organization

Global Context (Coming Soon)

Personal knowledge base
Cross-project access
Persistent memory
Advanced retrieval

Performance Features

Streaming Responses

Real-time text generation
See thoughts as they form
Cancel mid-response
Immediate feedback
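
Streaming with mid-response cancellation can be sketched with a generator that checks a stop flag between tokens. The `stream_tokens` function and the hard-coded token list are stand-ins for a real model stream and are purely illustrative.

```python
import threading
from collections.abc import Iterator


def stream_tokens(tokens: list[str], cancel: threading.Event) -> Iterator[str]:
    """Yield tokens one at a time until the list is exhausted or `cancel` is set."""
    for token in tokens:
        if cancel.is_set():
            return
        yield token


cancel = threading.Event()
received = []
for token in stream_tokens(["Stream", "ing ", "lets ", "you ", "stop ", "early"], cancel):
    received.append(token)  # a UI would render each token as it arrives
    if len(received) == 3:
        cancel.set()  # simulates the user stopping generation

print(received)  # ['Stream', 'ing ', 'lets ']
```

Because the flag is checked before every token, text already delivered stays visible and nothing further is produced after the stop request.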

Parallel Processing

Multiple file analysis
Concurrent searches
Batch operations
Efficient workflows
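
The benefit of concurrent work can be shown with a short `asyncio` sketch. The `analyze_file` coroutine is a hypothetical stand-in for a per-file analysis request; the sleep simulates latency.

```python
import asyncio


async def analyze_file(name: str) -> str:
    """Stand-in for a per-file analysis request (hypothetical)."""
    await asyncio.sleep(0.01)  # simulates network or model latency
    return f"{name}: analyzed"


async def analyze_all(names: list[str]) -> list[str]:
    # gather runs the coroutines concurrently and returns results in input order
    return await asyncio.gather(*(analyze_file(n) for n in names))


results = asyncio.run(analyze_all(["a.py", "b.py", "c.py"]))
print(results)  # ['a.py: analyzed', 'b.py: analyzed', 'c.py: analyzed']
```

The three simulated requests overlap, so the total wait is close to one request's latency and not the sum of all three.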

Response Control

Stop generation anytime
Continue incomplete responses
Regenerate with variations
Edit and retry

Customization Options

Model Parameters

Temperature Control (Coming Soon)

Adjust creativity level
0 = Focused, deterministic
1 = Creative, varied
Balance for task needs
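
The effect of temperature can be illustrated with the standard temperature-scaled softmax used in language model sampling. This is a generic sketch of that technique, and it assumes Vectly's future setting will follow the same convention; the example scores are made up.

```python
import math


def softmax_with_temperature(logits: list[float], temperature: float) -> list[float]:
    """Convert raw scores to probabilities; lower temperature sharpens the distribution."""
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if temperature == 0:
        # Limit case: all probability goes to the highest-scoring option (deterministic).
        best = logits.index(max(logits))
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.5]  # made-up scores for three candidate tokens
for t in (0, 0.5, 1.0):
    print(t, [round(p, 2) for p in softmax_with_temperature(logits, t)])
```

At temperature 0 the top candidate is always chosen; as the temperature rises, probability spreads to the other candidates and output becomes more varied.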

Response Length

Set maximum tokens
Ensure complete answers
Control costs
Prevent cutoffs

Output Formatting

Structured Responses

Request specific formats:

"Respond in JSON format"
"Use markdown tables"
"Provide bullet points"
"Include code blocks"

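When a response is requested in JSON, the consuming code still has to parse it defensively, because models often wrap JSON in a fenced code block. The sketch below shows one way to handle that; `parse_json_reply` and the sample reply are hypothetical.

```python
import json
import re

# Matches a fenced block (three backticks), optionally tagged "json", and captures its body
FENCE = re.compile(r"`{3}(?:json)?\s*(.*?)`{3}", re.DOTALL)


def parse_json_reply(reply: str) -> dict:
    """Parse a JSON object from a model reply, tolerating a surrounding code fence."""
    match = FENCE.search(reply)
    payload = match.group(1) if match else reply
    data = json.loads(payload)  # raises json.JSONDecodeError on invalid JSON
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    return data


fence = "`" * 3  # built at runtime to avoid nesting fences in this example
reply = f'Here you go:\n{fence}json\n{{"status": "ok", "items": 2}}\n{fence}'
print(parse_json_reply(reply))  # {'status': 'ok', 'items': 2}
```

A reply that contains bare JSON with no fence is parsed directly, and anything that is not a JSON object raises an error the caller can handle, for example by regenerating the response.
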
Language Support

Multi-language responses
Code syntax highlighting
Mathematical notation
Technical diagrams

Integration Features

API Access (Coming Soon)

Programmatic Usage

RESTful API endpoints
Authentication tokens
Rate limiting
Usage tracking

Webhook Support

Event notifications
Response callbacks
Status updates
Error handling

Export Capabilities

Current Features

Copy responses
Download chat history
Export code blocks
Save conversations

Coming Soon

Batch export
Format conversion
API integration
Automated backups

Best Practices

Feature Selection

Match feature to task: Don't overuse expensive features
Start simple: Add features as needed
Monitor costs: Track feature usage
Optimize combinations: Some features work well together

Cost Optimization

Enable caching for repeated tasks
Use web search selectively
Choose appropriate reasoning levels
Monitor credit consumption

Quality Optimization

Use reasoning for complex problems
Enable search for current info
Leverage caching for consistency
Combine features strategically

Troubleshooting

Feature Not Working

Web Search Issues

Check internet connection
Verify the feature is enabled
Try rephrasing query
Check credit balance

Reasoning Problems

Ensure compatible model
Check effort settings
Verify prompt clarity
Monitor timeout issues

Cache Misses

Check for context changes
Verify model support
Review usage patterns
Contact support if persistent

Future Roadmap

Upcoming Features

Voice input/output
Real-time collaboration
Custom model fine-tuning
Plugin ecosystem
Workflow automation
Enhanced integrations

Beta Features

Join our beta program to:

Test experimental models
Try new capabilities
Get early access to features
Provide feedback

Advanced features make Vectly more than just a chat interface: they turn it into a capable AI workbench for a wide range of tasks.