Advanced Features

Explore Vectly's advanced features including web search, reasoning modes, prompt caching, and more.

Last Updated: 5/27/2025

Vectly's advanced features extend the AI beyond basic chat. They provide access to current information, transparent reasoning, cost optimization, and more sophisticated interactions.

Web Search

Overview

Web search allows AI models to access current information from the internet, overcoming knowledge cutoff limitations and providing up-to-date responses.

How It Works

  1. Query Analysis: AI determines if web search would help
  2. Search Execution: Relevant searches performed automatically
  3. Result Processing: Information extracted and verified
  4. Response Integration: Findings incorporated into answer
  5. Source Citation: Links provided for verification
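The five steps above can be pictured as a small pipeline. The sketch below is illustrative only: the function names, the keyword heuristic, and the stubbed search backend are all assumptions made for this example, not a description of how Vectly implements web search.

```python
from dataclasses import dataclass

# Hypothetical keyword heuristic; a real system would likely let the model decide.
TIME_SENSITIVE_HINTS = ("today", "latest", "current", "recent")


@dataclass
class SearchResult:
    title: str
    url: str
    snippet: str


def needs_web_search(query: str) -> bool:
    """Step 1: decide whether the query asks for time-sensitive information."""
    lowered = query.lower()
    return any(hint in lowered for hint in TIME_SENSITIVE_HINTS)


def run_search(query: str) -> list[SearchResult]:
    """Step 2: stand-in for a real search backend (returns canned data)."""
    return [SearchResult("Example headline", "https://example.com/news", "Example snippet.")]


def answer(query: str) -> str:
    """Steps 3-5: fold the snippets into the answer and cite the sources."""
    if not needs_web_search(query):
        return "Answered from model knowledge."
    results = run_search(query)
    findings = " ".join(r.snippet for r in results)
    sources = ", ".join(r.url for r in results)
    return f"{findings} Sources: {sources}"


print(answer("Latest updates on AI regulations"))
```

The point of the sketch is the ordering: the decision to search comes first, and source links travel with the findings so they can be checked.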

Enabling Web Search

Per Message

  • Click the search icon in the input area
  • Toggle it on for the current message
  • Search is then used automatically when relevant

Per Chat

  • Enable in chat settings
  • Applies to all messages
  • Can be overridden per message

Use Cases

Current Events

"What happened in tech news today?"
"Latest updates on AI regulations"
"Current stock market status"

Fact Checking

"Verify the latest statistics on climate change"
"What's the current population of Tokyo?"
"Latest COVID-19 guidelines"

Product Research

"Compare the latest iPhone models"
"Best laptops for programming in 2024"
"Current pricing for cloud services"

Best Practices

  • Be specific about the timeframe you need
  • Ask explicitly for recent or current information
  • Verify critical information
  • Check source reliability
  • Note that web search adds 2 credits
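To see how the 2-credit search cost adds up over a conversation, here is a small worked example. It assumes the 2 credits are charged once per message that uses search, and it uses a made-up base cost of 1 credit per message; neither assumption is confirmed by this page.

```python
WEB_SEARCH_SURCHARGE = 2  # credits, from the note above


def message_cost(base_credits: int, web_search: bool) -> int:
    """Credits for one message; base_credits is an assumed input, not a Vectly price."""
    return base_credits + (WEB_SEARCH_SURCHARGE if web_search else 0)


# Ten messages at an assumed 1 credit each, the first three with web search enabled.
total = sum(message_cost(1, web_search=(i < 3)) for i in range(10))
print(total)  # 16
```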

Reasoning Mode

What is Reasoning Mode?

Reasoning mode is a set of advanced problem-solving features that show or enhance the AI's thinking process. It is available for specific models.

Types of Reasoning

Visible Reasoning (Claude)

  • See AI's complete thought process
  • Understand decision making
  • Follow logical steps
  • Identify assumptions
  • Build trust through transparency

Enhanced Reasoning (O1 Models)

  • Advanced problem-solving
  • Multi-step thinking
  • Self-correction
  • Complex analysis
  • Hidden by default

Enabling Reasoning

Claude Models

  1. Click the reasoning icon in chat
  2. Enable "Show thinking process"
  3. Thoughts appear in special blocks
  4. Thoughts don't count toward output

O1 Models

  1. Select an O1 model
  2. Choose a reasoning effort level
  3. Higher effort gives more thorough reasoning
  4. Higher effort costs more credits

Use Cases

Complex Problem Solving

"Design a scalable microservices architecture"
"Solve this algorithmic challenge"
"Debug this complex issue"

Analysis and Research

"Analyze these competing theories"
"Evaluate business strategies"
"Compare technical approaches"

Learning and Understanding

"Explain quantum computing step by step"
"Break down this mathematical proof"
"Trace through this algorithm"

Reasoning Best Practices

  • Use for complex tasks only
  • Review reasoning for insights
  • Higher effort for critical tasks
  • Balance cost vs. benefit
  • Save reasoning for reference

Prompt Caching

Overview

Prompt caching reduces costs by storing and reusing common prompt prefixes. It is particularly beneficial for long contexts that are used repeatedly.

How It Works

  1. Automatic Detection: System identifies repeated patterns
  2. Cache Creation: Common prefixes stored
  3. Cache Hits: Reused prompts cost less
  4. Smart Invalidation: Updates when context changes
  5. Transparent Savings: See cache savings in real-time
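The cycle of cache creation, cache hits, and invalidation can be illustrated with a toy prefix cache. This is a minimal sketch under stated assumptions: the `PrefixCache` class and its hashing scheme are hypothetical and only show the idea, not how Vectly or any model provider stores cached prompts.

```python
import hashlib


class PrefixCache:
    """Toy prefix cache: remembers which prompt prefixes have been seen."""

    def __init__(self) -> None:
        self._seen: set[str] = set()
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(prefix: str) -> str:
        # Hash the prefix so the cache key has a fixed size.
        return hashlib.sha256(prefix.encode("utf-8")).hexdigest()

    def lookup(self, prefix: str) -> bool:
        """Return True on a cache hit; store the prefix on a miss."""
        key = self._key(prefix)
        if key in self._seen:
            self.hits += 1
            return True
        self._seen.add(key)
        self.misses += 1
        return False


cache = PrefixCache()
system_prompt = "You are an expert Python developer."
first = cache.lookup(system_prompt)          # miss: the prefix is stored
second = cache.lookup(system_prompt)         # hit: the same prefix is reused
third = cache.lookup(system_prompt + " v2")  # miss: a changed prefix gets a new key
print(first, second, third, cache.hits, cache.misses)
```

Because the key is derived from the exact prefix text, any edit to the shared context produces a miss, which is one simple way to picture invalidation when context changes.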

Benefits

  • Cost Reduction: Up to 90% savings on cached portions
  • Faster Responses: Cached content processes quicker
  • Consistent Context: Same context across messages
  • Automatic Management: No manual configuration needed
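A short worked example shows what "up to 90% savings on cached portions" means for a whole message. The token counts and the price below are invented for illustration, and the 90% discount is the best case quoted above, so actual savings may be lower.

```python
def cost_with_cache(cached_tokens: int, fresh_tokens: int,
                    price_per_token: float, cache_discount: float = 0.90) -> float:
    """Cost when cached tokens are billed at (1 - cache_discount) of the normal price."""
    cached_cost = cached_tokens * price_per_token * (1 - cache_discount)
    fresh_cost = fresh_tokens * price_per_token
    return cached_cost + fresh_cost


# Assumed numbers: a 10,000-token cached document, 500 new tokens,
# and a price of 1 unit per 1,000 tokens.
price = 1 / 1000
without_cache = (10_000 + 500) * price             # 10.5 units
with_cache = cost_with_cache(10_000, 500, price)   # about 1.5 units
savings = 1 - with_cache / without_cache           # about 0.86
print(f"{savings:.0%}")
```

The overall saving is below 90% because only the cached portion is discounted; the new tokens in each message are billed at the normal rate.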

Supported Models

  • Claude Opus 4 ✅
  • Claude Sonnet 4 ✅
  • Other models coming soon

Optimization Strategies

System Prompts

"You are an expert Python developer. 
Follow PEP 8 standards.
Use type hints.
Write comprehensive docstrings."

Reuse across conversations for consistency and savings.

Document Context

  • Upload documents to project
  • Reference in multiple chats
  • Automatic caching applies
  • Significant savings

Code Context

  • Connect GitHub repositories
  • Frequently accessed files cached
  • Cross-chat benefits
  • Version-aware caching

Monitoring Cache Performance

  • See cache indicators in chat
  • View savings per message
  • Track total cache benefits
  • Optimize based on patterns

Multi-Modal Capabilities

Image Understanding (GPT-4V)

Supported Operations

  • Image analysis and description
  • Text extraction (OCR)
  • Diagram understanding
  • Chart and graph reading
  • Visual question answering

How to Use

  1. Upload image with message
  2. Ask questions about image
  3. AI analyzes and responds
  4. Combine with text context

Use Cases

"What's in this screenshot?"
"Extract text from this image"
"Explain this diagram"
"Analyze this chart's trends"

Coming Soon: Image Generation

  • Text-to-image creation
  • Style control
  • Variations and editing
  • Integration with chat context

Function Calling

Overview

Function calling allows AI to interact with external tools and APIs, enabling actions beyond text generation.

Current Capabilities

  • Structured data extraction
  • API request formatting
  • Tool use planning
  • Parameter validation
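The capabilities above can be sketched as a tool request that is validated before it runs. Everything here is hypothetical: the request shape (`name` plus `arguments`), the `extract_emails` tool, and the `call_tool` dispatcher are assumptions for illustration, not Vectly's function-calling format.

```python
import json
import re

# Deliberately simple pattern for illustration; it does not cover every valid address.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text: str) -> list[str]:
    """Hypothetical tool: return unique email addresses in order of appearance."""
    return list(dict.fromkeys(EMAIL_PATTERN.findall(text)))


def call_tool(request_json: str) -> str:
    """Validate a tool request of the assumed shape, run it, and return JSON."""
    request = json.loads(request_json)
    if request.get("name") != "extract_emails":
        raise ValueError(f"unknown tool: {request.get('name')!r}")
    text = request.get("arguments", {}).get("text")
    if not isinstance(text, str):
        raise ValueError("'text' argument must be a string")
    return json.dumps({"emails": extract_emails(text)})


request = json.dumps({
    "name": "extract_emails",
    "arguments": {"text": "Contact ana@example.com or bob@example.org, cc ana@example.com."},
})
print(call_tool(request))
```

Validating the arguments before dispatch keeps malformed requests from reaching the tool, and returning JSON gives the caller structured data instead of free text.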

Use Cases

"Extract all email addresses from this text"
"Format this data as JSON"
"Create API request for user creation"
"Parse this log file for errors"

Coming Soon

  • Direct API integration
  • Custom function definitions
  • Workflow automation
  • Third-party tool connections

Context Management

Smart Context Window

Automatic Prioritization

  • Most relevant content first
  • Recent messages prioritized
  • Important context preserved
  • Automatic truncation

Manual Control

  • Pin important messages
  • Clear unnecessary context
  • Reorder information
  • Optimize token usage
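One way to picture how pinning and recency interact with truncation is the sketch below. It is an assumption-laden illustration: the `Message` type, the word-count tokenizer, and the "pinned first, then newest" policy are hypothetical and not taken from Vectly's implementation.

```python
from dataclasses import dataclass


@dataclass
class Message:
    text: str
    pinned: bool = False


def count_tokens(text: str) -> int:
    """Crude stand-in for a tokenizer: one token per whitespace-separated word."""
    return len(text.split())


def fit_context(messages: list[Message], budget: int) -> list[Message]:
    """Keep pinned messages, then the newest unpinned ones that fit the budget.

    Pinned messages are always kept, even if they alone exceed the budget.
    """
    keep = [m.pinned for m in messages]
    used = sum(count_tokens(m.text) for m in messages if m.pinned)
    # Walk from newest to oldest, adding unpinned messages while they fit.
    for i in range(len(messages) - 1, -1, -1):
        if keep[i]:
            continue
        cost = count_tokens(messages[i].text)
        if used + cost > budget:
            break  # this message and everything older is truncated
        keep[i] = True
        used += cost
    return [m for m, k in zip(messages, keep) if k]


history = [
    Message("project goal is a billing service", pinned=True),
    Message("old small talk here"),
    Message("newer question about invoices"),
    Message("latest follow up"),
]
print([m.text for m in fit_context(history, budget=13)])
```

With a budget of 13 tokens, the pinned message and the two newest messages survive and the oldest unpinned message is dropped, while the original order is preserved.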

Cross-Chat Context

Project-Level Sharing

  • Context available across chats
  • Consistent information
  • Reduced duplication
  • Better organization

Global Context (Coming Soon)

  • Personal knowledge base
  • Cross-project access
  • Persistent memory
  • Advanced retrieval

Performance Features

Streaming Responses

  • Real-time text generation
  • See thoughts as they form
  • Cancel mid-response
  • Immediate feedback
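Streaming with mid-response cancellation can be modeled with a generator and a stop flag. This is a minimal sketch assuming a word-by-word stream; `stream_tokens` is a hypothetical helper, not a Vectly API.

```python
import threading
from collections.abc import Iterator


def stream_tokens(text: str, cancel: threading.Event) -> Iterator[str]:
    """Yield one word at a time, stopping early once cancel is set."""
    for word in text.split():
        if cancel.is_set():
            return
        yield word


cancel = threading.Event()
received = []
for token in stream_tokens("streaming lets you read the answer as it arrives", cancel):
    received.append(token)
    if len(received) == 3:
        cancel.set()  # simulate the user pressing Stop

print(received)  # ['streaming', 'lets', 'you']
```

The consumer sees each piece as soon as it is produced, and setting the flag ends the stream before the remaining text is generated.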

Parallel Processing

  • Multiple file analysis
  • Concurrent searches
  • Batch operations
  • Efficient workflows
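As a rough illustration of analyzing multiple files concurrently, the sketch below runs a stand-in analysis over several documents with a thread pool. The `analyze` function and the sample documents are made up for this example.

```python
from concurrent.futures import ThreadPoolExecutor


def analyze(document: str) -> dict[str, int]:
    """Stand-in for a per-file analysis step: count lines and words."""
    return {"lines": len(document.splitlines()), "words": len(document.split())}


documents = {
    "notes.txt": "first line\nsecond line",
    "todo.txt": "ship the release",
}

# Threads suit I/O-bound work such as uploads or searches.
# Executor.map returns results in the same order as its inputs.
with ThreadPoolExecutor(max_workers=4) as pool:
    reports = dict(zip(documents, pool.map(analyze, documents.values())))

print(reports)
```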

Response Control

  • Stop generation anytime
  • Continue incomplete responses
  • Regenerate with variations
  • Edit and retry

Customization Options

Model Parameters

Temperature Control (Coming Soon)

  • Adjust creativity level
  • 0 = Focused, deterministic
  • 1 = Creative, varied
  • Balance for task needs
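The effect of temperature can be shown with the temperature-scaled softmax that language models commonly use when sampling. This is a general illustration with made-up scores; how Vectly will expose the setting is not specified here.

```python
import math


def softmax_with_temperature(logits: list[float], temperature: float) -> list[float]:
    """Turn raw scores into probabilities; temperature 0 is treated as greedy (argmax)."""
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if temperature == 0:
        best = max(range(len(logits)), key=logits.__getitem__)
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]  # made-up scores for three candidate tokens
focused = softmax_with_temperature(logits, 0)    # all probability on the top token
sharp = softmax_with_temperature(logits, 0.5)    # top token dominates
varied = softmax_with_temperature(logits, 1.0)   # flatter distribution
print(focused, [round(p, 3) for p in sharp], [round(p, 3) for p in varied])
```

Lower temperatures concentrate probability on the highest-scoring token, which gives focused and repeatable output; higher temperatures spread it across alternatives, which gives more varied output.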

Response Length

  • Set maximum tokens
  • Ensure complete answers
  • Control costs
  • Prevent cutoffs

Output Formatting

Structured Responses

Request specific formats:

"Respond in JSON format"
"Use markdown tables"
"Provide bullet points"
"Include code blocks"

Language Support

  • Multi-language responses
  • Code syntax highlighting
  • Mathematical notation
  • Technical diagrams
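When a reply is requested in JSON format, as in the Structured Responses examples, the calling code still has to parse it defensively. The sketch below assumes the reply may arrive either as bare JSON or wrapped in a Markdown code fence; `parse_json_reply` is a hypothetical helper written for this example.

```python
import json

FENCE = "`" * 3  # Markdown code-fence marker


def parse_json_reply(reply: str) -> dict:
    """Parse a reply that should be a JSON object, tolerating a surrounding code fence."""
    cleaned = reply.strip()
    if cleaned.startswith(FENCE):
        # Drop the opening fence line, including any language tag after it.
        cleaned = cleaned.split("\n", 1)[1] if "\n" in cleaned else ""
        # Drop the closing fence and anything after it.
        cleaned = cleaned.rsplit(FENCE, 1)[0]
    data = json.loads(cleaned)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    return data


fenced_reply = FENCE + 'json\n{"status": "ok", "items": [1, 2]}\n' + FENCE
print(parse_json_reply(fenced_reply))
print(parse_json_reply('{"status": "ok"}'))
```

Raising an error on anything other than a JSON object makes formatting failures visible immediately, so the request can be retried or rephrased.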

Integration Features

API Access (Coming Soon)

Programmatic Usage

  • RESTful API endpoints
  • Authentication tokens
  • Rate limiting
  • Usage tracking

Webhook Support

  • Event notifications
  • Response callbacks
  • Status updates
  • Error handling

Export Capabilities

Current Features

  • Copy responses
  • Download chat history
  • Export code blocks
  • Save conversations

Coming Soon

  • Batch export
  • Format conversion
  • API integration
  • Automated backups

Best Practices

Feature Selection

  1. Match feature to task: Don't overuse expensive features
  2. Start simple: Add features as needed
  3. Monitor costs: Track feature usage
  4. Optimize combinations: Some features work well together

Cost Optimization

  • Enable caching for repeated tasks
  • Use web search selectively
  • Choose appropriate reasoning levels
  • Monitor credit consumption

Quality Optimization

  • Use reasoning for complex problems
  • Enable search for current info
  • Leverage caching for consistency
  • Combine features strategically

Troubleshooting

Feature Not Working

Web Search Issues

  • Check your internet connection
  • Verify the feature is enabled
  • Try rephrasing the query
  • Check your credit balance

Reasoning Problems

  • Ensure a compatible model is selected
  • Check effort settings
  • Verify prompt clarity
  • Watch for timeouts

Cache Misses

  • Check for context changes
  • Verify model support
  • Review usage patterns
  • Contact support if persistent

Future Roadmap

Upcoming Features

  • Voice input/output
  • Real-time collaboration
  • Custom model fine-tuning
  • Plugin ecosystem
  • Workflow automation
  • Enhanced integrations

Beta Features

Join our beta program to test and provide feedback on:

  • Experimental models
  • New capabilities
  • Early access features

Advanced features make Vectly more than just a chat interface: they transform it into a powerful AI workbench for any task.