Skip to Content
Welcome to the new DatumOS documentation!

AI Models

DatumOS provides access to multiple AI models through Vercel AI Gateway, allowing you to choose the best model for each task based on capability, cost, and performance needs.

Available Models

Claude Haiku 4.5 (Default)

Provider: Anthropic Model ID: claude-haiku-4-5-20251001

Fast and cost-effective model that’s perfect for most everyday tasks.

Key Capabilities:

  • ✅ Streaming responses
  • ✅ Tool calling (search, artifacts)
  • ✅ Image input
  • ✅ Extended reasoning
  • 200,000 token context window (~150,000 words)

Best For:

  • General chat and questions
  • Document search and retrieval
  • Simple analysis tasks
  • Quick summaries
  • Most day-to-day queries

Pricing:

  • Input: $0.80 per million tokens
  • Output: $4.00 per million tokens
Default Choice

Claude Haiku 4.5 is the default model because it offers the best balance of speed, capability, and cost for typical construction project queries.

Claude Sonnet 4.5

Provider: Anthropic Model ID: claude-sonnet-4-5-20250929

Most capable model with advanced reasoning, ideal for complex tasks.

Key Capabilities:

  • ✅ Streaming responses
  • ✅ Tool calling (search, artifacts)
  • ✅ Image input
  • ✅ Extended reasoning (advanced)
  • 200,000 token context window (~150,000 words)

Best For:

  • Complex analysis and reasoning
  • Multi-step problem solving
  • Detailed technical writing
  • Code generation and debugging
  • Advanced image analysis
  • In-depth research tasks

Pricing:

  • Input: $3.00 per million tokens
  • Output: $15.00 per million tokens
When to Upgrade

Switch to Sonnet 4.5 when Haiku’s responses aren’t detailed enough or when you need deeper reasoning.

Grok Vision

Provider: XAI Model ID: grok-2-vision-1212

Advanced multimodal model with strong vision capabilities.

Key Capabilities:

  • ✅ Streaming responses
  • ✅ Tool calling (search, artifacts)
  • ✅ Image input (advanced)
  • ❌ Extended reasoning (not available)
  • 131,072 token context window (~100,000 words)

Best For:

  • Image analysis and understanding
  • Drawing and diagram interpretation
  • Photo documentation review
  • Visual inspection tasks
  • OCR and text extraction from images
Vision Tasks

Grok Vision excels at understanding visual content like construction drawings, site photos, and technical diagrams.

Switching Models

During a Conversation

Change models mid-conversation to match your needs:

  1. Click the Model dropdown in the chat header
  2. Select your desired model
  3. Continue the conversation with the new model

The new model will have access to the full conversation history.

Setting a Default Model

Choose your preferred default model:

  1. Go to Settings > AI Models
  2. Select your default model
  3. Click Save

New conversations will start with this model.

Per-Message Model Selection

For fine-grained control:

  1. Click the model selector before sending each message
  2. Choose the appropriate model
  3. Send your message

This lets you optimize cost and capability message by message.

Cost Optimization

Use Haiku for simple queries, upgrade to Sonnet only when needed. This can reduce costs by 3-4x while maintaining quality.

Model Selection Guide

Decision Tree

Is the task primarily visual (images, drawings)? ├─ YES → Grok Vision └─ NO → Continue Does it require complex reasoning or analysis? ├─ YES → Claude Sonnet 4.5 └─ NO → Claude Haiku 4.5 (default)

Task-Specific Recommendations

TaskRecommended ModelWhy
Document SearchClaude Haiku 4.5Fast, cost-effective, excellent tool use
Simple QuestionsClaude Haiku 4.5Quick responses, adequate reasoning
Complex AnalysisClaude Sonnet 4.5Advanced reasoning, deeper insights
Image AnalysisGrok VisionStrong vision capabilities
Code GenerationClaude Sonnet 4.5Better understanding of complex logic
Report WritingClaude Sonnet 4.5More sophisticated language, better structure
Quick SummariesClaude Haiku 4.5Fast, cost-effective, sufficient quality
Multi-Step ResearchClaude Sonnet 4.5Better reasoning across multiple sources

Cost Considerations

Understanding Token Usage

Tokens are pieces of text, roughly:

  • 1 token ≈ 4 characters
  • 1 token ≈ 0.75 words
  • 100 tokens ≈ 75 words

Example message:

Find all RFIs related to structural steel in the Civic Center project
  • Input: ~15 tokens
  • Typical response: ~200-500 tokens
  • Total cost: under $0.01 with Haiku

Cost Comparison

Typical query (500 input tokens, 1,000 output tokens):

ModelInput CostOutput CostTotal Cost
Claude Haiku 4.5$0.0004$0.004$0.0044
Claude Sonnet 4.5$0.0015$0.015$0.0165

Sonnet costs ~3.75x more per query.

Daily usage example (100 queries/day):

ModelDaily CostMonthly Cost
Claude Haiku 4.5$0.44~$13
Claude Sonnet 4.5$1.65~$50
Mixed (80% Haiku, 20% Sonnet)$0.68~$20
Cost Optimization Strategy

Use Haiku as your default, upgrade to Sonnet for complex tasks. This “mixed mode” gives you 90% of Sonnet’s value at 40% of the cost.

Managing Costs

Best Practices:

  1. Start with Haiku - Use it for initial exploration
  2. Upgrade strategically - Switch to Sonnet for complex tasks
  3. Monitor usage - Check Settings > Usage to track spending
  4. Set limits - Configure spending alerts in Settings
  5. Use shorter prompts - Be concise to reduce token usage
  6. Avoid redundancy - Don’t repeat information unnecessarily

Performance Characteristics

Response Times

Time to first token (95th percentile):

ModelSimple ChatWith ToolsWith Images
Claude Haiku 4.5~2s~5s~3s
Claude Sonnet 4.5~3s~6s~4s
Grok Vision~2.5s~5.5s~4s

Throughput

Tokens per second (streaming):

ModelAverage Speed
Claude Haiku 4.5~40 tokens/sec
Claude Sonnet 4.5~30 tokens/sec
Grok Vision~35 tokens/sec
Tip

All models stream responses in real-time, so you see output as it’s generated rather than waiting for the complete response.

Context Windows

What is a Context Window?

The context window is the maximum amount of text (input + output) a model can process in one conversation.

Context Window Sizes:

ModelContext WindowApproximate Words
Claude Haiku 4.5200,000 tokens~150,000 words
Claude Sonnet 4.5200,000 tokens~150,000 words
Grok Vision131,072 tokens~100,000 words

What Fits in a Context Window?

Claude models (200K tokens):

  • ~500 pages of text
  • ~20 full project specifications
  • ~100 typical chat messages
  • Entire novels with room for analysis

Grok Vision (131K tokens):

  • ~325 pages of text
  • ~13 full project specifications
  • ~65 typical chat messages

Managing Long Conversations

When approaching context limits:

  1. Start fresh - Begin a new chat for unrelated topics
  2. Be concise - Avoid repeating information unnecessarily
  3. Summarize - Ask the AI to summarize long conversations
Context Limits

When a conversation exceeds the context window, older messages are automatically dropped. Important context may be lost.

Model Capabilities Comparison

CapabilityHaiku 4.5Sonnet 4.5Grok Vision
Streaming✅ Yes✅ Yes✅ Yes
Tool calling✅ Excellent✅ Excellent✅ Good
Image input✅ Good✅ Excellent✅ Excellent
Extended reasoning✅ Good✅ Excellent❌ No
Code generation✅ Good✅ Excellent✅ Good
Writing quality✅ Good✅ Excellent✅ Good
Speed✅ Fastest⚡ Fast✅ Fast
Cost💰 Lowest💰💰 Higher💰 Medium
Context window200K200K131K

Troubleshooting

Model not available

Symptoms: Model appears grayed out or shows error

Solutions:

  1. Check your subscription plan includes the model
  2. Verify account is in good standing
  3. Try refreshing the page
  4. Check status page 

Slow responses

Symptoms: Taking longer than usual to get responses

Solutions:

  1. Switch to a faster model (Haiku)
  2. Check your internet connection
  3. Simplify your query
  4. Try during off-peak hours

Quality issues

Symptoms: Responses are not detailed enough or incorrect

Solutions:

  1. Try Sonnet - Upgrade from Haiku for complex tasks
  2. Be more specific - Provide more context in your query
  3. Break it down - Split complex questions into steps
  4. Provide examples - Show what you’re looking for

Next Steps