AI Models

DatumOS provides access to multiple AI models through Vercel AI Gateway, allowing you to choose the best model for each task based on capability, cost, and performance needs.

Available Models

Claude Haiku 4.5 (Default)

Provider: Anthropic Model ID: claude-haiku-4-5-20251001

Fast and cost-effective model that’s perfect for most everyday tasks.

Key Capabilities:

✅ Streaming responses
✅ Tool calling (search, artifacts)
✅ Image input
✅ Extended reasoning
200,000 token context window (~150,000 words)

Best For:

General chat and questions
Document search and retrieval
Simple analysis tasks
Quick summaries
Most day-to-day queries

Pricing:

Input: $0.80 per million tokens
Output: $4.00 per million tokens

Default Choice

Claude Haiku 4.5 is the default model because it offers the best balance of speed, capability, and cost for typical construction project queries.

Claude Sonnet 4.5

Provider: Anthropic Model ID: claude-sonnet-4-5-20250929

Most capable model with advanced reasoning, ideal for complex tasks.

Key Capabilities:

✅ Streaming responses
✅ Tool calling (search, artifacts)
✅ Image input
✅ Extended reasoning (advanced)
200,000 token context window (~150,000 words)

Best For:

Complex analysis and reasoning
Multi-step problem solving
Detailed technical writing
Code generation and debugging
Advanced image analysis
In-depth research tasks

Pricing:

Input: $3.00 per million tokens
Output: $15.00 per million tokens

When to Upgrade

Switch to Sonnet 4.5 when Haiku’s responses aren’t detailed enough or when you need deeper reasoning.

Grok Vision

Provider: XAI Model ID: grok-2-vision-1212

Advanced multimodal model with strong vision capabilities.

Key Capabilities:

✅ Streaming responses
✅ Tool calling (search, artifacts)
✅ Image input (advanced)
❌ Extended reasoning (not available)
131,072 token context window (~100,000 words)

Best For:

Image analysis and understanding
Drawing and diagram interpretation
Photo documentation review
Visual inspection tasks
OCR and text extraction from images

Vision Tasks

Grok Vision excels at understanding visual content like construction drawings, site photos, and technical diagrams.

Switching Models

During a Conversation

Change models mid-conversation to match your needs:

Click the Model dropdown in the chat header
Select your desired model
Continue the conversation with the new model

The new model will have access to the full conversation history.

Setting a Default Model

Choose your preferred default model:

Go to Settings > AI Models
Select your default model
Click Save

New conversations will start with this model.

Per-Message Model Selection

For fine-grained control:

Click the model selector before sending each message
Choose the appropriate model
Send your message

This lets you optimize cost and capability message by message.

Cost Optimization

Use Haiku for simple queries, upgrade to Sonnet only when needed. This can reduce costs by 3-4x while maintaining quality.

Model Selection Guide

Decision Tree


Is the task primarily visual (images, drawings)?
├─ YES → Grok Vision
└─ NO  → Continue

Does it require complex reasoning or analysis?
├─ YES → Claude Sonnet 4.5
└─ NO  → Claude Haiku 4.5 (default)

Task-Specific Recommendations

Task	Recommended Model	Why
Document Search	Claude Haiku 4.5	Fast, cost-effective, excellent tool use
Simple Questions	Claude Haiku 4.5	Quick responses, adequate reasoning
Complex Analysis	Claude Sonnet 4.5	Advanced reasoning, deeper insights
Image Analysis	Grok Vision	Strong vision capabilities
Code Generation	Claude Sonnet 4.5	Better understanding of complex logic
Report Writing	Claude Sonnet 4.5	More sophisticated language, better structure
Quick Summaries	Claude Haiku 4.5	Fast, cost-effective, sufficient quality
Multi-Step Research	Claude Sonnet 4.5	Better reasoning across multiple sources

Cost Considerations

Understanding Token Usage

Tokens are pieces of text, roughly:

1 token ≈ 4 characters
1 token ≈ 0.75 words
100 tokens ≈ 75 words

Example message:


Find all RFIs related to structural steel in the Civic Center project

Input: ~15 tokens
Typical response: ~200-500 tokens
Total cost: under $0.01 with Haiku

Cost Comparison

Typical query (500 input tokens, 1,000 output tokens):

Model	Input Cost	Output Cost	Total Cost
Claude Haiku 4.5	$0.0004	$0.004	$0.0044
Claude Sonnet 4.5	$0.0015	$0.015	$0.0165

Sonnet costs ~3.75x more per query.

Daily usage example (100 queries/day):

Model	Daily Cost	Monthly Cost
Claude Haiku 4.5	$0.44	~$13
Claude Sonnet 4.5	$1.65	~$50
Mixed (80% Haiku, 20% Sonnet)	$0.68	~$20

Cost Optimization Strategy

Use Haiku as your default, upgrade to Sonnet for complex tasks. This “mixed mode” gives you 90% of Sonnet’s value at 40% of the cost.

Managing Costs

Best Practices:

Start with Haiku - Use it for initial exploration
Upgrade strategically - Switch to Sonnet for complex tasks
Monitor usage - Check Settings > Usage to track spending
Set limits - Configure spending alerts in Settings
Use shorter prompts - Be concise to reduce token usage
Avoid redundancy - Don’t repeat information unnecessarily

Performance Characteristics

Response Times

Time to first token (95th percentile):

Model	Simple Chat	With Tools	With Images
Claude Haiku 4.5	~2s	~5s	~3s
Claude Sonnet 4.5	~3s	~6s	~4s
Grok Vision	~2.5s	~5.5s	~4s

Throughput

Tokens per second (streaming):

Model	Average Speed
Claude Haiku 4.5	~40 tokens/sec
Claude Sonnet 4.5	~30 tokens/sec
Grok Vision	~35 tokens/sec

Tip

All models stream responses in real-time, so you see output as it’s generated rather than waiting for the complete response.

Context Windows

What is a Context Window?

The context window is the maximum amount of text (input + output) a model can process in one conversation.

Context Window Sizes:

Model	Context Window	Approximate Words
Claude Haiku 4.5	200,000 tokens	~150,000 words
Claude Sonnet 4.5	200,000 tokens	~150,000 words
Grok Vision	131,072 tokens	~100,000 words

What Fits in a Context Window?

Claude models (200K tokens):

~500 pages of text
~20 full project specifications
~100 typical chat messages
Entire novels with room for analysis

Grok Vision (131K tokens):

~325 pages of text
~13 full project specifications
~65 typical chat messages

Managing Long Conversations

When approaching context limits:

Start fresh - Begin a new chat for unrelated topics
Be concise - Avoid repeating information unnecessarily
Summarize - Ask the AI to summarize long conversations

Context Limits

When a conversation exceeds the context window, older messages are automatically dropped. Important context may be lost.

Model Capabilities Comparison

Capability	Haiku 4.5	Sonnet 4.5	Grok Vision
Streaming	✅ Yes	✅ Yes	✅ Yes
Tool calling	✅ Excellent	✅ Excellent	✅ Good
Image input	✅ Good	✅ Excellent	✅ Excellent
Extended reasoning	✅ Good	✅ Excellent	❌ No
Code generation	✅ Good	✅ Excellent	✅ Good
Writing quality	✅ Good	✅ Excellent	✅ Good
Speed	✅ Fastest	⚡ Fast	✅ Fast
Cost	💰 Lowest	💰💰 Higher	💰 Medium
Context window	200K	200K	131K

Troubleshooting

Model not available

Symptoms: Model appears grayed out or shows error

Solutions:

Check your subscription plan includes the model
Verify account is in good standing
Try refreshing the page
Check status page

Slow responses

Symptoms: Taking longer than usual to get responses

Solutions:

Switch to a faster model (Haiku)
Check your internet connection
Simplify your query
Try during off-peak hours

Quality issues

Symptoms: Responses are not detailed enough or incorrect

Solutions:

Try Sonnet - Upgrade from Haiku for complex tasks
Be more specific - Provide more context in your query
Break it down - Split complex questions into steps
Provide examples - Show what you’re looking for

Next Steps

AI Chat OverviewLearn all the ways to interact with your project data Generate ArtifactsCreate documents and visualizations with the best model

Was this page helpful?