AI Models
DatumOS provides access to multiple AI models through Vercel AI Gateway, allowing you to choose the best model for each task based on capability, cost, and performance needs.
Available Models
Claude Haiku 4.5 (Default)
Provider: Anthropic
Model ID: claude-haiku-4-5-20251001
Fast and cost-effective model that’s perfect for most everyday tasks.
Key Capabilities:
- ✅ Streaming responses
- ✅ Tool calling (search, artifacts)
- ✅ Image input
- ✅ Extended reasoning
- 200,000 token context window (~150,000 words)
Best For:
- General chat and questions
- Document search and retrieval
- Simple analysis tasks
- Quick summaries
- Most day-to-day queries
Pricing:
- Input: $0.80 per million tokens
- Output: $4.00 per million tokens
Claude Haiku 4.5 is the default model because it offers the best balance of speed, capability, and cost for typical construction project queries.
Claude Sonnet 4.5
Provider: Anthropic
Model ID: claude-sonnet-4-5-20250929
Most capable model with advanced reasoning, ideal for complex tasks.
Key Capabilities:
- ✅ Streaming responses
- ✅ Tool calling (search, artifacts)
- ✅ Image input
- ✅ Extended reasoning (advanced)
- 200,000 token context window (~150,000 words)
Best For:
- Complex analysis and reasoning
- Multi-step problem solving
- Detailed technical writing
- Code generation and debugging
- Advanced image analysis
- In-depth research tasks
Pricing:
- Input: $3.00 per million tokens
- Output: $15.00 per million tokens
Switch to Sonnet 4.5 when Haiku’s responses aren’t detailed enough or when you need deeper reasoning.
Grok Vision
Provider: XAI
Model ID: grok-2-vision-1212
Advanced multimodal model with strong vision capabilities.
Key Capabilities:
- ✅ Streaming responses
- ✅ Tool calling (search, artifacts)
- ✅ Image input (advanced)
- ❌ Extended reasoning (not available)
- 131,072 token context window (~100,000 words)
Best For:
- Image analysis and understanding
- Drawing and diagram interpretation
- Photo documentation review
- Visual inspection tasks
- OCR and text extraction from images
Grok Vision excels at understanding visual content like construction drawings, site photos, and technical diagrams.
Switching Models
During a Conversation
Change models mid-conversation to match your needs:
- Click the Model dropdown in the chat header
- Select your desired model
- Continue the conversation with the new model
The new model will have access to the full conversation history.
Setting a Default Model
Choose your preferred default model:
- Go to Settings > AI Models
- Select your default model
- Click Save
New conversations will start with this model.
Per-Message Model Selection
For fine-grained control:
- Click the model selector before sending each message
- Choose the appropriate model
- Send your message
This lets you optimize cost and capability message by message.
Use Haiku for simple queries, upgrade to Sonnet only when needed. This can reduce costs by 3-4x while maintaining quality.
Model Selection Guide
Decision Tree
Is the task primarily visual (images, drawings)?
├─ YES → Grok Vision
└─ NO → Continue
Does it require complex reasoning or analysis?
├─ YES → Claude Sonnet 4.5
└─ NO → Claude Haiku 4.5 (default)Task-Specific Recommendations
| Task | Recommended Model | Why |
|---|---|---|
| Document Search | Claude Haiku 4.5 | Fast, cost-effective, excellent tool use |
| Simple Questions | Claude Haiku 4.5 | Quick responses, adequate reasoning |
| Complex Analysis | Claude Sonnet 4.5 | Advanced reasoning, deeper insights |
| Image Analysis | Grok Vision | Strong vision capabilities |
| Code Generation | Claude Sonnet 4.5 | Better understanding of complex logic |
| Report Writing | Claude Sonnet 4.5 | More sophisticated language, better structure |
| Quick Summaries | Claude Haiku 4.5 | Fast, cost-effective, sufficient quality |
| Multi-Step Research | Claude Sonnet 4.5 | Better reasoning across multiple sources |
Cost Considerations
Understanding Token Usage
Tokens are pieces of text, roughly:
- 1 token ≈ 4 characters
- 1 token ≈ 0.75 words
- 100 tokens ≈ 75 words
Example message:
Find all RFIs related to structural steel in the Civic Center project- Input: ~15 tokens
- Typical response: ~200-500 tokens
- Total cost: under $0.01 with Haiku
Cost Comparison
Typical query (500 input tokens, 1,000 output tokens):
| Model | Input Cost | Output Cost | Total Cost |
|---|---|---|---|
| Claude Haiku 4.5 | $0.0004 | $0.004 | $0.0044 |
| Claude Sonnet 4.5 | $0.0015 | $0.015 | $0.0165 |
Sonnet costs ~3.75x more per query.
Daily usage example (100 queries/day):
| Model | Daily Cost | Monthly Cost |
|---|---|---|
| Claude Haiku 4.5 | $0.44 | ~$13 |
| Claude Sonnet 4.5 | $1.65 | ~$50 |
| Mixed (80% Haiku, 20% Sonnet) | $0.68 | ~$20 |
Use Haiku as your default, upgrade to Sonnet for complex tasks. This “mixed mode” gives you 90% of Sonnet’s value at 40% of the cost.
Managing Costs
Best Practices:
- Start with Haiku - Use it for initial exploration
- Upgrade strategically - Switch to Sonnet for complex tasks
- Monitor usage - Check Settings > Usage to track spending
- Set limits - Configure spending alerts in Settings
- Use shorter prompts - Be concise to reduce token usage
- Avoid redundancy - Don’t repeat information unnecessarily
Performance Characteristics
Response Times
Time to first token (95th percentile):
| Model | Simple Chat | With Tools | With Images |
|---|---|---|---|
| Claude Haiku 4.5 | ~2s | ~5s | ~3s |
| Claude Sonnet 4.5 | ~3s | ~6s | ~4s |
| Grok Vision | ~2.5s | ~5.5s | ~4s |
Throughput
Tokens per second (streaming):
| Model | Average Speed |
|---|---|
| Claude Haiku 4.5 | ~40 tokens/sec |
| Claude Sonnet 4.5 | ~30 tokens/sec |
| Grok Vision | ~35 tokens/sec |
All models stream responses in real-time, so you see output as it’s generated rather than waiting for the complete response.
Context Windows
What is a Context Window?
The context window is the maximum amount of text (input + output) a model can process in one conversation.
Context Window Sizes:
| Model | Context Window | Approximate Words |
|---|---|---|
| Claude Haiku 4.5 | 200,000 tokens | ~150,000 words |
| Claude Sonnet 4.5 | 200,000 tokens | ~150,000 words |
| Grok Vision | 131,072 tokens | ~100,000 words |
What Fits in a Context Window?
Claude models (200K tokens):
- ~500 pages of text
- ~20 full project specifications
- ~100 typical chat messages
- Entire novels with room for analysis
Grok Vision (131K tokens):
- ~325 pages of text
- ~13 full project specifications
- ~65 typical chat messages
Managing Long Conversations
When approaching context limits:
- Start fresh - Begin a new chat for unrelated topics
- Be concise - Avoid repeating information unnecessarily
- Summarize - Ask the AI to summarize long conversations
When a conversation exceeds the context window, older messages are automatically dropped. Important context may be lost.
Model Capabilities Comparison
| Capability | Haiku 4.5 | Sonnet 4.5 | Grok Vision |
|---|---|---|---|
| Streaming | ✅ Yes | ✅ Yes | ✅ Yes |
| Tool calling | ✅ Excellent | ✅ Excellent | ✅ Good |
| Image input | ✅ Good | ✅ Excellent | ✅ Excellent |
| Extended reasoning | ✅ Good | ✅ Excellent | ❌ No |
| Code generation | ✅ Good | ✅ Excellent | ✅ Good |
| Writing quality | ✅ Good | ✅ Excellent | ✅ Good |
| Speed | ✅ Fastest | ⚡ Fast | ✅ Fast |
| Cost | 💰 Lowest | 💰💰 Higher | 💰 Medium |
| Context window | 200K | 200K | 131K |
Troubleshooting
Model not available
Symptoms: Model appears grayed out or shows error
Solutions:
- Check your subscription plan includes the model
- Verify account is in good standing
- Try refreshing the page
- Check status page
Slow responses
Symptoms: Taking longer than usual to get responses
Solutions:
- Switch to a faster model (Haiku)
- Check your internet connection
- Simplify your query
- Try during off-peak hours
Quality issues
Symptoms: Responses are not detailed enough or incorrect
Solutions:
- Try Sonnet - Upgrade from Haiku for complex tasks
- Be more specific - Provide more context in your query
- Break it down - Split complex questions into steps
- Provide examples - Show what you’re looking for
Next Steps
Was this page helpful?