The model usage report is a periodic summary that breaks down your API spend by model and provider. It shows which models are costing the most, how many requests each handles, token counts, and cache hit rates. This helps you identify optimization opportunities and track spending patterns across your model portfolio.
How it works
Each evaluation cycle:
- Queries yesterday’s billing data from `billing_daily` for the target entity
- Groups by model and provider, aggregating total credits, requests, input tokens, output tokens, and cached tokens per model
- Sorts by cost descending, so the most expensive models appear first
- Calculates total spend across all models
- Sends the report if there was any usage (at least one request with non-zero spend)
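The steps above can be sketched in a few lines of Python. This is an illustration only, not the service's implementation: the row shape (dicts with `provider`, `model`, `credits`, `requests`, `input_tokens`, `output_tokens`, `cached_tokens`), the `ModelUsage` class, and the `build_report` function are all hypothetical names, and the send condition is one reading of "at least one request with non-zero spend".

```python
from dataclasses import dataclass


@dataclass
class ModelUsage:
    """Running totals for one (provider, model) pair. Hypothetical shape."""
    provider: str
    model: str
    credits: float = 0.0
    requests: int = 0
    input_tokens: int = 0
    output_tokens: int = 0
    cached_tokens: int = 0


def build_report(rows):
    """Aggregate billing rows into (ranked_usage, total_spend, should_send).

    `rows` stands in for yesterday's records from billing_daily; the real
    column names are not documented here, so these keys are assumptions.
    """
    by_model = {}
    for row in rows:
        key = (row["provider"], row["model"])
        usage = by_model.setdefault(key, ModelUsage(*key))
        usage.credits += row["credits"]
        usage.requests += row["requests"]
        usage.input_tokens += row["input_tokens"]
        usage.output_tokens += row["output_tokens"]
        usage.cached_tokens += row["cached_tokens"]

    # Most expensive models first.
    ranked = sorted(by_model.values(), key=lambda u: u.credits, reverse=True)
    total = sum(u.credits for u in ranked)
    # Assumed rule: send only if some model has requests and non-zero spend.
    should_send = any(u.requests > 0 and u.credits > 0 for u in ranked)
    return ranked, total, should_send
```

Calling `build_report([])` yields an empty ranking and `should_send` set to `False`, which matches the rule that no report goes out on a day without usage.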
Configuration
You can configure the model usage report from the Alerts page in your dashboard.

| Setting | Description | Options |
|---|---|---|
| Enabled | Toggle the report on or off | true / false |
| Scope | What entities to report on | organization, user |
Scope
- Organization — reports on model usage across your entire organization.
- User — reports on model usage across your personal API keys.
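If you mirror these settings in your own tooling, the two options can be modeled as a small validated record. The sketch below is purely illustrative: the settings are managed from the dashboard, and the `ModelUsageReportSettings` class and its field names are hypothetical, not part of any documented API.

```python
from dataclasses import dataclass

# The two scopes listed in the settings table.
VALID_SCOPES = ("organization", "user")


@dataclass(frozen=True)
class ModelUsageReportSettings:
    """Hypothetical local representation of the report settings."""
    enabled: bool
    scope: str

    def __post_init__(self):
        # Reject anything other than the documented options.
        if not isinstance(self.enabled, bool):
            raise TypeError("enabled must be True or False")
        if self.scope not in VALID_SCOPES:
            raise ValueError(f"scope must be one of {VALID_SCOPES}")


settings = ModelUsageReportSettings(enabled=True, scope="organization")
```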
Report contents
The report includes a per-model breakdown with:

| Field | Description |
|---|---|
| Provider | The AI provider (e.g., OpenAI, Anthropic) |
| Model | The specific model used (e.g., gpt-4, claude-sonnet-4-20250514) |
| Cost | Total spend for this model in dollars |
| Requests | Number of API requests |
| Input Tokens | Total input tokens consumed |
| Output Tokens | Total output tokens generated |
| Cache Rate | Percentage of tokens served from cache |
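As a rough illustration of how one row of this breakdown could be derived, the sketch below computes a cache rate and renders the fields as a table row. The page does not define the denominator for Cache Rate, so dividing cached tokens by input tokens is an assumption, and `cache_rate` and `format_row` are hypothetical helper names.

```python
def cache_rate(cached_tokens: int, input_tokens: int) -> float:
    """Cached tokens as a percentage of input tokens (assumed definition)."""
    if input_tokens <= 0:
        return 0.0  # avoid dividing by zero when a model saw no input
    return 100.0 * cached_tokens / input_tokens


def format_row(provider, model, cost, requests,
               input_tokens, output_tokens, cached_tokens):
    """Render one per-model row in the field order of the table above."""
    rate = cache_rate(cached_tokens, input_tokens)
    return (
        f"| {provider} | {model} | ${cost:,.2f} | {requests:,} | "
        f"{input_tokens:,} | {output_tokens:,} | {rate:.1f}% |"
    )


print(format_row("OpenAI", "gpt-4", 12.5, 1200, 50000, 8000, 12500))
# | OpenAI | gpt-4 | $12.50 | 1,200 | 50,000 | 8,000 | 25.0% |
```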
Cooldown
The model usage report has a 7-day cooldown (604,800 seconds). After a report is sent, no duplicate report is sent for the same entity for 7 days. This makes it effectively a weekly report.
Notifications
- Email — includes a styled table with the full model breakdown, total spend, and a link to your usage details page
- SMS — a shorter summary listing total spend and the top 3 models by cost
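Before either notification goes out, the 7-day cooldown has to have elapsed. The sketch below shows that check together with one way to assemble the shorter SMS summary. It is a sketch under assumptions: the function names, the treatment of exactly 7 days as elapsed, and the message wording are all hypothetical, and only the 604,800-second window and the "top 3 models by cost" rule come from this page.

```python
from datetime import datetime, timedelta, timezone

COOLDOWN = timedelta(seconds=604_800)  # 7 days


def cooldown_elapsed(last_sent, now):
    """True if no report was sent for this entity within the cooldown window.

    `last_sent` is None when no report has ever been sent. Treating exactly
    7 days as elapsed is an assumption.
    """
    return last_sent is None or now - last_sent >= COOLDOWN


def sms_summary(costs_by_model, top_n=3):
    """Build a short summary: total spend plus the top models by cost.

    `costs_by_model` maps model name to cost in dollars. The wording of the
    message is illustrative only.
    """
    total = sum(costs_by_model.values())
    top = sorted(costs_by_model.items(), key=lambda kv: kv[1], reverse=True)[:top_n]
    parts = ", ".join(f"{model} ${cost:.2f}" for model, cost in top)
    return f"Model usage: ${total:.2f} total. Top models: {parts}"


now = datetime(2025, 1, 8, tzinfo=timezone.utc)
last = datetime(2025, 1, 2, tzinfo=timezone.utc)
print(cooldown_elapsed(last, now))  # False: only 6 days have passed
```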