Metrics & Usage Dashboard
The Usage dashboard provides real-time visibility into your API consumption, performance metrics, and spending.
Overview
To access the dashboard, navigate to the Usage tab in your main dashboard.
Time Range Selector
At the top right, you can filter all charts by a specific time period:
- 24H: Last 24 hours (Day)
- 7D: Last 7 days (Week)
- 30D: Last 30 days (Month)
Available Charts
1. Total Spend
This chart displays your cumulative cost over the selected period.
- Purpose: Track budget burn rate and identify cost spikes.
- Data: Aggregated cost in USD across all models.
2. Requests & Tokens
You can toggle this chart between two views to analyze volume:
- Requests: The total number of API calls made.
- Tokens: The total number of input (prompt) and output (completion) tokens processed.
- Model Filtering: Use the dropdown to filter these metrics by a specific model (e.g., "Only show Mistral Large usage").
3. Latency Metrics
Monitor the speed and responsiveness of your requests using two key metrics:
- TTFT (Time to First Token): Measures how quickly the model starts generating a response. This is critical for perceived latency in chat applications.
- Total Time: Measures the total duration of the request from start to finish.
Filtering by Model
The Model Dropdown allows you to isolate metrics for individual models.
- By default, charts show aggregated data for All Models.
- Select a specific model to see its individual performance.
Best Practices
- Monitor TTFT: If you are building a real-time chat app, keep an eye on TTFT spikes.
- Analyze Token Ratios: Check the Requests vs. Tokens view to understand if your average request size is increasing (which affects cost).
- Cost Management: Use the Total Spend chart in conjunction with Billing Settings to manage your credits effectively.