Request body
Quota check result
Create a quota
Create a new usage quota with dimension filters.
Quotas allow you to limit usage based on metrics like total tokens, cost, or event count. You can apply quotas to specific customers, models, event types, or any combination using dimension filters.
Request body
Quota created
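As a rough illustration of what a create request body might look like, the sketch below reuses the field names listed under "Update a quota" (name, metric, period, limitValue, dimensionFilters). Whether the create endpoint accepts exactly these fields, and which ones are required, is an assumption. `buildCreateQuotaBody` is a hypothetical helper, not part of the API, and the customer and model values are placeholders.

```typescript
// Field names are borrowed from the "Update a quota" request body in this reference.
// Treating name, metric, period and limitValue as required here is an assumption.
type QuotaMetric = "total_tokens" | "total_events" | "total_cost_cents";
type QuotaPeriod = "hour" | "day" | "week" | "month";

interface CreateQuotaBody {
  name: string;
  metric: QuotaMetric;
  period: QuotaPeriod;
  limitValue: number;
  dimensionFilters?: { [key: string]: unknown };
}

// Hypothetical helper: copies the input and rejects obviously invalid values
// before the body is sent anywhere.
function buildCreateQuotaBody(input: CreateQuotaBody): CreateQuotaBody {
  if (!isFinite(input.limitValue) || input.limitValue <= 0) {
    throw new RangeError("limitValue must be a positive, finite number");
  }
  if (input.name.trim() === "") {
    throw new Error("name must not be empty");
  }
  return { ...input, dimensionFilters: { ...(input.dimensionFilters ?? {}) } };
}

// Example: cap one customer's token usage on one model per month.
const quotaBody = buildCreateQuotaBody({
  name: "Acme monthly token cap",
  metric: "total_tokens",
  period: "month",
  limitValue: 1000000,
  dimensionFilters: { customer_id: "cust_123", model: "example-model" },
});

console.log(JSON.stringify(quotaBody, null, 2));
```

Combining several keys in `dimensionFilters` is how a quota would be scoped to "any combination" of customer, model, and event type as described above.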
Create a quota alert
Configure an alert for when a quota reaches a threshold. Alerts are sent to all active project webhooks.
No description provided
Request body
Alert created successfully
Get a quota
Get a specific quota by ID.
Unique identifier for the quota
Quota found
List quotas
List all quotas for the project.
Optional limit: string. Maximum number of quotas to return (1-100, default: "50").
Optional cursor: string. Pagination cursor from the previous response.
List of quotas
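The limit and cursor parameters suggest cursor-based pagination. The loop below is a minimal sketch of walking all pages. The response shape is an assumption: this reference does not show where the next cursor appears, so `items` and `nextCursor` are hypothetical names, and `fetchPage` stands in for whatever HTTP call you use.

```typescript
// Assumed page shape; `items` and `nextCursor` are hypothetical field names.
interface Page<T> {
  items: T[];
  nextCursor?: string | undefined;
}

// The caller supplies the function that actually performs the request.
type FetchPage<T> = (params: {
  limit: string;
  cursor?: string | undefined;
}) => Promise<Page<T>>;

// Follows cursors until the response no longer carries one.
async function listAll<T>(fetchPage: FetchPage<T>, limit = 50): Promise<T[]> {
  if (Math.floor(limit) !== limit || limit < 1 || limit > 100) {
    throw new RangeError("limit must be an integer between 1 and 100");
  }
  const all: T[] = [];
  let cursor: string | undefined = undefined;
  do {
    // limit is documented as a string, so it is serialized before sending.
    const page: Page<T> = await fetchPage({ limit: String(limit), cursor });
    all.push(...page.items);
    cursor = page.nextCursor;
  } while (cursor !== undefined);
  return all;
}

// In-memory stand-in for the real endpoint, using an offset as the cursor.
const fakeQuotaIds = ["q1", "q2", "q3", "q4", "q5"];
const fakeFetchPage: FetchPage<string> = async ({ limit, cursor }) => {
  const start = cursor ? Number(cursor) : 0;
  const end = start + Number(limit);
  return {
    items: fakeQuotaIds.slice(start, end),
    nextCursor: end < fakeQuotaIds.length ? String(end) : undefined,
  };
};

listAll(fakeFetchPage, 2).then((ids) => console.log(ids.join(",")));
// prints "q1,q2,q3,q4,q5"
```

Passing the fetch function in as a parameter keeps the loop independent of any particular HTTP client, and the same helper would work for the quota alerts listing, which documents the same two parameters.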
List quota alerts
Get all alerts configured for a quota.
No description provided
Optional limit: string. Maximum number of alerts to return (1-100, default: "50").
Optional cursor: string. Pagination cursor from the previous response.
List of alerts
Update a quota
Update an existing quota.
Unique identifier for the quota
Request body
Optional dimensionFilters?: { [key: string]: unknown }
Optional limitValue?: number
Optional metric?: "total_tokens" | "total_events" | "total_cost_cents"
Optional name?: string
Optional period?: "hour" | "day" | "week" | "month"
Quota updated
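Since every field in the update body is optional, a partial update can carry only the fields that change. The sketch below mirrors the listed types in a TypeScript interface and adds a hypothetical runtime check, `validateUpdateQuotaBody`, for input that arrives untyped (for example from a form). The check is illustrative and is not the server's own validation.

```typescript
const QUOTA_METRICS = ["total_tokens", "total_events", "total_cost_cents"] as const;
const QUOTA_PERIODS = ["hour", "day", "week", "month"] as const;

// Mirrors the optional fields listed for the update request body.
interface UpdateQuotaBody {
  dimensionFilters?: { [key: string]: unknown };
  limitValue?: number;
  metric?: (typeof QUOTA_METRICS)[number];
  name?: string;
  period?: (typeof QUOTA_PERIODS)[number];
}

// Hypothetical helper: returns a list of problems; an empty list means the
// input is consistent with the documented field types.
function validateUpdateQuotaBody(input: { [key: string]: unknown }): string[] {
  const problems: string[] = [];
  const filters = input["dimensionFilters"];
  const limitValue = input["limitValue"];
  const metric = input["metric"];
  const quotaName = input["name"];
  const period = input["period"];

  if (filters !== undefined &&
      (typeof filters !== "object" || filters === null || Array.isArray(filters))) {
    problems.push("dimensionFilters must be an object");
  }
  if (limitValue !== undefined &&
      (typeof limitValue !== "number" || !isFinite(limitValue))) {
    problems.push("limitValue must be a finite number");
  }
  if (metric !== undefined &&
      (QUOTA_METRICS as readonly unknown[]).indexOf(metric) === -1) {
    problems.push("metric must be one of: " + QUOTA_METRICS.join(", "));
  }
  if (quotaName !== undefined && typeof quotaName !== "string") {
    problems.push("name must be a string");
  }
  if (period !== undefined &&
      (QUOTA_PERIODS as readonly unknown[]).indexOf(period) === -1) {
    problems.push("period must be one of: " + QUOTA_PERIODS.join(", "));
  }
  return problems;
}

// A partial update that only raises the limit and switches to a daily period.
const quotaPatch: UpdateQuotaBody = { limitValue: 500, period: "day" };
console.log(validateUpdateQuotaBody({ ...quotaPatch }));
```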
Check quotas
Check usage against all matching quotas for the given dimensions without consuming quota.
This endpoint allows you to query quota status without actually recording usage. Useful for:
Dimension Matching: Quotas are matched based on dimension filters. Pass dimensions like:
customer_id: Filter by customer
event_type: Filter by event type
model: Filter by model
provider: Filter by provider
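To make the matching idea concrete, here is one plausible reading sketched in code. It assumes that a quota matches when every key in its dimension filters appears in the request dimensions with an equal value, and that a quota with no filters matches every request. These semantics are an assumption; the reference does not spell out the exact rules, and `QuotaSketch`, `quotaMatches`, and `matchingQuotas` are hypothetical names.

```typescript
type Dimensions = { [key: string]: string };

// Minimal stand-in for a stored quota; only the fields needed for matching.
interface QuotaSketch {
  id: string;
  dimensionFilters: { [key: string]: unknown };
}

// Assumed rule: every filter key must be present in the request dimensions
// with a strictly equal value. No filters means the quota applies to everything.
function quotaMatches(quota: QuotaSketch, dims: Dimensions): boolean {
  return Object.keys(quota.dimensionFilters).every(
    (key) => dims[key] === quota.dimensionFilters[key],
  );
}

function matchingQuotas(quotas: QuotaSketch[], dims: Dimensions): QuotaSketch[] {
  return quotas.filter((q) => quotaMatches(q, dims));
}

const sampleQuotas: QuotaSketch[] = [
  { id: "project_wide", dimensionFilters: {} },
  { id: "cust_123_all_models", dimensionFilters: { customer_id: "cust_123" } },
  { id: "cust_123_model_a", dimensionFilters: { customer_id: "cust_123", model: "model-a" } },
  { id: "cust_999_all_models", dimensionFilters: { customer_id: "cust_999" } },
];

const requestDimensions: Dimensions = {
  customer_id: "cust_123",
  event_type: "completion",
  model: "model-a",
  provider: "provider-x",
};

console.log(matchingQuotas(sampleQuotas, requestDimensions).map((q) => q.id));
// prints the ids project_wide, cust_123_all_models, cust_123_model_a
```

Under this reading, passing more dimensions never excludes a quota; it only gives narrower quotas the chance to match.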