cella's Inference Stats panel (M) tracks token usage and estimates cost for recognized AI models.
cella recognizes models from the following providers:
cella maintains a knownPricing map in the source code with per-model input/output token pricing (USD per 1M tokens). When an API response is intercepted, cella:
cella uses case-insensitive substring matching. If the intercepted model string contains a known key, the corresponding pricing is applied.
For example, github-copilot/gpt-4o matches the gpt-4o pricing entry.
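The matching rule above can be sketched roughly as follows. This is a minimal illustration, not cella's actual code: the helper name lookupPricing is hypothetical, the ModelPricing fields are taken from the knownPricing example on this page, and preferring the longest matching key when several keys match is an assumption added here so the result is deterministic.

```go
package main

import (
	"fmt"
	"strings"
)

// ModelPricing mirrors the fields used in the knownPricing map (USD per 1M tokens).
type ModelPricing struct {
	InputPer1M  float64
	OutputPer1M float64
}

var knownPricing = map[string]ModelPricing{
	"gpt-4o":            {InputPer1M: 2.50, OutputPer1M: 10.00},
	"claude-sonnet-4-5": {InputPer1M: 3.00, OutputPer1M: 15.00},
}

// lookupPricing is a hypothetical helper: it lowercases the intercepted model
// string and returns the pricing of a known key contained in it.
// Assumption: if several keys match, the longest one wins.
func lookupPricing(model string) (ModelPricing, bool) {
	lower := strings.ToLower(model)
	bestKey := ""
	for key := range knownPricing {
		if strings.Contains(lower, strings.ToLower(key)) && len(key) > len(bestKey) {
			bestKey = key
		}
	}
	if bestKey == "" {
		return ModelPricing{}, false
	}
	return knownPricing[bestKey], true
}

func main() {
	p, ok := lookupPricing("github-copilot/GPT-4o")
	fmt.Println(p.InputPer1M, p.OutputPer1M, ok) // prints: 2.5 10 true

	_, ok = lookupPricing("some-unknown-model")
	fmt.Println(ok) // prints: false
}
```

Go map iteration order is unspecified, so a tie-break such as the longest-key rule matters once one key is a substring of another.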
If a model isn't recognized, cella still tracks token counts and request rates; it just reports $0.00 for cost. You can identify these models in the Inference Stats panel and add their pricing to the knownPricing map.
To update or add model pricing, edit the knownPricing map in internal/proxy/inference_stats.go:
var knownPricing = map[string]ModelPricing{
	"gpt-4o":            {InputPer1M: 2.50, OutputPer1M: 10.00},
	"claude-sonnet-4-5": {InputPer1M: 3.00, OutputPer1M: 15.00},
	// Add new models here...
}
Then rebuild: go build -o cella ./cmd/main.go