Your OpenAI Bill Doubled. Here's Which Feature Caused It.
Why Provider Dashboards Can't Help You
When your OpenAI bill doubles, a provider dashboard can confirm the total went up. It usually cannot tell you that chat-completion caused 64% of the spend while summarizer caused 22% and search-assist caused 14%. By the time the provider dashboard updates, the runaway feature may have been expensive for a full day.
What Feature-Level Tracking Looks Like
Feature-level tracking attaches a stable name to every request: chat-completion, summarizer, search-assist, or whatever your product actually ships. The result is a cost breakdown you can act on immediately.
Interactive feature cost demo
The Integration
Add the tracking call after your LLM response returns. It is async, fire-and-forget, and never in the user's request path.
// fire-and-forget — never blocks your users
fetch('https://llm-track.com/api/ingest', {
method: 'POST',
headers: { 'x-api-key': process.env.LLMTRACK_KEY },
body: JSON.stringify({
provider: 'openai',
model: response.model,
feature_name: 'chat-completion',
total_tokens: response.usage.total_tokens,
latency_ms: Date.now() - startedAt,
status: 'success'
})
}).catch(() => {})What You Can Do With This Data
Once every request has a feature name, you can sort features by spend, catch regressions before the invoice arrives, compare cost per user, and decide where budget alerts should fire.
Start with one production endpoint and watch costs appear in under a second.
The Model Switching Opportunity
The feature that caused the spike might not need the expensive model. Structured output and summarization often move safely to GPT-4o Mini or Gemini Flash after testing a real sample.
FAQ
No. The call runs after your LLM response and should be fire-and-forget with a safe catch handler.
No. Track metadata only: tokens, computed cost, model, feature name, latency, and status.
The provider dashboard is delayed and aggregate. It is useful for billing, but not for immediate feature-level operations.
See your feature costs in under 5 minutes
One async call. No proxy. Know which feature caused the bill before the invoice lands.
Start free →