# Overview

Infron provides built‑in Usage Accounting that allows you to monitor AI model usage and cost breakdowns directly from your API responses. This feature includes detailed insights into token consumption, associated costs, and caching behavior.

**Benefits**&#x20;

* Efficiency: Retrieve usage information without additional API calls&#x20;
* Accuracy: Token counts are computed using each model’s native tokenizer&#x20;
* Transparency: Track real-time cost and cached token utilization&#x20;
* Detailed Breakdown: Separate reporting for prompt, completion, reasoning, and cached tokens

**Usage Information**&#x20;

When enabled, the API returns comprehensive usage metrics, including:

* Prompt and completion token counts calculated with the model’s native tokenizer&#x20;
* Total cost in credits
* Reasoning token counts (when supported by the model)&#x20;
* Cached token counts (when applicable)

This usage information appears in the final SSE message for streaming responses, or in the full response body for non‑streaming requests.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://infronai.gitbook.io/docs/billing-apis/usage-and-cost/overview.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
