# Platform Overview

*Infron’s Inference Providers Routing Platform give developers access to thousands of ai models, powered by world-class inference providers.*&#x20;

*Infron's APIs are also integrated into the OpenAI-SDKs, Claude-SDKs and Google-SDKs, making it easy to explore serverless inference of models on your favorite providers.*

*Infron helps developers source and optimize AI usage. We believe the future is multi-model and multi-provider.*

<figure><img src="/files/QDtMwHzDpzf8n6qgcMwL" alt=""><figcaption></figcaption></figure>

### Who Uses Infron?

#### Startups & Enterprises <a href="#startups-and-enterprises" id="startups-and-enterprises"></a>

* **AI Startups** building and scaling models without infrastructure overhead
* **Enterprise Teams** running production workloads with reliability requirements
* **ML Engineers** needing flexible compute for training and experimentation

#### Researchers & Academia <a href="#researchers-and-academia" id="researchers-and-academia"></a>

* **Research Groups** pushing state-of-the-art with budget constraints
* **PhD Students & Professors** accessing cutting-edge ai models for research
* **University Students** completing coursework and projects

#### Developers & Hobbyists <a href="#developers-and-hobbyists" id="developers-and-hobbyists"></a>

* **Solo Developers** prototyping and launching AI applications
* **Hobbyists** experimenting with the latest models
* **Open Source Contributors** testing community projects

### Core principles and values of Infron

* **Price and Performance**.&#x20;

Infron scouts for the best prices, the lowest latencies, and the highest throughput across dozens of providers, and lets you choose how to [prioritize](broken://pages/ZXxfwYY21Yz9PCv9X7ja) them.

* **Standardized API**.&#x20;

No need to change code when switching between models or providers. One magic api access to thousand of open source models, commercial models and search agents.

* **Real-World Insights**.&#x20;

Be the first to take advantage of new models.&#x20;

Infron will continue to add more ai models.

* **Consolidated Billing**.&#x20;

Simple and transparent billing, regardless of how many providers you use.

* **Higher Availability**.&#x20;

Fallback providers, and automatic, smart routing means your requests still work even when providers go down.

* **Higher Rate Limits**.&#x20;

Infron works directly with providers to provide better rate limits and more throughput.

### Why Chooses Infron? <a href="#why-openrouter" id="why-openrouter"></a>

When you build AI applications, it's tough to manage multiple provider APIs, comparing model performance, and dealing with varying reliability. Infron solves these challenges by offering:

* **Instant Access to Cutting-Edge Models**: Go beyond mainstream providers to access thousands of specialized models across multiple AI tasks. Whether you need the latest language models, state-of-the-art image generators, or domain-specific embeddings, you'll find them here.
* **Zero Vendor Lock-in**: Unlike being tied to a single provider's model catalog, you get access to models from Cerebras, Groq, Together AI, Replicate, and more — all through one consistent interface.
* **Production-Ready Performance**: Built for enterprise workloads with the reliability your applications demand.

Here's what you can build:

* **Text Generation**: Use Large language models with tool-calling capabilities for chatbots, content generation, and code assistance
* **Image and Video Generation**: Create custom images and videos, including support for LoRAs and style customization
* **Search & Retrieval**: State-of-the-art embeddings for semantic search, RAG systems, and recommendation engines

### Key Features

* **🎯 All-in-One API**: A single API for text generation, image generation, document embeddings, search, deep seach, summarization, image classification, and more.
* **🔀 Multi-Provider Support**: Easily run models from top-tier providers like Cerebrase, Replicate, Sambanova, Together AI, and others.
* **🚀 Scalable & Reliable**: Built for high availability and low-latency performance in production environments.
* **🔧 Developer-Friendly**: Simple requests, fast responses, and a consistent developer experience across OpenAI, Claude clients.
* **👷 Easy to integrate**: Drop-in replacement for the OpenAI chat completions API.
* **💰 Cost-Effective**: No extra markup on provider rates.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://infronai.gitbook.io/docs/overview/introduction.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
