AI APIs - LLMs, Inference & Model Routing

0 results found

Sort by:

No results found. Try a different search term.

Homepage/Category (AI Models & Inference)

Top AI Models & Inference APIs on API.market

Browse our collection of high-quality AI Models & Inference APIs

Sort by:

Large language models

5.0(2)

🗂️OpenAI SDK

🚀 Unlock cutting-edge AI with models from OpenAI, Claude, Google Gemini, and Meta Llama. Build transformative experiences with the best in AI! ✨

By swift-api

96•172.2K

$1/month

Large language models

4.5(2)

🗂️GPT-5 nano

GPT-5 Nano is OpenAI fastest, cheapest version of GPT-5. It's great for summarization and classification tasks

By swift-api

76•666

10K FREE API units

Large language models

4.5(2)

🗂️GPT-5.2

GPT-5.2 delivers exceptional coding and agentic task automation across industries with superior performance.

By swift-api

26•552

10K FREE API units

Large language models

5.0(1)

🗂️⚡ Claude AI | All Models

Get seamless access to all Claude models (Claude-4, Claude-3.5 and more) with our high-performance, cost-effective API.

By flash-ai

89•1.6K

10K FREE API units

Large language models

5.0(1)

🗂️🚀🗂️🚀 BridgeML LLM API: 15+ Model Options for High-Speed, Affordable AI Integration

High-Speed, Low-Cost AI API with Extensive Language Model Support for Apps

By bridgeml

37•808

100K FREE API units

Large language models

5.0(1)

🗂️✨ GPT-4o

GPT-4o (GPT-4 Omni) is the most advanced multimodal model (accepting text or image inputs and outputting text)

By swift-api

24•83.4K

$1/month

Large language models

1.0(1)

🗂️✨ Chat GPT 3.5 Turbo

High Availability and Unlimited Calls for GPT 3.5 Turbo. We provide users with high-quality services

By swift-api

48•756

10K FREE API units

Large language models

0.0(0)

🗂️Only $10 for Unlimted AI Models!!!

Say Goodbye to Token Anxiety! Unlock the World's Best AI Models for the Cost of a Single Meal.

By draco-ai

2•19

50 FREE API units

Large language models

0.0(0)

🗂️⚡Gemini AI | All Models

⚡50% Discount | Direct and highly available API for all latest Gemini models: Gemini 3 Series, Gemini 2.5 Series, and more

By flash-ai

36•6.2K

10K FREE API units

Multi-model routing

0.0(0)

🗂️Vertex Key AI API

OpenAI-compatible API with 60+ models: Claude Opus/Sonnet, GPT-5, Gemini. Streaming, Vision, Tool Use. Multi-provider failover.

By vertex-key

29•3.3K

$1/month

Multi-model routing

0.0(0)

🗂️Unify.ai

Unify is your centralized platform for LLM endpoints.

By unify

11•25

100 FREE API units

Multi-model routing

0.0(0)

🗂️✨ Swift AI

🚀 Unlock cutting-edge AI with models from OpenAI, Claude, Google Gemini, and Meta Llama. Build transformative experiences with the best in AI! ✨

By swift-api

37•51.3K

10K FREE API units

Large language models

0.0(0)

🗂️⚡OpenAI | All Models

Access GPT-5, GPT-4.1, GPT-4o models directly with high availability and enjoy a 50% discount!

By flash-ai

19•395

10K FREE API units

Multi-model routing

0.0(0)

🗂️💬 LvyAI Chat API: OpenAI-Compatible, 70% Cheaper Models

Experience low-latency chat completions with GLM-4, Qwen-Turbo, and DeepSeek at unmatched performance and cost savings.

By lvyapi-1

6•47

1K FREE API units

Large language models

0.0(0)

🗂️🛡️ Hallucination Guard API: Detect & Prevent LLM Hallucinations via Data

Rapidly improve AI output accuracy by leveraging the most complete LLM Hallucination Taxonomy, Benchmark Scores, and Detection Methods.

By nicheapi-llc-1

1•7

$29/month

Large language models

0.0(0)

🗂️GPT-4.1

GPT-4.1 excels at instruction following and tool calling, with broad knowledge across domains. It features a 1M token context window, and low latency without a

By swift-api

13•33

1K FREE API units

Large language models

0.0(0)

🗂️GPT 4.1-Nano

GPT-4.1 nano excels at instruction following and tool calling. It features a 1M token context window, and low latency without a reasoning step.

By swift-api

14•273

100K FREE API units

Large language models

0.0(0)

🗂️GPT-5

GPT-5 is OpenAI flagship model for coding, reasoning, and agentic tasks across domains

By swift-api

47•121

1K FREE API units

Large language models

0.0(0)

🗂️GPT-5 mini

GPT-5 mini is a faster, more cost-efficient version of GPT-5. It's great for well-defined tasks and precise prompts.

By swift-api

15•18

1K FREE API units

Large language models

0.0(0)

🗂️GPT-4.1 mini

GPT-4.1 mini excels at instruction following and tool calling. It features a 1M token context window, and low latency without a reasoning step.

By swift-api

5•21

1K FREE API units

Large language models

0.0(0)

🗂️🖼️ Generate Images with nano banana flash (Gemini 2.5 Flash Image) Model

Generate high-quality images from text or image references quickly with impressive performance and flat pricing.

By google

2•24

$0.039 per image

Large language models

0.0(0)

🗂️📸 (gemini-3-pro-image-preview) Generate & Edit Images using Nano Banana Pro

Generate photorealistic images from text or reference images using Google's Gemini 3 Pro Image Preview.

By google

3•47

$0.15 per API unit

Large language models

0.0(0)

🗂️Gemini 3.1 Flash Lite API Relay

Google's fastest Gemini model. Ultra-low latency, multimodal, China direct connection.

By flashrelay

5•12

10K FREE API units

Large language models

0.0(0)

🗂️🎨 (gemini-3.1-flash-image-preview) Generate Images & Edit Text Fast with Nano Banana 2

Iteratively create and edit images with text, boasting impressive speed and multi-turn editing.

By google

2•26

$0.067 per API unit

Large language models

0.0(0)

🗂️Doubao Seed 2.0 Mini API Relay

ByteDance's ultra-low cost, high-concurrency model. Multimodal, China direct connection.

By flashrelay

4•7

10K FREE API units

Large language models

0.0(0)

🗂️DeepSeek V4 Flash High-Speed API Relay

Access the latest DeepSeek V4 Flash model. Fast, affordable, China direct connection, OpenAI compatible.

By flashrelay

8•4.6K

100 FREE API units