Top AI Models & Inference APIs on API.market

Browse our collection of high-quality AI Models & Inference APIs

Sort by:
Large language models
5.0(2)

πŸš€ Unlock cutting-edge AI with models from OpenAI, Claude, Google Gemini, and Meta Llama. Build transformative experiences with the best in AI! ✨

88β€’25.1K
$1/month
Large language models
4.5(2)

GPT-5 Nano is OpenAI fastest, cheapest version of GPT-5. It's great for summarization and classification tasks

67β€’605
10K FREE API units
Large language models
4.5(2)

GPT-5.2 delivers exceptional coding and agentic task automation across industries with superior performance.

18β€’357
10K FREE API units
Large language models
5.0(1)

High-Speed, Low-Cost AI API with Extensive Language Model Support for Apps

30β€’808
100K FREE API units
Large language models
5.0(1)

GPT-4o (GPT-4 Omni) is the most advanced multimodal model (accepting text or image inputs and outputting text)

21β€’83.4K
$1/month
Large language models
1.0(1)

High Availability and Unlimited Calls for GPT 3.5 Turbo. We provide users with high-quality services

46β€’729
10K FREE API units
Multi-model routing
0.0(0)

OpenAI-compatible API with 60+ models: Claude Opus/Sonnet, GPT-5, Gemini. Streaming, Vision, Tool Use. Multi-provider failover.

26β€’3.2K
$1/month
Multi-model routing
0.0(0)

Unify is your centralized platform for LLM endpoints.

9β€’22
100 FREE API units
Multi-model routing
0.0(0)

πŸš€ Unlock cutting-edge AI with models from OpenAI, Claude, Google Gemini, and Meta Llama. Build transformative experiences with the best in AI! ✨

33β€’336
10K FREE API units
Large language models
0.0(0)

Access GPT-5, GPT-4.1, GPT-4o models directly with high availability and enjoy a 50% discount!

14β€’97
10K FREE API units
Multi-model routing
0.0(0)

Experience low-latency chat completions with GLM-4, Qwen-Turbo, and DeepSeek at unmatched performance and cost savings.

3β€’38
1K FREE API units
Large language models
0.0(0)

Rapidly improve AI output accuracy by leveraging the most complete LLM Hallucination Taxonomy, Benchmark Scores, and Detection Methods.

1β€’7
$29/month
Large language models
0.0(0)

GPT-4.1 nano excels at instruction following and tool calling. It features a 1M token context window, and low latency without a reasoning step.

14β€’273
100K FREE API units
Large language models
0.0(0)

GPT-4.1 excels at instruction following and tool calling, with broad knowledge across domains. It features a 1M token context window, and low latency without a

13β€’33
1K FREE API units
Large language models
0.0(0)

GPT-5 is OpenAI flagship model for coding, reasoning, and agentic tasks across domains

44β€’88
1K FREE API units
Large language models
0.0(0)

GPT-5 mini is a faster, more cost-efficient version of GPT-5. It's great for well-defined tasks and precise prompts.

15β€’18
1K FREE API units
Large language models
0.0(0)

GPT-4.1 mini excels at instruction following and tool calling. It features a 1M token context window, and low latency without a reasoning step.

5β€’20
1K FREE API units
Large language models
0.0(0)

Generate high-quality images from text or image references quickly with impressive performance and flat pricing.

2β€’24
$0.039 per API unit
Large language models
0.0(0)

⚑50% Discount | Direct and highly available API for all latest Gemini models: Gemini 3 Series, Gemini 2.5 Series, and more

28β€’5.2K
10K FREE API units
Large language models
0.0(0)

Generate photorealistic images from text or reference images using Google's Gemini 3 Pro Image Preview.

3β€’45
$0.15 per API unit
Large language models
0.0(0)

Google's fastest Gemini model. Ultra-low latency, multimodal, China direct connection.

5β€’12
10K FREE API units
Large language models
0.0(0)

Iteratively create and edit images with text, boasting impressive speed and multi-turn editing.

2β€’24
$0.067 per API unit
Large language models
0.0(0)

ByteDance's ultra-low cost, high-concurrency model. Multimodal, China direct connection.

2β€’3
10K FREE API units
Large language models
0.0(0)

Access the latest DeepSeek V4 Flash model. Fast, affordable, China direct connection, OpenAI compatible.

5β€’80
100 FREE API units
Large language models
0.0(0)

Get seamless access to all Claude models (Claude-4, Claude-3.5 and more) with our high-performance, cost-effective API.

45β€’704
10K FREE API units