AI Model Directory

Structured facts, compatibility notes, source freshness, and programmatic SEO fields.

Claude Haiku 3.5

Anthropic · Updated 2026-05-19

Anthropic Claude model entry for fast, efficient, and targeted API workloads.

low-latency generationcost-sensitive generationagent workflowtargeted classification

Claude Opus 4.7

Anthropic · Updated 2026-05-21

Anthropic flagship Claude Opus 4.7 entry for advanced reasoning, adaptive thinking, agent teams, and complex planning.

reasoningcode reviewagent planninglong-form reasoning

Claude Sonnet 4.6

Anthropic · Updated 2026-05-21

Anthropic Claude Sonnet 4.6 model for balanced intelligence, coding, writing, and complex task orchestration with 1M-token context.

code reviewlong-form reasoningagent planning

DeepSeek V4 (Pro-Max / Flash-Max)

DeepSeek · Updated 2026-05-21

DeepSeek V4-series models including V4-Pro-Max and V4-Flash-Max for coding, reasoning, and agent workflows with competitive pricing.

codingagent workflowcost-sensitive generation

DeepSeek-V3.2

DeepSeek · Updated 2026-05-21

DeepSeek-V3.2 reasoning model for math, coding, and tool-use workflows with cost-efficient pricing and OpenAI-compatible API.

codingagent workflowcost-sensitive generationOpenAI-compatible integration

Devstral 2

Mistral AI · Updated 2026-05-19

Mistral coding-agent model entry for software engineering tasks, coding workflows, and agentic development evaluation.

codingagent workflowcode reviewsoftware engineering tasks

Gemini 2.5 Flash

Google · Updated 2026-05-19

Google Gemini model entry for low-latency, high-volume, multimodal, and agentic workloads.

low-latency generationmultimodal workflowagent workflowGoogle ecosystem

Gemini 3.1 Pro

Google · Updated 2026-05-21

Google Gemini 3.1 Pro model for multimodal reasoning, breakthrough pattern recognition, Google ecosystem workflows, and search-grounded tasks with 1M-token context.

multimodal workflowGoogle ecosystemsearch-grounded answersreasoning

Gemini 3.5 Flash

Google · Updated 2026-05-21

Google Gemini 3.5 Flash lightweight model for low-latency generation, multimodal tasks, and cost-efficient Google ecosystem workflows.

low-latency generationmultimodal workflowcost-sensitive generationGoogle ecosystem

Gemini Pro

Google · Updated 2026-05-17

Model entry for multimodal work, search grounding, and Google ecosystem integration, used for scenario-based recommendations.

multimodal workflowGoogle ecosystemsearch-grounded answers

GPT-5.5

OpenAI · Updated 2026-05-21

OpenAI flagship GPT-5.5 model for coding, agentic tasks, reasoning workflows, and Responses API integrations with 400K-token context.

codingagent workflowreasoningtool-use

GPT-5.5 Codex

OpenAI · Updated 2026-05-21

OpenAI coding-specialized GPT-5.5 Codex model for long-horizon agentic coding tasks, repository edits, and test generation.

codingagent workflowagentic codingcode review

gpt-oss-120b

OpenAI · Updated 2026-05-19

OpenAI open-weight model entry for high-reasoning, agentic, and self-hosted deployment evaluation.

open-weight deploymentagent workflowreasoninglocal inference

gpt-oss-20b

OpenAI · Updated 2026-05-19

OpenAI open-weight model entry for lower-latency local, specialized, and cost-aware deployment paths.

open-weight deploymentlocal inferencecost-sensitive generationlow-latency generation

Grok 4.3

xAI · Updated 2026-05-18

xAI flagship model entry for agentic tool calling, instruction following, reasoning, and OpenAI-compatible Responses API integration.

agent workflowtool-usereasoningOpenAI-compatible integration

Llama 4 Maverick

Meta · Updated 2026-05-18

Meta Llama 4 Maverick model entry for multimodal open-weight workflows, multilingual text, code generation, and local or hosted deployment evaluation.

multimodal workflowdocument workflowcodingcost-sensitive generation

Mistral Large 3

Mistral AI · Updated 2026-05-21

Mistral Large 3 flagship model for multilingual tasks, general reasoning, and open-weight or API deployment with 256K-token context.

codingagent workflowcost-sensitive generationmultilingualOpenAI-compatible integration

Mistral Medium 3.5

Mistral AI · Updated 2026-05-17

Mistral model entry for coding, agentic workflows, and multimodal use cases.

codingagent workflowmultimodal workflowdocument workflow

OpenAI o3 / o4-mini

OpenAI · Updated 2026-05-21

OpenAI reasoning-optimized o-series models (o3 and o4-mini) for complex multi-step reasoning, math, science, and coding tasks.

reasoningcodingagent workflowtool-use

Qwen3.6

Alibaba Cloud · Updated 2026-05-21

Open-weight Qwen3.6 model family for hybrid thinking, multilingual tasks, coding, and OpenAI-compatible self-hosted serving with 128K context.

reasoningcodingcost-sensitive generationOpenAI-compatible integration