ModelRadar — AI model releases & announcements

This week · 2 models

2026-W26

Sakana AI's higher-performance Fugu model — not a monolithic LLM but a learned multi-agent orchestration system that routes requests across multiple models and tools. Multimodal (text+image input), 1M context, $5/$30 per MTok on OpenRouter.

Jun 24

LFM2.5-230M · LiquidAI · 🌐 ?

text · open

New on HuggingFace: model by LiquidAI with 13 likes.

Last week · 5 models

2026-W25

Jun 20

◆ Underrated

GLM-5.2 · Z.AI (Zhipu) · 🇨🇳 CN

text · reasoning · open · 1M ctx · 753B MoE

Brand-new flagship from Zhipu/Z.AI — released on June 20, 2026. The GLM-5 series ships monthly (Apr→May→Jun). Extremely cheap on OpenRouter ($0.0012/$0.0041 per MTok). 1M-token context.

Jun 18

Google: Nano Banana Pro (Gemini 3 Pro Image) · google · 🌐 ?

text

Google's Gemini-3-Pro-Image model ('Nano Banana Pro') — available on OpenRouter since June 18, 2026, $2.00/MTok (input).

Jun 17

◆ Underrated

FastContext 1.0 (4B) · Microsoft Research · 🇺🇸 US

code · open · 4B

Microsoft's specialized code-search model for coding agents — powers the 'Explore' subagent in SWE-FastContext. Not a general-purpose LLM, but relevant for agentic setups.

Jun 17

Gemini 3.1 Flash Image · Google DeepMind · 🇺🇸 US

multimodal · image · 131K ctx

Google's latest Gemini Flash generation with native image input/output. Available on OpenRouter ($0.50/$3 per MTok). Gemini 3.x is a standalone series alongside Gemini 2.5.

Jun 15

◆ Underrated

North Mini Code 1.0 · Cohere · 🇨🇦 CA

code · open · 262K ctx · 30B

Cohere's code specialist — 30B, open weights, Apache 2.0, 256K context. Free on OpenRouter. Surprisingly strong for its size; Cohere's first genuinely strong open-weights coding model.

Jun 8, 2026 – Jun 14 · 2 models

2026-W24

Jun 12

Kimi K2.7 Code · Moonshot AI · 🇨🇳 CN

reasoning · code · open · 262K ctx · 1T MoE (32B active, 384 experts)

Coding-focused offshoot of the K2 series — ~30% fewer thinking tokens than K2.6. Competitive with GPT-5.5 and Claude Opus 4.8 on coding benchmarks. 256K context, MLA attention, 1T-parameter MoE.

Jun 9

Claude Fable 5 · Anthropic · 🇺🇸 US

reasoning · multimodal · 1M ctx

Anthropic's new top flagship — replaces the Opus tier as the strongest generally available model. Arena Elo 1508 (rank 1). 1M context, Adaptive Thinking always on, $10/$50 per MTok. Fable 5 + Mythos 5 (invite-only) launched together on June 9, 2026.

Jun 1, 2026 – Jun 7 · 6 models

2026-W23

Jun 4

Nemotron 3 Ultra 550B · NVIDIA · 🇺🇸 US

reasoning · text · open · 1M ctx · 550B MoE (55B active)

NVIDIA's powerful open-weights MoE model: hybrid LatentMoE + MTP layers, 1M context, 550B/55B active. Free on OpenRouter. Focus: complex multi-agent workflows, code, math, science.

Jun 4

◆ Underrated

Nex-N2-Pro · Nex AGI (Shanghai Innovation Inst.) · 🇨🇳 CN

reasoning · text · open · 262K ctx · 397B

397B model from the Shanghai Innovation Institute — open source, Apache 2.0, free on OpenRouter. Agentic-focused, with papers on a 'Unified Ecosystem for Large-Scale Environment Construction'. 7.87K HF likes despite barely any Western coverage.

Jun 3

Qwen3.7 Plus · Alibaba Qwen · 🇨🇳 CN

reasoning · text · 1M ctx

Qwen's latest Plus tier — 1M context, $0.32/$1.28 per MTok. Qwen3.7 Max (stronger) and Plus (more efficient) form the top of the Qwen3.7 line.

Jun 2

◆ Underrated

MiniMax M3 · MiniMax · 🇨🇳 CN

multimodal · reasoning · 1M ctx

MiniMax is barely known in the West, but its M3 model on OpenRouter ($0.30/$1.20 per MTok) is one of the cheapest 1M-context multimodal options. Successor to Text-01 (4M context).

Jun 1

DeepSeek V4 Pro · DeepSeek · 🇨🇳 CN

reasoning · text · open · 1M ctx · 1.6T MoE (49B active)

DeepSeek's heavyweight: 1.6T total, 49B active, 1M context, FP4/FP8 mixed precision. Codeforces 3206 Elo beats GPT-5.4. MIT license, fully open weights. $0.435/$0.87 per MTok on OpenRouter.

Jun 1

DeepSeek V4 Flash · DeepSeek · 🇨🇳 CN

text · reasoning · open · 1M ctx · 158B MoE

Lean V4 variant: 158B MoE, 1M context, MIT. Extremely cheap on OpenRouter ($0.09/$0.18 per MTok). 2.48M HF downloads in just days signal massive demand.

May 25, 2026 – May 31 · 2 models

2026-W22

May 29

Claude Opus 4.8 · Anthropic · 🇺🇸 US

reasoning · multimodal · 1M ctx

Current Opus-tier model (until Claude Fable 5). Arena Elo ~1490. 1M context, Adaptive Thinking, knowledge through Jan 2026. $5/$25 per MTok direct, similar on OpenRouter.

May 29

◆ Underrated

Step 3.7 Flash · StepFun AI · 🇨🇳 CN

multimodal · reasoning · open · 262K ctx · 201B

201B multimodal open-weights model from StepFun — underrated in the West. Image+text input, strong reasoning. Successor to Step 3.5 Flash (199B). $0.20/$1.15 per MTok on OpenRouter.

May 18, 2026 – May 24 · 4 models

2026-W21

May 23

Qwen3.7 Max · Alibaba Qwen · 🇨🇳 CN

reasoning · text · 1M ctx

Strongest Qwen3.7 variant: 1M context, $1.25/$3.75 per MTok. More capable than Plus, but pricier. The Qwen3.7 line shipped late May 2026 as the successor to Qwen3.6.

May 22

◆ Underrated

Grok Build 0.1 · xAI · 🇺🇸 US

code · reasoning · 256K ctx

xAI's coding-focused model — 256K context, $1/$2 per MTok. 'Build' implies software development as the main use case. A separate coding line alongside the general-purpose Grok 4.3.

May 21

Gemini 3.5 Flash · Google DeepMind · 🇺🇸 US

reasoning · multimodal · 1M ctx

Google's current Flash generation: 1M context, all modalities, $1.50/$9 per MTok on OpenRouter. Gemini 3.5 Flash is the speed tier of the Gemini 3.x series that replaces Gemini 2.5.

May 19

Kimi K2.6 · Moonshot AI · 🇨🇳 CN

reasoning · multimodal · open · 262K ctx · 1.1T MoE

Multimodal predecessor of K2.7-Code — Visual Agentic Intelligence with 1.1T parameters. 2.66M HF downloads. Still relevant for multimodal tasks (images, video) since K2.7 is code-only.

May 11, 2026 – May 17 · 1 model

2026-W20

May 13

◆ Underrated

GLM-5.1 · Z.AI (Zhipu) · 🇨🇳 CN

text · reasoning · open · 131K ctx · 754B MoE

Direct predecessor of GLM-5.2 (June 20). Still interesting for local deployment. The GLM-5.1 FP8 variant has 1.33M downloads — massive interest from the CN community.

May 4, 2026 – May 10 · 1 model

2026-W19

May 9

Gemini 3.1 Flash Lite · Google DeepMind · 🇺🇸 US

multimodal · text · 1M ctx

Google's cheapest Gemini 3.x option: 1M context, $0.25/$1.50 per MTok. The 'Lite' variant is the first choice when token costs are critical and multimodality is needed.

Apr 27, 2026 – May 3 · 11 models

2026-W18

May 2

◆ Underrated

Laguna M.1 · Poolside AI · 🇺🇸 US

code · reasoning · open · 262K ctx · 226B MoE (23B active)

Poolside's strong coding-agent model: 226B MoE, Apache 2.0, SWE-bench Pro 49.2%. Free on OpenRouter! Specialized for agentic long-horizon coding tasks. Poolside is a little-known US AI company.

May 2

◆ Underrated

Laguna XS.2 · Poolside AI · 🇺🇸 US

code · open · 262K ctx · 33B MoE (3B active)

Poolside's lean coding model: 33B MoE, 3B active, for local deployments. Free on OpenRouter. Sliding-window attention for very fast inference. 231K downloads on HF.

May 1

Grok 4.3 · xAI · 🇺🇸 US

reasoning · multimodal · 1M ctx

xAI's current frontier model on OpenRouter ($1.25/$2.50 per MTok). Grok 4.3 positions itself as a strong all-rounder. Also accessible via Grok.com / X Premium.

May 1

Mistral Small 4 (119B) · Mistral AI · 🇪🇺 EU

text · reasoning · open · 262K ctx · 119B MoE

Mistral ironically calls 119B 'Small' — Apache 2.0, with instruction-following, reasoning and coding in one. Mistral positions itself as the strongest European open-weights challenger.

May 1

◆ Underrated

Granite 4.1 8B · IBM · 🇺🇸 US

text · code · open · 131K ctx · 8B

IBM's Granite 4.x generation: 8B, Apache 2.0, $0.05/$0.10 per MTok (one of the cheapest). Enterprise-focused, strong on structured tasks. IBM ships Granite models consistently with open weights.

May 1

Mistral Medium 3.5 · Mistral AI · 🇪🇺 EU

text · multimodal · 262K ctx · ~128B

Mistral Medium 3.5 is Mistral's first flagship to handle instruction-following, reasoning and coding in a unified way. 418K HF downloads, EAGLE-acceleration variant available. $1.50/$7.50 per MTok on OpenRouter.

Apr 30

◆ Underrated

Command A+ (2026-05) · Cohere · 🇨🇦 CA

reasoning · multimodal · open · 131K ctx · 218B MoE (25B active)

Cohere's first multimodal open-weights flagship: 218B MoE, 25B active, 128K context, 48 languages, Apache 2.0. Reasoning tokens, tool use with JSON schema. Not on OpenRouter yet.

Apr 30

Kimi K2.5 · Moonshot AI · 🇨🇳 CN

reasoning · multimodal · open · 131K ctx · 1.1T MoE

Moonshot's 'Visual Agentic Intelligence' model, with 1.81M HF downloads. Basis for K2.6 and K2.7. One of the first models at the 1T-parameter scale with real multimodal agentic capabilities.

Apr 30

◆ Underrated

Nemotron 3 Nano Omni (30B) · NVIDIA · 🇺🇸 US

reasoning · multimodal · open · 256K ctx · 30B MoE (3B active)

NVIDIA's small multimodal reasoning model: 30B MoE, only 3B active, text+image+audio input. Free on OpenRouter. Designed for on-device, 4x faster than its predecessor, reasoning ON/OFF mode.

Apr 28

Qwen3.6 Flash · Alibaba Qwen · 🇨🇳 CN

reasoning · text · 1M ctx

Fast variant of the Qwen3.6 family: 1M context, $0.19/$1.13 per MTok. Qwen3.6 shipped as Flash, 27B, 35B and Max-preview variants — all on April 28, 2026.

Apr 28

◆ Underrated

Qwen3.6 35B-A3B · Alibaba Qwen · 🇨🇳 CN

reasoning · text · open · 262K ctx · 35B MoE (3B active)

Open-weights MoE variant of the Qwen3.6 generation: 35B total, only 3B active. $0.14/$1 per MTok on OpenRouter. Very efficient for local deployment on consumer hardware.

Apr 20, 2026 – Apr 26 · 8 models

2026-W17

Apr 25

GPT-5.5 · OpenAI · 🇺🇸 US

reasoning · multimodal · 1.1M ctx

OpenAI's current frontier model: 'A new class of intelligence for coding and professional work.' 1M context, knowledge cutoff Dec 2025, $5/$30 per MTok. GPT-5.5 Pro as the stronger variant ($30/$180 per MTok).

Apr 25

GPT-5.5 Pro · OpenAI · 🇺🇸 US

reasoning · multimodal · 1.1M ctx

Strongest available GPT-5.5 variant: $30/$180 per MTok — priced on par with o3. Positioned for enterprise use at the highest demands.

Apr 23

◆ Underrated

MiMo V2.5 Pro · Xiaomi · 🇨🇳 CN

reasoning · text · 1M ctx

Xiaomi's reasoning model on OpenRouter ($0.435/$0.87 per MTok). MiMo is Xiaomi's first serious LLM push — barely noticed in the West. 1M context. V2.5 (lighter) and V2.5-Pro available.

Apr 23

◆ Underrated

Hy3 Preview · Tencent · 🇨🇳 CN

text · reasoning · 262K ctx

Tencent's Hunyuan 3 in preview on OpenRouter ($0.063/$0.21 per MTok). One of the cheapest models available. Tencent is a heavyweight that gets little Western attention in AI.

Apr 23

◆ Underrated

Ling-2.6-1T · Ant Group (inclusionAI) · 🇨🇳 CN

reasoning · text · open · 262K ctx · 1T MoE

Ant Group's (Alipay's parent) 1-trillion-parameter MoE model — Apache 2.0. Extremely cheap on OpenRouter ($0.075/$0.625 per MTok). 472 likes on HF despite barely any Western coverage.

MiMo V2.5 is Xiaomi's standard variant alongside V2.5-Pro: cheaper ($0.14/$0.28 per MTok), 1M context. Xiaomi's entry into the LLM market has gone almost unnoticed in the West, but the model is available on OpenRouter.

Apr 22

◆ Underrated

Ling-2.6 Flash (107B) · Ant Group (inclusionAI) · 🇨🇳 CN

reasoning · text · open · 262K ctx · 107B MoE

Ant Group's fast MoE variant: 107B, extremely cheap on OpenRouter ($0.01/$0.03 per MTok — one of the cheapest options anywhere). Ideal for agentic workflows where cost matters.

Apr 22

GPT-5.4 Image 2 · OpenAI · 🇺🇸 US

image · multimodal · 272K ctx

OpenAI's second image-generation iteration on the GPT-5.4 base: $8/$15 per MTok (input/output). The current standard route for professional AI image generation via OpenRouter API.

Mar 30, 2026 – Apr 5 · 1 model

2026-W14

Apr 5

◆ Underrated

GLM-5 · Z.AI (Zhipu) · 🇨🇳 CN

text · reasoning · open · 131K ctx · 754B MoE

First release of the GLM-5 generation (April 2026). 2.1K likes on HuggingFace. Zhipu's release cadence is remarkable: one major release per month.

Feb 23, 2026 – Mar 1 · 1 model

2026-W09

Mar 1

◆ Underrated

Gemma 3n E4B · Google DeepMind · 🇺🇸 US

multimodal · open · 33K ctx · 8B raw (4B effective)

Google's on-device multimodal model: handles text, image, video AND audio at just 4B effective parameters. MatFormer architecture enables sub-models within the same checkpoint. Designed for edge deployments.

Jan 26, 2026 – Feb 1 · 1 model

2026-W05

Feb 1

Claude Opus 4.7 · Anthropic · 🇺🇸 US

reasoning · multimodal · 1M ctx

A few generations behind Fable 5, but in thinking mode it still reaches Arena Elo 1502 (rank 3 worldwide). Still relevant for setups where Extended Thinking is desired and Fable 5's Adaptive Thinking isn't enough.

Oct 27, 2025 – Nov 2 · 1 model

2025-W44

Nov 1

GPT-5.4 · OpenAI · 🇺🇸 US

reasoning · multimodal · 1M ctx

Predecessor of GPT-5.5, but still very widely used ($2.50/$15 per MTok vs. $5/$30 for 5.5). Mini variant (400K context, $0.75/$4.50) for coding agents and subagents. IMOAnswerBench 91.4%.

Apr 28, 2025 – May 4 · 1 model

2025-W18

Apr 28

◆ Underrated

Qwen3-235B-A22B · Alibaba Qwen · 🇨🇳 CN

reasoning · text · open · 131K ctx · 235B MoE (22B active)

Alibaba's strong open-weights model from April 2025 — basis for many downstream variants. Qwen3.5 and 3.6 followed in 2026. Often overlooked in the West despite AIME 2025 85.7%.

Mar 31, 2025 – Apr 6 · 1 model

2025-W14

Apr 5

Llama 4 Maverick · Meta · 🇺🇸 US

multimodal · open · 1M ctx · 402B MoE (17B/128E)

Meta's multimodal MoE flagship: 17B active, 128 experts, 402B total, 1M-token context. Scout variant (16E, 109B) for lean deployments. Llama 5 is expected in H2 2026.

Mar 10, 2025 – Mar 16 · 1 model

2025-W11

Mar 12

Gemma 3 (27B) · Google DeepMind · 🇺🇸 US

multimodal · open · 131K ctx · 27B

Google's open-weights flagship: 27B (also 1B, 4B, 12B), 128K context, text+image, 140+ languages. Gemma 3 is the basis for many community fine-tunes. Superseded by Gemma 4 (April 2025).
