<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>Model Selection on KnightLi Blog</title>
        <link>https://knightli.com/en/tags/model-selection/</link>
        <description>Recent content in Model Selection on KnightLi Blog</description>
        <generator>Hugo -- gohugo.io</generator>
        <language>en</language>
        <lastBuildDate>Thu, 02 Jul 2026 07:27:14 +0800</lastBuildDate><atom:link href="https://knightli.com/en/tags/model-selection/index.xml" rel="self" type="application/rss+xml" /><item>
        <title>How to Choose Anthropic’s Current Claude Models: Fable, Opus, Sonnet, and Haiku Compared</title>
        <link>https://knightli.com/en/2026/07/02/anthropic-claude-models-comparison/</link>
        <pubDate>Thu, 02 Jul 2026 07:27:14 +0800</pubDate>
        
        <guid>https://knightli.com/en/2026/07/02/anthropic-claude-models-comparison/</guid>
        <description>&lt;p&gt;Anthropic’s current Claude lineup is no longer as simple as “Haiku is fast, Sonnet is balanced, Opus is strongest.” As of July 2026, the main models include Claude Fable 5, Claude Opus 4.8, Claude Sonnet 5, Claude Haiku 4.5, and the limited-availability Claude Mythos 5.&lt;/p&gt;
&lt;p&gt;If you want a quick rule: start with Sonnet 5 for daily development and most Agent tasks; use Opus 4.8 for complex enterprise Agents and hard coding; consider Fable 5 when you need the highest capability; use Haiku 4.5 for low-latency, cost-sensitive work; do not treat Mythos 5 as a general-purpose option because it is only available to approved customers.&lt;/p&gt;
&lt;h2 id=&#34;current-main-models&#34;&gt;Current main models
&lt;/h2&gt;&lt;table&gt;
  &lt;thead&gt;
      &lt;tr&gt;
          &lt;th&gt;Model&lt;/th&gt;
          &lt;th&gt;API ID / alias&lt;/th&gt;
          &lt;th&gt;Positioning&lt;/th&gt;
          &lt;th&gt;Context&lt;/th&gt;
          &lt;th&gt;Max output&lt;/th&gt;
          &lt;th&gt;Latency tendency&lt;/th&gt;
          &lt;th&gt;Standard price&lt;/th&gt;
      &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
      &lt;tr&gt;
          &lt;td&gt;Claude Fable 5&lt;/td&gt;
          &lt;td&gt;&lt;code&gt;claude-fable-5&lt;/code&gt;&lt;/td&gt;
          &lt;td&gt;Highest capability among broadly released Anthropic models, suited to long-running Agents&lt;/td&gt;
          &lt;td&gt;1M tokens&lt;/td&gt;
          &lt;td&gt;128k tokens&lt;/td&gt;
          &lt;td&gt;Slower&lt;/td&gt;
          &lt;td&gt;$10 / MTok input, $50 / MTok output&lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td&gt;Claude Opus 4.8&lt;/td&gt;
          &lt;td&gt;&lt;code&gt;claude-opus-4-8&lt;/code&gt;&lt;/td&gt;
          &lt;td&gt;Complex Agent coding, enterprise workflows, browser/computer use&lt;/td&gt;
          &lt;td&gt;1M tokens&lt;/td&gt;
          &lt;td&gt;128k tokens&lt;/td&gt;
          &lt;td&gt;Medium&lt;/td&gt;
          &lt;td&gt;$5 / MTok input, $25 / MTok output&lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td&gt;Claude Sonnet 5&lt;/td&gt;
          &lt;td&gt;&lt;code&gt;claude-sonnet-5&lt;/code&gt;&lt;/td&gt;
          &lt;td&gt;Best balance of speed, capability, and price; a strong default&lt;/td&gt;
          &lt;td&gt;1M tokens&lt;/td&gt;
          &lt;td&gt;128k tokens&lt;/td&gt;
          &lt;td&gt;Fast&lt;/td&gt;
          &lt;td&gt;$2 / $10 before 2026-08-31; then $3 / $15&lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td&gt;Claude Haiku 4.5&lt;/td&gt;
          &lt;td&gt;&lt;code&gt;claude-haiku-4-5&lt;/code&gt;&lt;/td&gt;
          &lt;td&gt;Fastest and lowest-cost model for high-throughput light tasks&lt;/td&gt;
          &lt;td&gt;200k tokens&lt;/td&gt;
          &lt;td&gt;64k tokens&lt;/td&gt;
          &lt;td&gt;Fastest&lt;/td&gt;
          &lt;td&gt;$1 / MTok input, $5 / MTok output&lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td&gt;Claude Mythos 5&lt;/td&gt;
          &lt;td&gt;&lt;code&gt;claude-mythos-5&lt;/code&gt;&lt;/td&gt;
          &lt;td&gt;Same specs and price as Fable 5, but limited availability&lt;/td&gt;
          &lt;td&gt;1M tokens&lt;/td&gt;
          &lt;td&gt;128k tokens&lt;/td&gt;
          &lt;td&gt;Slower&lt;/td&gt;
          &lt;td&gt;$10 / MTok input, $50 / MTok output&lt;/td&gt;
      &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;MTok means million tokens. Pricing may also be affected by prompt caching, batch processing, data residency, cloud platform, and region, so the table only includes common base input/output prices.&lt;/p&gt;
&lt;h2 id=&#34;fable-5-highest-capability-but-not-the-default&#34;&gt;Fable 5: highest capability, but not the default
&lt;/h2&gt;&lt;p&gt;Claude Fable 5 is the highest-capability broadly released model in Anthropic’s current lineup. Its official positioning is “next-generation intelligence for long-running agents.”&lt;/p&gt;
&lt;p&gt;It fits:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Long-running, multi-step Agent workflows.&lt;/li&gt;
&lt;li&gt;Complex research, code migration, and cross-system planning.&lt;/li&gt;
&lt;li&gt;Enterprise tasks that need the strongest reasoning and large context.&lt;/li&gt;
&lt;li&gt;High-value tasks that are less sensitive to cost and latency.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;But Fable 5 is not necessarily the default. It is the most expensive and slower. Unless the task truly needs the highest ceiling, starting with Sonnet 5 or Opus 4.8 is usually more practical.&lt;/p&gt;
&lt;p&gt;One more detail: Fable 5 uses adaptive thinking and it is always on. It decides when and how much to reason automatically. That helps complex tasks, but it also makes cost and response time more dependent on the task itself.&lt;/p&gt;
&lt;h2 id=&#34;opus-48-a-solid-choice-for-complex-coding-and-enterprise-agents&#34;&gt;Opus 4.8: a solid choice for complex coding and enterprise Agents
&lt;/h2&gt;&lt;p&gt;Claude Opus 4.8 sits between Fable 5 and Sonnet 5. Anthropic suggests starting with Opus 4.8 when you are unsure which model to use for complex Agent coding and enterprise work.&lt;/p&gt;
&lt;p&gt;Its strengths include:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;1M token context.&lt;/li&gt;
&lt;li&gt;128k token max output.&lt;/li&gt;
&lt;li&gt;Strong performance on complex coding, browser Agents, computer use, and enterprise workflows.&lt;/li&gt;
&lt;li&gt;Half the price of Fable 5.&lt;/li&gt;
&lt;li&gt;Support for adaptive thinking.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Opus 4.8 is well suited as the “hard-task default.” Codebase-level refactors, complex PR fixes, enterprise data analysis, multi-tool Agents, and long-document reasoning can all start here.&lt;/p&gt;
&lt;p&gt;If a task is extremely hard, upgrade from Opus 4.8 to Fable 5. If task volume is high and cost pressure is obvious, move down to Sonnet 5.&lt;/p&gt;
&lt;h2 id=&#34;sonnet-5-the-best-daily-default&#34;&gt;Sonnet 5: the best daily default
&lt;/h2&gt;&lt;p&gt;Claude Sonnet 5 is the most important default candidate today. Its positioning is the best combination of speed and intelligence.&lt;/p&gt;
&lt;p&gt;It fits:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Daily coding and code review.&lt;/li&gt;
&lt;li&gt;Documentation, research assistance, and knowledge work.&lt;/li&gt;
&lt;li&gt;Medium-complexity Agents.&lt;/li&gt;
&lt;li&gt;Internal enterprise automation.&lt;/li&gt;
&lt;li&gt;API applications that need cost control without losing too much quality.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The biggest change in Sonnet 5 is that many Agent capabilities that used to feel closer to Opus have moved into the Sonnet price tier. It also supports 1M token context and 128k token output, with lower latency than Opus.&lt;/p&gt;
&lt;p&gt;Pricing has an introductory discount until August 31, 2026: $2 / MTok input and $10 / MTok output. From September 1, 2026, it returns to $3 / MTok input and $15 / MTok output. Even at the standard price, it remains significantly cheaper than Opus 4.8.&lt;/p&gt;
&lt;p&gt;For most teams, I would start with Sonnet 5: let it cover 70% to 80% of tasks, then escalate truly difficult work to Opus 4.8 or Fable 5.&lt;/p&gt;
&lt;h2 id=&#34;haiku-45-high-throughput-low-latency-low-cost&#34;&gt;Haiku 4.5: high throughput, low latency, low cost
&lt;/h2&gt;&lt;p&gt;Claude Haiku 4.5 is the fastest model in the current main Claude line. Anthropic positions it as the fastest model with near-frontier intelligence.&lt;/p&gt;
&lt;p&gt;It fits:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Classification, extraction, summarization, and format conversion.&lt;/li&gt;
&lt;li&gt;Batch processing of short text.&lt;/li&gt;
&lt;li&gt;Customer support, tickets, moderation, and other high-throughput scenarios.&lt;/li&gt;
&lt;li&gt;Latency-sensitive interactive products.&lt;/li&gt;
&lt;li&gt;Light tasks that do not need 1M context.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Its limits are clear: 200k token context and 64k token max output, below the 1M / 128k of Fable, Opus, and Sonnet. It should not be the first choice for long codebases, complex multi-document analysis, or long-running Agents.&lt;/p&gt;
&lt;p&gt;But for large volumes of simple, fast tasks, Haiku 4.5 is straightforwardly cost-effective: $1 / MTok input and $5 / MTok output.&lt;/p&gt;
&lt;h2 id=&#34;mythos-5-not-a-normal-option&#34;&gt;Mythos 5: not a normal option
&lt;/h2&gt;&lt;p&gt;Claude Mythos 5 shares the same specs and price as Fable 5, but it is not generally available. Anthropic documentation marks it as limited availability, only for approved Project Glasswing customers.&lt;/p&gt;
&lt;p&gt;In other words, for ordinary API model selection, you usually do not need to include Mythos 5. Unless you are already approved or obtain access through Anthropic, AWS, or Google Cloud account teams, it is not a direct replacement for Fable 5.&lt;/p&gt;
&lt;h2 id=&#34;how-to-choose-tier-by-task-complexity&#34;&gt;How to choose: tier by task complexity
&lt;/h2&gt;&lt;p&gt;Use this order:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Start with Sonnet 5 by default&lt;br&gt;
Good for most coding, documentation, Agent, and enterprise automation tasks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Move to Opus 4.8 for clearly complex tasks&lt;br&gt;
Long codebases, multiple tools, multi-step tasks, and stronger reasoning requirements.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Try Fable 5 when you need the highest capability&lt;br&gt;
High-value, long-running, high-failure-cost tasks where price matters less.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Use Haiku 4.5 for high-throughput light work&lt;br&gt;
Classification, extraction, summaries, support, batch processing, and low-latency interaction.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Consider Mythos 5 only if you have access&lt;br&gt;
It is not a default option for ordinary developers.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id=&#34;two-migration-and-cost-details&#34;&gt;Two migration and cost details
&lt;/h2&gt;&lt;p&gt;First, newer Claude models use a new tokenizer. Anthropic’s documentation says Opus 4.7 and later Opus models, Fable 5, Mythos 5, Mythos Preview, and Sonnet 5 may produce about 30% more tokens for the same text. Cost estimates should not rely on per-million-token price alone.&lt;/p&gt;
&lt;p&gt;Second, 1M context does not mean every request should fill 1M context. Fable 5, Opus 4.8, and Sonnet 5 all support 1M tokens, but tool calls, caching, output, and multi-turn Agents add cost. A better deployment approach is:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Use prompt caching for common system prompts and long background context.&lt;/li&gt;
&lt;li&gt;Chunk long documents first, then use stronger models for synthesis.&lt;/li&gt;
&lt;li&gt;Send simple steps to Haiku or Sonnet, and escalate key decisions to Opus / Fable.&lt;/li&gt;
&lt;li&gt;Run real task samples instead of relying only on official benchmarks.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;a-simple-conclusion&#34;&gt;A simple conclusion
&lt;/h2&gt;&lt;p&gt;Claude’s current model line is fairly clear:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Fable 5: highest capability for the hardest and highest-value tasks.&lt;/li&gt;
&lt;li&gt;Opus 4.8: strong choice for complex Agent coding and enterprise work.&lt;/li&gt;
&lt;li&gt;Sonnet 5: best daily default, balancing capability, speed, and price.&lt;/li&gt;
&lt;li&gt;Haiku 4.5: fastest and cheapest for large-scale light tasks.&lt;/li&gt;
&lt;li&gt;Mythos 5: limited availability, not a normal option.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you are choosing Claude models for a product or internal workflow, the practical strategy is not to chase the highest tier. Split tasks: Haiku for lightweight batch work, Sonnet 5 as the default execution layer, Opus 4.8 for complex Agents and hard coding, and Fable 5 for the small set of tasks that are hardest, most expensive, and most worth it.&lt;/p&gt;
&lt;h2 id=&#34;model-routing-advice&#34;&gt;Model routing advice
&lt;/h2&gt;&lt;p&gt;When selecting Claude models, avoid having only one default model. A more useful design is routing: light batch jobs go to Haiku; daily coding and knowledge work go to Sonnet; complex repository tasks and multi-step Agents go to Opus; the highest-value and hardest tasks escalate to Fable.&lt;/p&gt;
&lt;p&gt;The routing can start simple. Summaries, classification, and field extraction prefer Haiku. PR review, documentation generation, and ordinary code changes prefer Sonnet. Cross-module refactors, incident reviews, and complex planning prefer Opus. If Opus fails repeatedly or the task is very valuable, use Fable.&lt;/p&gt;
&lt;p&gt;Every tier should also have exit conditions. Uncertain output, tool-call failures, repeated test failures, context over threshold, or tasks involving permissions or production data should trigger human confirmation instead of continued automation.&lt;/p&gt;
&lt;h2 id=&#34;cost-evaluation-method&#34;&gt;Cost evaluation method
&lt;/h2&gt;&lt;p&gt;The price table is only a rough estimate. Real cost depends on context length, cache hit rate, retries, output length, and human rework time. A more expensive model that completes a task once may be cheaper than multiple retries with a cheaper model.&lt;/p&gt;
&lt;p&gt;For each task type, record three metrics: average token cost, average human review time, and the share of failures that require escalation. After two weeks, it is usually clear which tasks belong on Sonnet and which deserve Opus or Fable.&lt;/p&gt;
&lt;p&gt;References:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a class=&#34;link&#34; href=&#34;https://docs.anthropic.com/en/docs/about-claude/models/overview&#34;  target=&#34;_blank&#34; rel=&#34;noopener&#34;
    &gt;Claude Models overview&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class=&#34;link&#34; href=&#34;https://docs.anthropic.com/en/docs/about-claude/pricing&#34;  target=&#34;_blank&#34; rel=&#34;noopener&#34;
    &gt;Claude Pricing&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
</description>
        </item>
        
    </channel>
</rss>
