AI Strategy VIP 2026-04-10

Don't Do Everything With The Best One

Tasks have weight. Tools have weight. Work happens only when the weights match. Today we unpack this principle through Claude's three models.

Most people who start using AI tools experience the same thing a month in — the bill comes in bigger than expected. Why? The answer is in a surprisingly old principle: you didn't divide the work to match the weight of the tool.

This essay explains that principle from start to finish. If you've never used Claude before, you can still follow along; we'll go slowly. Today's example happens to be Claude's three models, but the principle applies to any AI tool, in any era. Three years from now, even if the name "Claude" has changed, the spine of this essay will still hold.

Let's start with something most people don't realize.

Claude isn't one thing. It's three.

When you open Claude.ai or Claude Code, you see one window. So "Claude" looks like a single product. But behind that window sit three different models, and each request is handled by whichever one is selected. Let me tell you their names first, so it's less confusing.

Opus: Latin for "work," as in a composer's "Opus No. 9" or a "magnum opus" (a great work). The name fits the largest model in the Claude series.
Sonnet: the 14-line short poem. The middle size.
Haiku: the 3-line Japanese short verse. The smallest model.

The names sound poetic, like pure branding, but they're actually ordered by size. Opus → Sonnet → Haiku, getting smaller. "Bigger" means it has more internal numbers (parameters). Like more brain synapses in a person. More parameters means better at complex reasoning, but more expensive, and slower.

That leads to the natural question: why make three versions? Why not just use the best one?

The answer is simple. Not every task needs the biggest model.

To really understand this, we need a good analogy. Think of your local medical system.

Say you have a mild cold. Where do you go? Your neighborhood clinic. Not a university hospital professor. You could go to the professor, but it's inefficient. Getting an appointment is hard, waits are long, and the bill is several times higher. If the neighborhood doctor sees something serious, they write a referral to the university hospital. For a possible cancer diagnosis, you go straight to the university hospital. You match the doctor to the weight of the problem.

Claude's three models work exactly this way.

Opus = the university hospital professor. Complex judgment, big designs, deep reasoning. Expensive and slow, but handles the hard stuff.
Sonnet = the neighborhood clinic doctor. Handles well-defined work cleanly. Most tasks end right here.
Haiku = the nurse. Handles repetitive tasks quickly. Intake, classification, cleanup.

Giving a light task to Opus is like sending a cold patient to a university professor. Expensive, and wasteful for the professor too — they should be seeing the truly difficult cases in that time.

Good. Now let's look at "how expensive" with numbers. This part matters.

Claude charges by "tokens." Don't overthink it: a token is a small chunk of text, roughly three-quarters of an English word, so one word comes to about one and a third tokens. A Korean sentence is about 30-50 tokens. This whole essay is about 5,000 tokens.

Pricing as of April 2026, per million tokens:

Model	Input ($/M tokens)	Output ($/M tokens)	vs Haiku
Opus	$15	$75	18.75×
Sonnet	$3	$15	3.75×
Haiku	$0.80	$4	baseline

Opus costs almost 19 times more than Haiku (18.75×, on both input and output). Pin that number in your head. Again: roughly 19×.
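To make the ratio concrete, here is a minimal sketch that stores the price table as data and derives the multiples from it. The function names and the token counts in the demo call are illustrative assumptions, not part of any Claude API.

```python
# Prices in USD per million tokens, copied from the essay's April 2026 table.
PRICES = {
    "opus": {"input": 15.00, "output": 75.00},
    "sonnet": {"input": 3.00, "output": 15.00},
    "haiku": {"input": 0.80, "output": 4.00},
}


def call_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one call: token counts times the per-million-token price."""
    price = PRICES[model]
    return (input_tokens * price["input"] + output_tokens * price["output"]) / 1_000_000


def ratio_vs_haiku(model: str, kind: str = "input") -> float:
    """How many times more expensive `model` is than Haiku for input or output."""
    return round(PRICES[model][kind] / PRICES["haiku"][kind], 2)


if __name__ == "__main__":
    for name in PRICES:
        print(name, ratio_vs_haiku(name, "input"), ratio_vs_haiku(name, "output"))
    # Hypothetical call size: 2,000 input tokens and 500 output tokens.
    print(call_cost("opus", 2_000, 500), call_cost("haiku", 2_000, 500))
```

Keeping the prices in one dictionary means that when the price list changes, only the table needs updating; the ratios follow from it.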

Why does that matter for the bill? Because most people run everything through Opus out of habit. Renaming files: Opus. Polishing a three-line email: Opus. A simple summary: Opus. Each call is only a few tens of cents, but dozens of calls a day stack up past $200 a month. Going back to the medical analogy: it's sending colds, vaccinations, and routine blood draws all to the university hospital. Of course the health insurance breaks.
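The arithmetic behind a bill like that can be sketched in a few lines. Everything numeric here is an assumption made for illustration: a typical Opus call costing $0.25, 30 calls a day, a 30-day month, and calls of the same size costing proportionally less on the cheaper models according to the input prices in the table.

```python
# Hypothetical cost of one typical call on Opus, in USD (an assumption).
OPUS_CALL_USD = 0.25

# Relative price of each model versus Opus, from the table's input prices.
PRICE_RATIO_VS_OPUS = {"opus": 1.0, "sonnet": 3 / 15, "haiku": 0.80 / 15}


def monthly_bill(calls_per_day: dict[str, int], days: int = 30) -> float:
    """Monthly cost in USD for a daily mix of calls, keyed by model name."""
    daily = sum(
        count * OPUS_CALL_USD * PRICE_RATIO_VS_OPUS[model]
        for model, count in calls_per_day.items()
    )
    return round(daily * days, 2)


# Everything on Opus versus a hypothetical mix matched to task weight.
everything_on_opus = monthly_bill({"opus": 30})
matched_to_weight = monthly_bill({"opus": 5, "sonnet": 15, "haiku": 10})

if __name__ == "__main__":
    print(everything_on_opus, matched_to_weight)
```

Under these assumptions the all-Opus month lands above $200, while the mixed month costs well under half of that. The exact split will differ for every workload; the point is that the mix of calls, not the price list, drives the total.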

Here's the first aha moment.

The bill problem isn't a money problem. It's a distribution problem.

If you avoid Opus to save money, hard tasks fall apart. If you use only Opus, the bill explodes. The answer is in the middle. Match the model to the weight of the work.

So how do you split? It's simpler than it looks. Ask one question:

"Does this task actually need a brain?"

The answer splits into three.

"Yes, a lot" → Opus

Strategy planning. Tasks that weigh many factors at once. Designing the architecture of complex code. Debugging that's stuck for 3 hours. Anything that needs deep, single-pass thinking. Send it to Opus.

"In between" → Sonnet

You know what to do; you just need it done well. Converting a designed spec into code. Writing a report from gathered material. Doing a defined refactor. This is why Claude Code defaults to Sonnet — most work lives here.

"Barely any" → Haiku

Mechanical, repetitive tasks. Lowercasing 100 filenames. Turning a long list into a clean table. Standardizing text formats. Simple classification. Closer to execution than thinking. Haiku does it at roughly one-nineteenth of the Opus price.

Here's the one-liner to remember: Opus = judgment. Sonnet = making. Haiku = repetition. Three words. That's it.
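The three-word rule can be written down as a tiny lookup. This is only a sketch of the idea: the `Weight` enum, the `pick_model` helper, and the example task labels are hypothetical names introduced here, and deciding which weight a task has remains a human call.

```python
from enum import Enum


class Weight(Enum):
    """How much 'brain' a task needs, in the essay's three words."""
    JUDGMENT = "judgment"      # strategy, architecture, stuck debugging
    MAKING = "making"          # well-defined work that must be done well
    REPETITION = "repetition"  # mechanical, repetitive tasks


# The essay's mapping from task weight to model tier.
MODEL_FOR = {
    Weight.JUDGMENT: "opus",
    Weight.MAKING: "sonnet",
    Weight.REPETITION: "haiku",
}


def pick_model(weight: Weight) -> str:
    """Return the model tier the essay recommends for a task of this weight."""
    return MODEL_FOR[weight]


# Example tasks taken from the essay, labeled by hand.
EXAMPLES = {
    "design the architecture of complex code": Weight.JUDGMENT,
    "convert a designed spec into code": Weight.MAKING,
    "lowercase 100 filenames": Weight.REPETITION,
}

if __name__ == "__main__":
    for task, weight in EXAMPLES.items():
        print(f"{task} -> {pick_model(weight)}")
```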

Let's walk through a real example.

Task: I have a 20-page meeting record and need to extract action items.

Old way — hand the whole thing to Opus.

→ "Extract action items from this 20-page meeting record."

Opus reads all 20 pages, understands them, and produces action items. Quality is fine. Cost: around $3.

New way — two steps.

Step 1. Ask Opus for just the plan.

→ "I want to extract action items from this 20-page meeting record. Don't execute — just design the method. Which sections to scan first, what patterns to look for, what format to output."

Opus returns a short answer. "Scan for phrases like 'must do,' 'decided to.' Action items usually cluster at the end of each section, so read those paragraphs first. Output as a 3-column list: owner / action / deadline." Cost: $0.20.

Step 2. Paste that plan to Haiku.

→ "Follow the plan above to extract action items from this 20-page meeting record, formatted as owner / action / deadline."

Haiku executes. Cost: $0.30.

Total: $0.50. One-sixth of the old way. More interesting still, the result is often better, because when Opus only thinks about "how to do it," the thinking sharpens. In the medical analogy, the professor who focuses only on diagnosis and lets others handle intake and blood draws actually diagnoses better.
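The two-step split can be sketched as a small function. To stay independent of any particular SDK, the model call is passed in as a plain function; `ModelCall`, `plan_then_execute`, and the fake model in the demo are hypothetical names for illustration, and the cost constants are simply the figures quoted above.

```python
from typing import Callable

# Hypothetical signature for "send a prompt to a model": (model_name, prompt) -> reply.
ModelCall = Callable[[str, str], str]


def plan_then_execute(call: ModelCall, document: str, goal: str,
                      planner: str = "opus", executor: str = "haiku") -> str:
    """Ask the big model for a plan only, then hand the plan to the small model."""
    plan_prompt = (
        f"I want to {goal}. Don't execute. Just design the method: "
        "which sections to scan first, what patterns to look for, "
        "what format to output.\n\n" + document
    )
    plan = call(planner, plan_prompt)

    execute_prompt = (
        f"Follow the plan below to {goal}.\n\n"
        f"Plan:\n{plan}\n\nDocument:\n{document}"
    )
    return call(executor, execute_prompt)


# Cost figures quoted in the essay, in USD.
OLD_WAY = 3.00
NEW_WAY = round(0.20 + 0.30, 2)
SAVINGS_FACTOR = round(OLD_WAY / NEW_WAY, 1)

if __name__ == "__main__":
    def fake_call(model: str, prompt: str) -> str:
        # Stand-in for a real model call, so the sketch runs offline.
        return f"[{model}] received {len(prompt)} characters"

    print(plan_then_execute(fake_call, "Alice must send the budget by Friday.",
                            "extract action items from this meeting record"))
    print(NEW_WAY, SAVINGS_FACTOR)
```

Passing the call in as a parameter keeps the planning and execution steps easy to test with a stub and lets the same function work with whichever client library is actually in use.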

Now the last piece — how to actually switch models. Also easy.

In Claude Code, specify at launch:

claude --model opus        # start with Opus
claude --model sonnet      # Sonnet (default)
claude --model haiku       # start with Haiku

Or switch mid-session:

/model opus
/model haiku

On Claude.ai web, the model selector is in the top dropdown. Pick Opus / Sonnet / Haiku. You can switch within one conversation easily.

Try it for a week. Just one question running in your head each time: "Is this judgment, making, or repetition?" Awkward at first. Automatic within a few days. And the bill drops fast.

Summary.

Claude has three models. Opus is big, expensive, and smart. Haiku is small, cheap, and fast. Sonnet is in between. Most people run everything on Opus, and the bill explodes. But not every task needs Opus. Match the model to the weight of the work and, as in the meeting-record example, you can get the same outcome at about 1/6 the cost.

Three words: Judgment → Opus. Making → Sonnet. Repetition → Haiku.

And hard tasks: split in two. Ask Opus for the plan. Hand the plan to Haiku for execution. Each one does what it's best at.

Using one tool well matters less than having the sense to divide across many tools. Once you develop this sense, any future AI tool works the same way. The names will change. The principle won't.

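Most people who start using AI tools have a similar experience about a month in. The bill comes in bigger than expected. Why? The answer lies in a surprisingly old principle: the tools were not divided up to match the weight of the work.

This essay explains that principle from start to finish. We will go slowly, so that even readers who have never used Claude can follow to the end. Today's example is Claude's three models, but the principle applies equally to any AI tool, in any era. Even if the name Claude itself has changed three years from now, the backbone of this essay will still hold.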

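Work has weight

It helps to understand the principle first. Work has weight. Polishing a three-line email is light. Drawing up a year's business strategy is heavy. Both look like "writing," but the weight is different.

Tools have weight too. Big tools are good at complex judgment. Small tools handle repetitive work quickly and cheaply. Pairing the two is all there is to using tools well. Heavy work → big tool, light work → small tool.

This principle is nothing new. Companies do not send the CEO to make photocopies. Nobody goes to a university hospital professor to have a cold treated. People have worked this way for hundreds of years. Yet, strangely, this common sense disappears in front of AI. Why? Because people assume AI is a single thing. In fact, AI also comes in several sizes.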

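Example: Claude's three models

To understand the principle, let's take Claude as the example. As of 2026, most other AI services (ChatGPT, Gemini, and so on) have a similar structure.

Whether you open Claude.ai or Claude Code, only one window appears. So "Claude" looks like a single thing. But behind that window there are three separate models. The names are fun, too. All of them come from literary terms.

Opus: Latin for "work," as in a magnum opus. The largest model in the Claude series.
Sonnet: a short 14-line poem. The middle size.
Haiku: a 3-line Japanese short verse. The smallest model.

What does "big" mean here? It means the model contains more numbers (parameters). In human terms, it is like having more synapses in the brain. The more there are, the better the model is at complex reasoning, and the more expensive and slower it is. The weight of the tool comes in three tiers.

A natural question follows. "Why build all three? Why not just use the single best one?" The answer is the one given in the first section: not every task needs the biggest tool. Whatever AI any company builds in the future, this 3-tier structure repeats, because the principle demands it.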

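Analogy: the neighborhood clinic

To make this easy to grasp, think of hospitals. Where do you go when you have a mild cold? The neighborhood internal medicine clinic. You do not go to a university hospital professor. Appointments are hard to get, the wait is long, and the fee is several times higher. But if the clinic doctor judges that "this is a serious problem," they write you a referral to the university hospital. You choose the doctor to match the weight of the problem.

Claude's three models have exactly this structure.

Opus = the university hospital professor. Complex judgment, big designs.
Sonnet = the neighborhood clinic's head doctor. Clear-cut practical work.
Haiku = the nurse. Intake, classification, cleanup.

Giving a light task to Opus is like sending a cold patient to a university hospital professor. It wastes money, and it wastes the professor's time.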

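How big is the price difference?

Let's check with numbers. The prices are as of April 2026 and may change, but the relative gaps should stay at a similar ratio.

Claude charges in a unit called "tokens." There is no need to overthink it: a token is a small chunk of text, roughly three-quarters of an English word. A Korean sentence is usually 30-50 tokens.

The price list, per million tokens:

Model	Input	Output	vs Haiku
Opus	$15	$75	18.75×
Sonnet	$3	$15	3.75×
Haiku	$0.80	$4	baseline

Opus is almost 19 times more expensive than Haiku. Price lists change over time, but the multiple between tiers stays on a similar scale at any AI company (usually 5-20×). The essence of this number is the fact that "the same task can cost more than 10 times as much depending on the tool."

Why does this number lead to the bill? Because most people leave everything running on Opus. Renaming files: Opus. Polishing a three-line email: Opus. Each call is only a few tens of cents, but dozens of them a day add up to more than $200 a month. In the hospital analogy, it is sending colds, vaccinations, and simple blood draws all to the university hospital.

Here comes the first aha moment.

The bill problem is not a money problem. It is a distribution problem.

If you stop using Opus to cut costs, complex work falls apart. If you use Opus for everything, the bill explodes. The answer is in the middle. Dividing tools to match the weight of the work is the core idea, and the basic principle of the whole AI era.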

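How to divide the work: one question

Now let's see how to actually divide the work. It is simpler than you might think. You only need to ask one question.

"Does this task really need a brain?"

The answer falls into three cases.

"Yes, a lot" → Opus

When you are setting strategy, weighing many factors at once to reach a judgment, or designing the structure of complex code. When a "why doesn't this work?" debugging session is still stuck after three hours. When you need to think deeply in one pass. That is when to use Opus.

"In between" → Sonnet

When what needs doing is clear, but it has to be made well. Turning an already designed feature into code, writing a report from organized material, doing a defined refactoring. This is also why Claude Code's default model is Sonnet. In practice, most work falls here.

"Barely any" → Haiku

Mechanical, repetitive work. Lowercasing 100 filenames, tidying a long list into a clean one, unifying text formats, simple classification. These are closer to execution than to thinking. Haiku does them at roughly one-nineteenth of the Opus price.

There is a trick for remembering all three at once. Opus = judgment / Sonnet = making / Haiku = repetition. Just remember these three words.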

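Real example: organizing meeting minutes

Let's try it concretely with a real task. Task: go through 20 pages of meeting minutes and extract the to-dos (action items).

Old way: everything on Opus

→ "Go through this 20-page meeting record and extract the action items."

Opus reads all 20 pages, understands them, and compiles the action items. The result is fine. The cost is about $3.

New way: split into two steps

Step 1. Ask Opus for the plan only.

→ "I want to extract action items from this 20-page meeting record. Don't execute, just tell me the method. Which sections to look at first, what patterns to use to find action items, and how to organize the output."

Opus gives a short answer. "When scanning the record, look first for phrases like 'must do' and 'decided to.' Decisions cluster in the last paragraph of each section, so read those first. Organize the result in three columns: owner, action, deadline." Cost: $0.20.

Step 2. Paste that answer as-is into Haiku.

→ "Following the plan above, go through this 20-page meeting record and extract the action items as owner / action / deadline."

Haiku executes. Cost: $0.30.

Total cost: $0.50. One-sixth of the old way. What is more interesting is that the result is often better, because when Opus concentrates only on "how to do it," its thinking becomes much sharper. Back to the hospital analogy: care improves when the university hospital professor concentrates on seeing patients while others handle intake, blood draws, and directions.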

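How to switch models: the actual commands

Now you need to know how to actually switch models. This is not hard either.

If you use Claude Code, you can specify the model when you start it from the terminal.

claude --model opus        # start with Opus
claude --model sonnet      # Sonnet (default)
claude --model haiku       # start with Haiku

Or, if you are already inside a session, you can switch midway like this.

/model opus
/model haiku

If you use Claude.ai on the web, there is a model selection dropdown at the top of the screen. Choose Opus / Sonnet / Haiku there. Switching back and forth within the same conversation is easy too.

Try working this way for just one week. Ask yourself each time: "Is this task judgment, making, or repetition?" It feels awkward at first, but after a few days it becomes automatic. And you will see the bill drop sharply.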

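Summary

Let's recap what we covered today.

Work has weight. Tools have weight too. Work gets done only when the weights match. I explained this principle using Claude's three models (Opus/Sonnet/Haiku), but it applies equally to every AI tool and to whatever technology comes next. The specific names will change, but the 3-tier structure itself repeats.

Make one question a habit: "Does this task really need a brain?" This one question separates judgment, making, and repetition, and makes your choice of tool automatic. Split hard tasks into two steps. The plan goes to the big tool, the execution to the small tool. Each does only what it does best.

The people who last are not those who use one tool well, but those who have a feel for dividing work across several tools. That feel comes not from AI but from the essence of work: division of labor. Even if Claude is no longer called Claude three years from now, the principle you learned in this essay today will work just the same. Technology changes. The principle does not.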
