AI Workflow VIP 2026-05-27

Plan First, Then Run The Expensive One

Big models cost a lot per run. Instead of asking them to execute right away, ask for the plan first, then the execution. Splitting into two steps cuts cost and lifts quality.

There's a familiar scene when people start a big task with AI. You open the best model, throw the hard job at it, and wait for the result. Most people work this way. Then a month later the bill is bigger than expected, and the output rarely lands on the first try. Why? The answer is a surprisingly simple principle: the more expensive the tool, the more you plan before using it.

This essay explains that principle from start to finish. If you've never used AI before, you can still follow along; we'll go slowly. Today's example is Claude Opus 4.5 making a Three.js media art piece, but the principle applies to any AI tool, in any era. The names will change. The spine of this essay will still hold.

Think about a carpenter. A skilled carpenter always draws the plan before cutting expensive hardwood. Cheap plywood is forgiving: cut it wrong, cut it again. But a walnut board costs hundreds of dollars, and one bad cut comes straight out of your pocket. So first you write the measurements on paper, decide the order, and only then pick up the saw.

AI works the same way. Small models are fast and cheap: run one, and if the result is wrong, run it again. But a big model costs a lot per run and takes a while. So the sensible thing is to plan first, confirm, and only then execute. People have worked this way around expensive resources for centuries.

For some reason, that common sense disappears in front of AI. Why? Because AI looks like magic: one click and the answer appears. You lose the feeling that it's expensive. So you toss in a hard task as a single line and hit enter. The best model spits out 20 pages; you don't like it, so you try again, and then again. That's the moment the bill explodes.

Let me give you a concrete scene. I recently opened Claude's biggest model, Opus 4.5, and tried to build a Three.js media art piece visualizing how the human mind thinks. What's Three.js? It's a JavaScript library that lets you draw 3D graphics in a web browser. Abstract concepts such as memory, emotion, logic, and intuition became floating particles (thousands of tiny points). Hover the mouse over one and the rest dim, like focusing on a single memory. Fairly complex work.

The first prompt I typed was this:

"I want to build a Three.js media art piece. Don't build yet. Tell me the plan first."

One extra line. "Tell me the plan first." That single line changed everything.

Opus didn't spit out a long output. Instead it handed me a short design doc: which skills to use (Algorithmic Art, Frontend Design), which concepts to visualize (300 memories, emotional waves, eureka moments, dream mode), and how to maintain performance (rendering thousands of particles at 60 frames per second). I read it, thought "okay, let's go this way," and then said "build it."

The easiest way to feel this is a hospital. When you go in for major surgery, the surgeon doesn't pick up the scalpel right away. First they show you the CT scan, explain where they'll cut, what recovery looks like, what the risks are. After you say "yes, I understand," only then does the surgery begin. Why? Because it's expensive. And because once you start, you can't undo it.

Treat expensive AI models the same way. Instead of reaching for the scalpel, listen to the explanation first, confirm, then start. In the Three.js media art work, what I saw during the planning step was exactly that CT scan. Opus declared up front that it would use two skills together, "Algorithmic Art" and "Frontend Design," and even wrote down why it would go with Three.js instead of p5.js. If I hadn't read that and nodded, I would only have noticed a wrong direction once we were deep into production. By then, the cost and the hours would already have been spent.

Let me show the difference with numbers. Opus 4.5 hit 80.9% on SWE-bench Verified, a coding benchmark, well ahead of the previous generation. There's a more interesting number. According to Anthropic's own announcement, Opus 4.5 reaches peak performance on complex tasks in just 4 iterations, while competing models needed up to 10 tries.

Here's the first aha moment.

A good model isn't the one that tries more — it's the one that needs fewer tries.

4 versus 10 is a difference of 6 runs. Even those 4 can drop to one or two if you pull out the plan first. Run Opus without a plan and the opposite happens: one run, you're unhappy, so another, then another. The 6-run gap can grow to 12, and the bill grows with it.
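
To make the arithmetic concrete, here is a minimal sketch in Python. The prices are invented for illustration (a full build run at 200 cents, a short plan request at 20 cents) and are not real pricing for any model. Only the run counts, four attempts without a plan versus one or two with a plan, come from the paragraph above.

```python
# Hypothetical prices in cents; real costs depend on the model and the task.
FULL_RUN_CENTS = 200   # assumed cost of one full build attempt
PLAN_CENTS = 20        # assumed cost of one short planning request


def cost_without_plan(full_runs: int) -> int:
    """Total cost when every attempt is a full build."""
    return full_runs * FULL_RUN_CENTS


def cost_with_plan(full_runs: int) -> int:
    """Total cost of one planning request plus the full builds that follow."""
    return PLAN_CENTS + full_runs * FULL_RUN_CENTS


no_plan = cost_without_plan(4)   # four full attempts, no plan
planned = cost_with_plan(2)      # one plan, then two full attempts
print(f"without plan: {no_plan} cents, with plan: {planned} cents")
print(f"saved: {no_plan - planned} cents")
```

Under these assumed prices the plan costs a tenth of a full run, so it pays for itself as soon as it saves a single discarded attempt.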

So how do you apply this? It's simple. Before throwing a big task into the input box, ask yourself one question:

"Is this task small enough to run in one shot?"

If the answer is no, ask for the plan first.

Situation	Old way	New way
Summarize a 20-page report	Run it directly	Ask for the structure first → then run
Build a web app	"Build it"	"Tell me which skills / libraries you'll use"
Write a research paper	"Write it"	"Give me the outline and section arguments"

All three rows are the same move: one call becomes two. And that split cuts cost and raises quality. Why? Because when Opus focuses only on "how to do it," the thinking sharpens. And because a human checks in the middle, you never spend an hour going in the wrong direction.

Now the actual commands. Don't memorize anything — one sentence is enough.

Don't build yet. Give me the plan first.

Put that line in front of any big task. Claude, ChatGPT, Gemini — any AI will respond to it. In Korean it goes:

바로 만들지 말고, 계획부터 알려줘.

Once you have the plan, keep the parts you like and continue with:

Good. Proceed with the plan above.
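
If you call a model from code rather than a chat window, the same two-step move can be sketched as below. This is a minimal illustration, not any vendor's API: `ModelFn`, `plan_then_execute`, `approve`, and `fake_model` are all hypothetical names, and `fake_model` is a stand-in so the example runs without a network call. Only the two prompt sentences come from the text above.

```python
from typing import Callable, Optional

# Hypothetical type: any function that takes a prompt and returns the model's reply.
ModelFn = Callable[[str], str]

PLAN_PREFIX = "Don't build yet. Give me the plan first."
GO_AHEAD = "Good. Proceed with the plan above."


def plan_then_execute(
    model: ModelFn,
    task: str,
    approve: Callable[[str], bool],
) -> Optional[str]:
    """Ask for a plan, let a human approve it, and only then run the full task."""
    # Step 1: the cheap call that returns only a plan.
    plan = model(f"{PLAN_PREFIX}\n\n{task}")

    # Human checkpoint: stop here if the direction looks wrong.
    if not approve(plan):
        return None

    # Step 2: the expensive call, with the approved plan attached.
    return model(f"{task}\n\nPlan:\n{plan}\n\n{GO_AHEAD}")


def fake_model(prompt: str) -> str:
    """Stand-in for a real model so the sketch runs offline."""
    if prompt.startswith(PLAN_PREFIX):
        return "1. Read the headings. 2. Group the findings. 3. Write the summary."
    return "BUILT: " + prompt.splitlines()[0]


task = "Summarize a 20-page report."
result = plan_then_execute(fake_model, task, approve=lambda plan: "1." in plan)
print(result)
```

In real use, `approve` would show the plan to a person and wait for a yes or no. A rejected plan returns `None` before the expensive second call is ever made.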

Try this for just one week. It feels clunky at first — "why two steps instead of one?" But after a few days it becomes automatic. And you'll see the outputs get noticeably better.

Let's wrap up.

Expensive tools need design first. Carpenters, surgeons, architects — people have worked this way for centuries. AI is no exception. A big model costs a lot per run, so before running, pull the plan, confirm, then execute. One call becomes two. That's the whole move.

Three years from now, even if "Opus" stops being called Opus, the principle in this essay still works. Whatever company ships whatever model, the gap between expensive and cheap tools will always exist. And in front of expensive tools, planning always comes first. Tech changes. Principles don't.

Plan first. Confirm next. Execute last.

