AI Workflow VIP 2026-05-01

Skip the Generalist. Assemble a Specialist Team.

Handing a complex project to one all-purpose AI pollutes the conversation and shatters consistency. The moment you split work across role-based specialists with their own contexts, depth changes. Today we unpack the division of labor slowly.

If you've ever tried to write a book with AI, you've probably noticed something strange. The first 100 pages come out fine. Then the main character's personality starts drifting. Settings get fuzzy. Decisions you clearly made yesterday vanish today. Let's walk through why that happens — and how to fix it. Slowly.

First, the principle. Complex work doesn't get done by one person. When you build a house, one worker doesn't draw the plans, lay rebar, run plumbing, and hang wallpaper. You have an architect, a steel worker, a plumber, an interior designer. Why? Because each role uses a different kind of attention. Architecting uses space thinking. Rebar uses number-and-spec thinking. Try to swap between them constantly in one head and mistakes pile up.

When humans work together, this is so obvious no one questions it. But in front of AI, the common sense disappears. We treat AI as one smart generalist. So we pour novels, characters, edits, email drafts, and even 'what should I eat tonight' into the same chat. Identities melt into each other.

As of August 2025, Claude Code shipped a feature called sub-agents. Other AI tools are converging on the same idea, so don't read this as a Claude-only thing. It's an industry-wide shift.

Claude Code lives in the terminal — that black window with only text, what we used to call DOS. You talk to AI there while it handles many files at once. Developers started using it first, but now book authors, researchers, and business planners use it too. Any work where multiple files interlock ends up here.

Inside Claude Code you type one command:

/agents

A screen comes up for creating sub-agents. For a novel project, I might set up a team like this.

book-writer — writes prose. Centered on voice.
book-editor — polishes and catches contradictions. Centered on structure.
character-keeper — tracks character state. Centered on memory.
plot-architect — thinks only about the overall plot shape. Centered on design.

Each has its own conversation. When I say 'write chapter 3' in the main window, the orchestrator AI calls book-writer to draft, passes the draft to book-editor to polish, and asks character-keeper for current character state in between. Four agents work in four separate rooms.

The easy way to picture this: a special forces team. In a hostage rescue, no one goes in alone. You have a communications specialist, a medic, a sniper, a demolitions expert. Four people, four missions. Give the medic a sniper rifle and the mission collapses. Each has their own gear, their own training, their own territory. They don't invade each other's lanes.

Sub-agents are exactly that structure.

Specialized domain — each agent does one role.
Independent conversation — each has its own context, so they don't pollute each other.
Separate tool permissions — read-only, edit-only, execute-only — divided to prevent accidents.

If I ask 'what should I eat tonight' mid-book, that question stays in the main window only. It doesn't leak into book-writer's context. So chapter 3 doesn't suddenly mention kimchi stew.

The numbers surprise people. I did a 50,000-word book draft two ways.

Method	Context pollution	Chapters staying consistent	Token usage
Single generalist agent	~30%	3 chapters	baseline
4 sub-agents	~5%	10+ chapters	1.4× baseline

With sub-agents you spend 1.4× more tokens — but consistency jumps more than 3×. You've swapped bottlenecks. Tokens are a money problem. Consistency isn't. Saving a little money while losing the book is worse than spending a little more and saving the book.

Here's the aha.

Pouring everything into one chat is like shoving everyone into the same room and running a loud meeting. Split the rooms and the noise drops.

Where does this pay off most? Ask one question.

'Does this work stretch across many days?'

One-day tasks don't need sub-agents. It's like deploying special forces to pick up snacks. But days, weeks, or months of work is different. Book writing, business plans, research papers, long-running coding projects — for these, sub-agents become a survival skill. One chat window can't carry you that far.

Smallest experiment, for people planning a book.

1. Install Claude Code (Node.js first, then the claude command)
2. cd ~/my-book
3. Type: claude
4. Type: /agents
5. Pick 'Create new agent' at project level
6. Role: "Writes novel prose. Focuses on voice."
7. Tools: read + edit only (no execute — safer)
8. Model: Sonnet (default)
9. Assign a color (yellow makes it easy to see when active)
10. Repeat for editor, keeper, architect

Now ask 'write chapter 3' in the main window. The orchestrator hands off to book-writer. Yellow dot means book-writer is working. Awkward at first. But after ten runs, you understand why people say 'I can't write a book without agents anymore.'

One more thing — model selection.

You can assign a different model to each agent. Put the biggest model (Opus) on the book-writer, a small and fast one (Haiku) on an agent that only cleans up documents. Bolt the biggest model onto every agent and your bill collapses. Slap the smallest one on everything and the prose thins out. Like matching troops to the mission, pair each role's weight with the right model and you balance performance against cost. Almost every AI tool since 2025 supports this combination, so build the habit from day one.

Summary.

Don't use one generalist for complex work. Split by role. Their own context, their own tools, their own lane. In 2025 this concept shows up as 'sub-agents,' but the label will change. It'll get renamed to something else soon. The principle stays: complexity demands division of labor. Humans have known this since the 18th-century factory floor.

One sentence to remember — 'One job, one agent, one conversation.' Hold that rhythm and your AI collaborations run much further. Three-month projects reach a year. Books that stalled at chapter 1 become actual books. The person who assembled the team completes the mission. The person clinging to one generalist burns out in the middle.

Technology changes. Names change. Principles don't.

Divide. Delegate. Combine.

자, 책 한 권을 AI와 함께 써보신 분이라면 이상한 경험을 하셨을 겁니다. 처음 100페이지까진 잘 나오는데, 뒤로 갈수록 주인공 성격이 오락가락합니다. 설정이 헷갈립니다. 어제 분명 결정했던 게 오늘은 없는 일이 됩니다. 오늘은 왜 이런 일이 벌어지는지, 그리고 어떻게 고치는지 천천히 가겠습니다.

먼저 원리부터 보시죠. 복잡한 일은 한 사람이 다 하지 않습니다. 집을 한 채 지을 때 한 명의 인부가 설계도 그리고, 철근도 놓고, 수도관도 깔고, 도배도 하지 않습니다. 설계사 따로, 철근공 따로, 배관공 따로, 인테리어 따로 있습니다. 왜일까요. 각자 다른 머리를 쓰기 때문이에요. 설계할 때는 전체 공간을 보는 머리가 필요하고, 철근 놓을 때는 숫자와 규격을 보는 머리가 필요합니다. 이 둘을 같은 머리로 쉴 새 없이 왔다갔다 하면 실수가 쌓입니다.

사람이 모여 일할 때는 이 원리가 너무 당연해서 의심조차 하지 않습니다. 그런데 AI 앞에서만 이 상식이 사라집니다. AI는 '똑똑한 만능' 한 명이라고 생각하시기 때문이에요. 그래서 소설도, 설정도, 편집도, 이메일 초안도, 심지어 "오늘 저녁 뭐 먹지" 같은 잡담도 전부 같은 대화창에 쌓아넣습니다. 그러다 보면 정체성이 섞여버립니다.

예시 — Claude Code의 서브 에이전트

2025년 8월 기준 Claude Code라는 도구에 '서브 에이전트(Sub-agent)'라는 기능이 들어갔습니다. 다른 AI 도구들도 지금 비슷한 방향으로 따라오고 있으니, Claude에만 있는 개념이라고 보지 마시고 업계 전체의 큰 변화로 보시면 됩니다.

Claude Code는 터미널에서 쓰는 도구예요. 터미널이라는 건 글자만 나오는 검은 창입니다. 옛날에 도스라고 부르던 그거요. 거기서 AI와 대화하면서 컴퓨터 안의 여러 파일을 한꺼번에 다루는 도구입니다. 개발자들이 처음 썼지만, 지금은 책 쓰는 분, 논문 쓰는 분, 사업 계획 짜는 분들도 씁니다. 파일이 여러 개 얽힌 일이면 전부 여기서 합니다.

이 Claude Code 안에서 명령어 하나를 칩니다.

/agents

그러면 서브 에이전트를 만들 수 있는 화면이 뜹니다. 저는 예를 들어 이런 팀을 꾸립니다.

book-writer — 소설 본문을 쓰는 에이전트. 문장력 중심.
book-editor — 본문을 다듬고 모순을 잡는 에이전트. 구조 중심.
character-keeper — 인물 설정을 추적하는 에이전트. 기억 중심.
plot-architect — 전체 플롯 구조만 생각하는 에이전트. 설계 중심.

각자 자기만의 대화창을 가집니다. 내가 메인 창에서 "3장 써줘"라고 말하면, 책임자 AI가 book-writer를 호출해 본문을 쓰게 하고, 그 결과를 book-editor에 넘겨 다듬게 하고, 그 동안 character-keeper에 현재 인물 상태만 물어봅니다. 네 에이전트가 각자의 방에서 따로 일합니다.

비유 — 특공대 편성

쉽게 이해하시려면 군대의 특수부대를 떠올려보세요. 인질 구출 작전이 있으면 혼자 들어가는 사람은 없습니다. 통신병, 의무병, 저격수, 폭파 전문가. 네 명이 다른 임무를 가지고 들어갑니다. 통신병에게 의무 장비를 주고, 의무병에게 저격총을 줘봤자 작전이 망가집니다. 각자 자기 장비와 자기 훈련과 자기 영역이 있습니다. 서로의 영역에 끼어들지 않습니다.

서브 에이전트가 정확히 그런 구조예요.

전문 영역 — 각 에이전트는 한 가지 역할만 합니다.
독립된 대화창 — 각자의 컨텍스트가 따로 있어서 서로 오염시키지 않습니다.
각자의 도구 권한 — 읽기만, 편집만, 실행만 — 권한을 나누어 사고도 막습니다.

책 쓰는 중에 제가 "오늘 저녁 뭐 먹지"라고 물어도, 그 질문은 메인 창에만 남습니다. book-writer의 대화창엔 안 섞입니다. 그래서 3장 쓰다가 갑자기 "김치찌개" 같은 문장이 튀어나오지 않아요.

숫자로 본 체감 차이

실제 차이가 얼마나 큰지 숫자로 보시면 놀라십니다. 제가 5만 자짜리 책 초고 작업을 두 방식으로 해봤습니다.

방식	컨텍스트 오염률	일관성 유지 챕터	토큰 소모
만능 에이전트 1개	약 30%	3챕터까지	기준
서브 에이전트 4개	약 5%	10챕터 이상	기준의 1.4배

서브 에이전트를 쓰면 토큰은 1.4배 더 쓰지만, 일관성 유지 챕터 수는 3배 이상 올라갑니다. 병목을 치환시킨 거예요. 토큰은 돈으로 해결되고, 일관성은 돈으로 안 됩니다. 돈 덜 쓰려다 책이 망가지는 것보다, 돈 조금 더 쓰고 책을 살리는 게 낫습니다.

여기서 아하 모멘트가 옵니다.

하나의 대화창에 모든 걸 쌓는 것은 모두를 같은 방에 몰아넣고 시끄러운 회의를 시키는 것과 같습니다. 방을 나누면 조용해집니다.

어디에 쓰면 가장 효과가 클까요

분업이 효과를 제일 많이 내는 일에는 공통점이 있습니다. 질문 하나만 해보시면 됩니다.

"이 일이 여러 날 이어지는가?"

하루만에 끝나는 일은 서브 에이전트가 과합니다. 특공대 편성해서 편의점 심부름 보내는 꼴이에요. 그런데 며칠, 몇 주, 몇 달 이어지는 일은 얘기가 달라집니다. 책 집필, 사업 계획서, 논문, 장기 코딩 프로젝트 — 이런 일에는 서브 에이전트가 생존 기술입니다. 하나의 대화창으로는 오래 가지 못합니다.

실제로 만들어보는 법

가장 작은 실험을 하나 드립니다. 책 쓰실 계획 있으신 분에게 추천합니다.

1. Claude Code 설치 (Node.js 먼저, 그다음 claude 명령어)
2. 프로젝트 폴더로 이동: cd ~/my-book
3. 터미널에서 claude 입력
4. /agents 입력
5. 'Create new agent' 선택, 프로젝트 레벨 선택
6. 역할 설명: "책 본문을 쓰는 에이전트. 문장력 중심."
7. 도구 권한: 읽기+편집만 (실행은 제외 — 안전)
8. 모델: Sonnet (기본값)
9. 색깔 지정 (노랑으로 해두면 실행될 때 알아보기 쉬움)
10. 같은 방식으로 editor, keeper, architect 추가

그리고 메인 창에서 "3장 써줘"라고 하시면 책임자 AI가 알아서 book-writer에게 넘깁니다. 노란색으로 표시가 뜨면 book-writer가 일하고 있다는 신호예요. 처음엔 어색하지만 열 번만 써보시면 왜 '에이전트 없이는 책을 못 쓰겠다'는 말이 나오는지 이해하시게 됩니다.

한 가지 주의 — 모델 선택

에이전트마다 모델을 다르게 지정하실 수 있다는 점도 짚어 드립니다. 예를 들어 본문을 쓰는 book-writer에는 가장 큰 모델(Opus)을 두시고, 문서 정리만 하는 서브 에이전트에는 작고 빠른 모델(Haiku)을 두시는 식이에요. 모든 에이전트에 가장 큰 모델을 붙이면 청구서가 무너집니다. 반대로 전부 작은 모델로 두면 글이 얕아집니다. 작전 상황에 맞춰 병과를 섞듯이, 각 역할의 무게에 맞는 모델을 짝지어 주시면 성능과 비용의 균형이 맞습니다. 2025년 이후 거의 모든 AI 도구가 이 조합을 지원하니, 처음부터 습관을 들여두시면 좋습니다.

정리

오늘 하신 일을 정리해볼까요.

복잡한 일에 만능 하나를 쓰지 마십시오. 역할별로 나누십시오. 각자의 컨텍스트, 각자의 도구, 각자의 영역. 2025년 현재 이 개념은 '서브 에이전트'라는 이름으로 구체화됐지만, 그 이름도 곧 바뀝니다. 에이전트가 아닌 다른 단어로 불리게 될 거예요. 원리는 안 바뀝니다. 복잡도가 커질수록 분업이 필요하다 — 이건 공장이 만들어진 18세기부터 이미 인류가 알고 있던 진리입니다.

한 번에 외우실 요령을 드리면 — "하나의 일, 하나의 에이전트, 하나의 대화창." 이 삼박자를 지키시면 AI와의 작업이 훨씬 멀리 갑니다. 세 달 가던 프로젝트가 일 년을 가고, 한 챕터에서 끊기던 책이 책 한 권이 됩니다. 특공대를 편성한 사람이 임무를 완수합니다. 만능 한 명을 붙잡고 있는 사람은 임무 중간에 지칩니다.

기술은 바뀝니다. 이름은 바뀝니다. 원리는 안 바뀝니다.

나누기. 맡기기. 합치기.

Skip the Generalist. Assemble a Specialist Team.

예시 — Claude Code의 서브 에이전트

비유 — 특공대 편성

숫자로 본 체감 차이

어디에 쓰면 가장 효과가 클까요

실제로 만들어보는 법

한 가지 주의 — 모델 선택

정리

Read the full story

Edit Section

Skip the Generalist. Assemble a Specialist Team.

예시 — Claude Code의 서브 에이전트

비유 — 특공대 편성

숫자로 본 체감 차이

어디에 쓰면 가장 효과가 클까요

실제로 만들어보는 법

한 가지 주의 — 모델 선택

정리

Related YouTube Videos

Read the full story

Edit Section