Daily Practice VIP 2026-08-05

Files Remain. Judgment Evaporates.

When you work with AI, the artifacts pile up but the reasoning vanishes. A judgment not recorded is a judgment you will have to make again. Let's unpack this slowly.

Let's start here: after a full day of working with AI, the artifacts definitely pile up. Commits land in the repo, a deploy URL lights up, files are tidy. And yet, a few days later, you open those files and they feel strange. "Why did I write it this way?" That moment comes.

This essay unpacks that hollow feeling. The principle applies whether you work with AI or not, but AI makes it stark. We'll go slowly.

Files are the shadow of the work

Lay the principle flat first. Every work session produces two kinds of output: files and judgments.

Files are visible. essays.ts, learn.js, wave7_shortlist.json. They sit in the commit log and live on the deploy URL. Judgments are invisible. "Why collapse the tags from 34 to 8?" "Why set LIST as the default view?" "Why OpenRouter?" — the reasons behind those decisions.

Both are work products. But they have different lifespans. Files persist as long as the disk does. Judgments, unrecorded, evaporate within hours. Short-term memory drops about 50% after one day and 90% within a week. The reasons alive in your head during a work session decay along that same curve.

Why AI makes this especially visible

When you work with people, the reasoning leaves traces. Meeting notes, email threads, Slack channels. Six months later, "why did we do this?" can be searched and recovered.

With AI, it's different. The conversation disappears the instant you close the ChatGPT window. The repo holds only the end-state files. The three hours of talking, the two reversals on direction, the three alternatives you chose between — all evaporate.

Worse, AI forgets the reasons even faster than people do. In the next session on the same topic, the AI doesn't know why you chose this path last week. You get fuzzy too. Only the files remain. And here's the dangerous part: if you ask "why is this code here," the AI will confidently make up a reason. A new reason, not the original, becomes the record. That is the worst outcome.

The carpenter's house

An analogy. A carpenter builds a house and leaves. Thirty years later a repair worker arrives. A window is in an odd place. Usually big, facing south. This one is small, facing east.

With no "why" on the blueprint, the repair worker can only do two things: patch it as-is, or tear it all down and rebuild. Either way, the carpenter's original intent is lost.

If the blueprint had said, "The east window is small because the 40-year-old persimmon tree catches the morning light, and the owner wanted the first sight on waking to be persimmon blossoms," the repair worker would either widen the window or protect the tree. Recorded judgment makes the next hand move in the same direction.

Evaporation speed, in numbers

Take today's session as an example. I made 7 decisions.

Tag normalization: 34 → 8
/learn page wired to OpenRouter + Claude Haiku 3.5 ($0.007 per query)
LIST/GRID view toggle
Editorial masonry cards, 4 variants
Scene auto-builder fallback in gen-essay-image.py
Skip a Shorts section (the channel has none)
Skip Atlas review-batch (verifier false alarms)

The files are all there. essays.astro, learn.js, gen-essay-image.py, evolution.md. Commits logged, deployed. But of those 7 reasons, how many will a different agent a week from now actually see?

A judgment not recorded is a judgment you will have to make again.

A week later, if a new agent proposes "let's subdivide the tags more," today's 8-tag judgment can't be replayed. Without a line in evolution.md saying "there were 34, collapsed to 8 because 34 was noise," the new agent might push back to 40. The same debate, re-run.

Will my future self remember the reason in a week

Try the question.

Will my future self, one week from now, remember why I decided this?

Most of the time, the honest answer is no. Which means right now you need to park that reason somewhere outside the file. Commit message, PR description, team notion, a note anywhere. The key is somewhere that is not the result file itself.

Why not in the result file? Because result files get overwritten on the next edit. The reasons get wiped along with the code. Park them separately and each lives its own lifespan.

The three-line judgment capture

A simple routine you can use from today. Before the session ends, leave three lines.

[what]    Reduced tag set from 34 to 8.
[why]     Users can't browse 34 categories. 8 is near the cognitive sweet spot; the rest was noise.
[caution] If the next agent suggests subdividing, read this note first.

Three lines is enough. You're not writing a full essay. You're tossing a retrievable fragment so that the you-a-week-from-now (or anyone else) can find it. Three minutes of logging prevents three hours of re-argument.

One more example. A second decision from today:

[what]    Wired /api/learn to OpenRouter using Claude Haiku 3.5.
[why]     Direct Anthropic billing is friction and slightly pricier. OpenRouter at $0.007/query is cheap enough.
[caution] Older Haiku 3 is cheaper but Korean quality drops. 3.5 is the quality/cost sweet spot.

The point isn't to write a manifesto. The point is to make sure the next hand sees this judgment before they overturn it. Overturning is fine. Conditions change and some decisions should be reversed. But when you reverse, reverse against the old judgment, not around it. That difference is the difference between accumulation and regression.

MR.5PM's blog has a file called _SYSTEM/writing/evolution.md. Every time a rule changes, a few lines are added: what changed, why, what it affects. Because of this log, eleven preset evolutions are a stack, not a random walk. Each agent stands on the previous one's judgment.

Tools change. The weight of judgment doesn't.

ChatGPT becomes Claude, Claude becomes the next model. File formats change, cloud providers change. But the fact that work has two axes — files and judgments — does not.

People who only stack files keep solving the same problem. People who record judgments stack layers. Thirty years later, when the repair worker opens that house, if the persimmon-tree blueprint is inside, the house is still alive.

I got that lesson fresh this very session. I dispatched AI agents to write 50 essays, and the output shipped cleanly. But later, when I asked myself "what judgment picked those tags, why is the card-variant ratio 40/30/20/10," the files had no answer. So I wrote this essay and logged seven decisions into evolution.md. Not just the output — the shadow of the output.

Pick one decision from today and leave three lines. Three minutes. Your future self will thank you.

Files remain. Judgment evaporates. Notation is gravity.

자, AI와 하루 종일 일하고 나면 결과물은 확실히 쌓여 있습니다. 깃 저장소에 커밋이 쌓이고, 배포 URL이 찍히고, 파일이 정리되어 있습니다. 그런데 이상하게, 며칠 뒤 그 파일을 열면 낯설어집니다. "왜 내가 이렇게 짰지"라는 순간이 옵니다.

이 글에서는 그 허전함의 정체를 풀어봅니다. AI를 쓰든 안 쓰든 똑같이 적용되는 오래된 원리인데, AI 앞에서 유독 도드라집니다. 천천히 가겠습니다.

파일은 작업의 그림자입니다

먼저 원리부터 눕혀놓고 시작하겠습니다. 작업에는 두 가지 결과물이 있습니다. 하나는 파일이고, 다른 하나는 판단입니다.

파일은 눈에 보입니다. essays.ts, learn.js, wave7_shortlist.json. 커밋 메시지에 남고, 배포 URL로 살아있습니다. 판단은 눈에 안 보입니다. "왜 태그를 34개에서 8개로 줄였는지", "왜 LIST를 기본 뷰로 뒀는지", "왜 OpenRouter를 썼는지" — 이 결정들의 이유입니다.

두 개 모두 작업의 산물입니다. 그런데 수명이 다릅니다. 파일은 디스크에 있는 한 남습니다. 판단은 기록 안 하면 몇 시간 안에 증발합니다. 사람의 단기 기억은 하루 지나면 50%가 떨어지고 일주일이면 90%가 떨어집니다. 작업 중 머리에 있던 이유들의 수명이 그 곡선 위에 있습니다.

왜 AI 앞에선 이 상식이 유독 드러나는가

사람과 일할 땐 판단의 맥락이 자연스럽게 남습니다. 회의록이 있고, 이메일 스레드가 있고, 슬랙 채널이 있습니다. 6개월 뒤 "왜 이렇게 했지?" 물으면 검색해서 꺼낼 수 있습니다.

AI와 일할 땐 다릅니다. 대화는 ChatGPT 창을 닫는 순간 사라집니다. 깃에는 결과 파일만 남습니다. 판단이 오간 3시간의 대화, 두세 번 뒤집은 설계 방향, 세 가지 대안 중에 왜 이걸 골랐는지 — 전부 증발합니다.

더 나쁜 건, AI는 판단 근거를 사람보다 훨씬 더 빨리 잊습니다. 다음 세션에서 같은 주제로 이야기하면, AI는 지난 주에 왜 그걸 골랐는지 모릅니다. 사람도 흐려집니다. 파일만 있습니다. 그리고 파일만 보고 "이 코드가 왜 여기 있지" 물으면, AI는 그럴듯한 이유를 새로 지어냅니다. 원래 이유가 아닌 새 이유가 기록이 되기 시작합니다. 이게 가장 위험합니다.

목수가 짓고 떠난 집

비유 하나 드리겠습니다. 목수가 집을 한 채 짓고 떠납니다. 30년 뒤 수리공이 옵니다. 창문 위치가 이상합니다. 보통은 남향에 크게 내는데, 이 집은 동향에 작게 나 있습니다.

도면에 "왜"가 없으면 수리공은 두 가지만 할 수 있습니다. 그대로 수리하거나, 전부 뜯어내고 다시 짓거나. 둘 다 목수의 원래 의도는 무시됩니다.

만약 도면에 "동쪽 아침 햇살이 40년 된 감나무를 비춰서 집주인이 깨어나면 첫 풍경이 감꽃이길 바랐다"라고 적혀 있다면, 수리공은 창을 더 크게 내거나 감나무를 지킬 겁니다. 판단의 기록은 다음 손이 같은 방향으로 움직이게 합니다.

숫자로 본 증발 속도

이번 세션 하나 예로 들어보겠습니다. 오늘 저는 7가지 결정을 내렸습니다.

태그 34개 → 8개 정규화
/learn 페이지에 OpenRouter + Claude Haiku 3.5 연동 (쿼리당 $0.007)
LIST/GRID 뷰 토글
Editorial 마소너리 카드 4 variant
gen-essay-image.py에 scene auto-builder fallback
쇼츠 섹션 안 만들기 (채널에 없으니)
Atlas review-batch 스킵 (verifier 거짓 경보)

파일은 전부 남았습니다. essays.astro, learn.js, gen-essay-image.py, evolution.md. 깃 로그에 찍혔고 배포됐습니다. 그런데 판단의 이유 7가지 중 몇 개가 1주일 뒤 다른 에이전트에게 보일까요?

기록 안 된 판단은 결국 다시 해야 하는 판단입니다.

1주일 뒤 새 에이전트가 "태그를 더 세분화해볼까?" 제안하면, 오늘의 8개 정규화 판단을 복기할 수가 없습니다. "아, 34개 있었는데 너무 많아서 8개로 모았었다"는 맥락이 evolution.md에 없으면, 그 에이전트는 다시 40개로 늘릴 수도 있습니다. 같은 논쟁을 다시 하게 됩니다.

지금 이 결정, 1주일 뒤의 나도 이유를 기억할까

질문을 한 번 던져보세요.

지금 내가 내린 이 결정은, 1주일 뒤의 나도 이유를 기억할까?

대부분 "아니요"입니다. 그러면 지금 그 이유를 파일 밖 어딘가에 남겨야 합니다. 커밋 메시지, PR 설명, 팀 노션, 블로그, 노트 — 어디든 상관없습니다. 핵심은 결과 파일이 아닌 곳에 판단을 남기는 것입니다.

왜 결과 파일이면 안 되냐면, 결과 파일은 다음 수정에서 덮어써집니다. 판단의 이유까지 함께 지워집니다. 별도 장소에 남겨야 수정과 이유가 각자의 수명을 갖습니다.

판단을 잡는 세 줄 규칙

오늘부터 쓸 수 있는 간단한 루틴입니다. 세션이 끝나기 전에 세 줄만 남깁니다.

[무엇] 태그 34개를 8개로 줄였다.
[왜]   사용자가 브라우즈할 때 8개가 인지 한계. 34개는 소음.
[주의] 다음 에이전트가 세분화 제안하면 이 판단 먼저 확인할 것.

세 줄이면 충분합니다. 지금 판단을 길게 풀어쓰는 게 아니라, 1주일 뒤의 나(또는 다른 누군가)가 검색해서 찾을 수 있는 조각을 던지는 겁니다. 3분 투자로 3시간의 재논쟁을 막습니다.

한 개 더 예를 들어보겠습니다. 두 번째 결정.

[무엇] /api/learn 엔드포인트를 OpenRouter로 연결했다. 모델은 Claude Haiku 3.5.
[왜]   Anthropic 직접 연결은 잔액 관리 번거롭고 약간 더 비쌈. OpenRouter는 0.007달러/쿼리로 충분히 저렴.
[주의] 더 싼 Haiku 3(구버전)도 옵션이지만 한국어 품질 떨어짐. 품질-비용 비로 3.5가 스윗스팟.

중요한 건 "나중에 건드릴 사람이 이 판단을 뒤집기 전에 먼저 보게 하는 것"입니다. 뒤집을 수도 있습니다. 환경이 바뀌면 당연히 뒤집어야 합니다. 다만 뒤집을 때 이 판단을 무시하고 다시 하는 게 아니라, 이 판단을 보고 다시 해야 합니다. 그 차이가 축적과 회귀의 차이입니다.

오후다섯씨 블로그에는 _SYSTEM/writing/evolution.md 라는 기록장이 있습니다. 규칙이 한 번 바뀔 때마다 "무엇을 바꿨고, 왜 바꿨고, 어디에 영향을 주는지" 짧게 남깁니다. 이 기록 덕분에 11번의 preset 진화가 무작위가 아니라 누적이 됩니다. 각 에이전트는 앞사람의 판단 위에 섭니다. 오늘 이 글도 그 기록장에 한 줄이 덧붙습니다 — "2026-04-24, 판단 기록의 원리를 에세이화함."

기술은 바뀝니다. 판단의 무게는 안 바뀝니다.

ChatGPT가 Claude로 바뀌고 Claude가 다음 모델로 바뀝니다. 파일 포맷도 바뀌고, 클라우드 공급자도 바뀝니다. 하지만 작업은 파일과 판단, 두 축으로 이루어진다는 사실은 안 바뀝니다.

파일만 쌓는 사람은 같은 문제를 계속 다시 풉니다. 판단을 기록하는 사람은 층이 쌓입니다. 30년 뒤 수리공이 그 집을 열었을 때, 감나무 도면이 들어있다면 그 집은 계속 살아있는 겁니다.

저도 이번 세션에서 그 교훈을 새로 받았습니다. AI 에이전트에게 에세이 50편을 쓰라고 시켰고, 결과물은 깨끗하게 배포됐습니다. 그런데 나중에 "어떤 판단으로 이 태그를 선택했지", "왜 이 카드 변형 비율이 40/30/20/10이지" 물으니 파일에는 답이 없었습니다. 그래서 이 글을 쓰고 evolution.md에 7개 결정을 등록했습니다. 결과물의 절반이 아니라 결과물의 그림자까지 저장한 것입니다.

오늘 내린 판단 중 하나만 골라, 세 줄로 남겨보세요. 3분이면 됩니다. 1주일 뒤의 당신이 고마워합니다.

파일은 남음. 판단은 증발. 기록이 중력.