AI Workflow VIP 2026-05-31

The Switchboard Operator Returns

Just as 1920s switchboard operators connected calls, AI operators have returned. You request; AI clicks and types. The human role shifts from dialer to director.

How many times a day do you wish something on the internet would "just get done for me"? Hotel booking, food delivery, ticket buying, grocery shopping, restaurant reservations. Different tasks, same structure: open a website, click a button, fill a form, press pay. We do this dozens of times a day. That loop is quietly disappearing. AI has returned as the old switchboard operator.

This essay walks through that shift from start to finish. Even if you've never used an AI agent, you can follow along — we'll go slowly. Today's example is OpenAI Operator, released January 2025. But the principle applies to any tool that comes next. Even if the name changes, the structure of a "middleman returning" does not.

In the 1920s, people were the operators. You picked up the phone and someone answered first. "Which number should I connect you to?" They physically plugged cables to link caller and receiver. A human sat in the middle.

Time passed. Automatic exchanges came. Mobile phones came. We entered the era of dialing our own number. The middleman was replaced by technology. A hundred years later, computing followed the exact same arc. First, you walked into a cafe to order. Then phone orders. Then delivery apps. Now we click buttons on an app screen — directly, with our own fingers.

Strangely, we stopped there. Why are we still pressing buttons one by one? AI supposedly "understands" everything, so why is the hand still mine? Simple answer. AI didn't know how to look at a screen for us. That just got solved.

In January 2025, OpenAI released an AI agent called Operator. Fun name. Operator. The old switchboard workers used the same word. The name itself gives the answer — OpenAI revived "the being that runs things for you."

Here's Operator in action. User types: "Find me a family-friendly campsite near Joshua Tree this weekend." Operator opens Hipcamp on its own. Clicks buttons. Picks dates. Selects options. Brings back results. Only at the final payment does it stop and ask, "Book this one?"

Other demos — reserves a refundable hotel meeting specific conditions, grabs a 7pm table for two on OpenTable, fills a grocery cart on Instacart, picks four basketball tickets under $500 on StubHub. In every case, the user provided one sentence. The rest — AI watched the screen and did the work.

Technically this works because OpenAI shipped a model called CUA (Computer Using Agent). Computer vision reads the screen in real time, figures out which pixels are buttons and input fields, then drives mouse and keyboard directly. No API integration. It sees and clicks like a human. That's the big shift.

Try this analogy — the old neighborhood errand service.

Years ago, services existed where you'd call a local office and say, "Please submit this form at the district office tomorrow morning." The owner would go, pull a number, find the right window, file the papers, and return with a receipt. You stayed home drinking coffee while the task finished.

The AI operator has the same shape.

You = the client. You only say what you need.
Operator = the errand-service owner. Goes out and actually walks around.
Websites = the places the errand service visits.

Old delivery apps were navigation — "here's a map, go find it yourself." Operator is delegation — "I'll go; you wait." Different structure entirely.

Conditions as of January 2025. These will loosen, but right now:

Item	Status
Model	CUA (Computer Using Agent)
Availability	US ChatGPT Pro users only
Price	$200/month (ChatGPT Pro tier)
Stage	Research Preview
Supported sites	OpenTable, Instacart, StubHub, a few others
Before payment	Always confirms with user

ChatGPT Plus costs $20/month. Pro costs 10× more. Why so expensive? Because AI has to watch the screen in real time and reason. Screenshot → analyze → click → take another screenshot → analyze again. This loop runs hundreds of times per task. Computing cost is dozens of times a simple text answer.

Here's the first aha.

The real shift of the agent era isn't the price. It's that the human role moves from dialer to director.

Previously, my finger pressed things. Now AI presses for me. So what do I do? Design what, why, and under what conditions the task should happen. I climb from the one who clicks to the one who instructs.

How do you find the entry point for yourself? Ask one question:

"What do I click and repeat every single day?"

That answer is where Operator will arrive first. Three buckets.

Daily repeat orders — groceries, restaurant reservations, coffee subscription. This goes first.
Information gathering — hotel price comparison, flight search, multi-site quotes. Second wave.
Personal-taste decisions — clothing, gifts, exhibition booking. Needs learning; arrives last.

Count how many of your current tasks fall in bucket one. That's the area about to automate.

A concrete example. Task: book a 4-person Italian restaurant in Yeonnam-dong for this Saturday dinner.

Old way — by hand.

Open Naver Map → search Yeonnam-dong → filter Italian → sort by rating → compare four spots → find booking links → enter date → enter party size → enter phone → receive confirmation. About 20 minutes.

New way — one sentence.

"Book a 4-person table for Saturday 7pm at an Italian restaurant in Yeonnam-dong. Quiet seating, no wait, under ₩50,000 per person."

Operator opens the map, finds matching spots, navigates to the booking page, fills the form, and stops only at the final confirmation — "Book this one?" I see one confirmation window. That's it.

Perceived time: about 2 minutes. 10× faster than the old way. And the bigger deal — during the 18 minutes I'd saved, I can do something else. We've returned to the moment when, while the switchboard operator connected your call, you could already prep the next one.

How to start today. Three options.

1. ChatGPT Pro ($200/month) — Operator directly (US)
2. Claude Computer Use — API-based automation
3. Mech, Zapier, Make — existing automation + AI agent glue

If you're in Korea, either use VPN or wait for local platforms (Baedal Minjok, Coupang, etc.) to bolt their own AI agents on top. Between 2025-2026, expect Korean services to ship similar features.

Do one thing in advance. Practice requesting with precision. "Book dinner" won't work. "Saturday 7pm, Italian in Yeonnam-dong, 4 people, quiet table, under ₩50k/person" will. AI getting smarter doesn't automatically sharpen your request. To work well with an Operator, you have to train the language of a clear director.

So what did we do today?

In the 1920s, switchboard operators existed. In 2025, the AI operator returned. A century later, the middleman is back. The internet and apps briefly invented a "click-it-yourself era" in between, but the structure has snapped back to middleman. Because humans weren't born to click. Humans were born to plan, to decide, to savor.

Keep one question inside you — "What do I click and repeat every day?" That's the first thing to hand off. Clicks to AI, decisions to you. Once this division becomes a body habit, your time widens tenfold.

The product names will change. OpenAI Operator won't be called Operator forever. Claude Computer Use, Gemini Agent — whatever comes next, the principle of the middleman returning stays the same. Back then, switchboard operators connected phone calls. Now AI operators connect the web. Tools change. The middleman doesn't.

Request. Relay. Confirm.

자, 하루에 몇 번이나 인터넷에서 뭔가를 '대신 해줬으면' 하시나요. 호텔 예약, 음식 주문, 티켓 구매, 장보기, 식당 자리 잡기. 다 중요한 일이지만 사실은 비슷한 구조입니다. 웹사이트에 들어가서, 버튼을 클릭하고, 양식을 채우고, 결제를 누르는 것. 우리는 매일 수십 번 이 반복을 하고 있습니다. 그런데 이 반복이 지금 사라지는 중입니다. AI가 옛 전화 교환원처럼 돌아왔기 때문이에요.

이 글에서는 그 변화의 구조를 처음부터 끝까지 설명드립니다. AI 에이전트를 한 번도 안 써보신 분도 따라오실 수 있게 천천히 가겠습니다. 오늘의 예시는 2025년 1월 공개된 OpenAI Operator이지만, 원리는 앞으로 어떤 도구가 나와도 똑같이 적용됩니다. 이름이 Operator가 아니어도, 중계자가 돌아왔다는 구조 자체는 사라지지 않습니다.

1920년대엔 사람이 교환원이었습니다

먼저 원리부터 짚어보시면 좋습니다. 예전에 전화를 걸면 바로 상대에게 이어지지 않았어요. 교환원이 먼저 받았습니다. "몇 번에 연결해 드릴까요?" 수백 개 선을 손으로 꽂아서 발신자와 수신자를 이어주셨죠. 사람이 중계자 역할을 하던 시대였습니다.

시간이 지나며 자동 교환기가 생기고, 휴대폰이 생기고, 우리는 스스로 번호를 누르는 시대로 넘어왔습니다. 중계자가 기술로 대체된 거죠. 100년이 지나서 컴퓨터도 똑같이 따라왔습니다. 처음엔 사람이 카페에 가서 직접 주문했어요. 전화 주문이 생겼고, 그다음엔 배달앱이 생겼습니다. 이제는 앱 화면에서 우리가 직접 버튼을 누릅니다.

그런데 이상하게 이 단계에서 멈춰 있었어요. 왜 우리가 아직도 일일이 버튼을 눌러야 할까요. AI가 모든 것을 '이해'한다고 하는데, 왜 결국 손은 내 손이어야 할까요. 답은 간단합니다. AI가 우리 대신 화면을 볼 줄 몰랐기 때문입니다. 이제 그게 풀렸어요.

예시 — OpenAI Operator

2025년 1월에 OpenAI가 Operator라는 AI 에이전트를 공개했습니다. 이름이 재미있죠. Operator — 운영자, 조작자. 옛 전화 교환원도 똑같은 단어를 썼습니다. 이름에서 이미 답이 보입니다. "우리 대신 뭔가를 운영해주는 존재"를 되살려낸 거예요.

Operator가 어떻게 움직이는지 보시겠습니다. 사용자가 이렇게 씁니다. "이번 주말 조슈아 트리 근처에 가족 캠핑장 찾아줘." 그러면 Operator가 스스로 Hipcamp 같은 캠핑 예약 사이트를 엽니다. 버튼을 클릭하고, 날짜를 고르고, 옵션을 선택하고, 결과를 찾아서 보여줍니다. 마지막 결제 직전에만 사용자에게 물어봐요. "이걸로 예약할까요?"

또 다른 데모에서는 환불 가능한 호텔을 조건에 맞게 예약하고, OpenTable에서 저녁 7시 2인 테이블을 잡고, Instacart에서 식재료를 장바구니에 담고, StubHub에서 500달러 이하 농구 티켓 4장을 고릅니다. 전부 사용자는 문장 하나만 던졌습니다. 나머지는 AI가 화면을 보면서 처리합니다.

기술적으로 이게 어떻게 가능하냐면, OpenAI가 CUA(Computer Using Agent)라는 모델을 붙였기 때문이에요. 컴퓨터 비전 기술로 스크린샷을 실시간으로 보고, 어디가 버튼인지 어디가 입력창인지를 분석해서, 마우스와 키보드로 직접 조작합니다. API 연동 없이 사람처럼 화면을 보고 클릭하는 거예요. 이게 큰 차이입니다.

비유 — 동네 심부름 센터

쉽게 이해하시려면 동네 심부름 센터를 떠올려보세요. 예전에 이삿짐 날라주고, 서류 접수 대신 해주고, 세무서 방문 대신 가주는 서비스가 있었습니다. 사장님한테 전화해서 "내일 오전에 이 서류 구청에 제출 좀 해주세요"라고 하면, 사장님이 혼자 가서 대기 번호 뽑고, 담당 창구 찾고, 서류 내고, 접수증 들고 오십니다. 나는 집에서 커피만 마시고 있어도 일이 됩니다.

AI 오퍼레이터가 정확히 이 구조예요.

나 = 심부름을 의뢰하는 사람. 뭐가 필요한지만 말합니다.
오퍼레이터 = 심부름 센터 사장님. 밖에 나가 실제로 뛰어다닙니다.
웹사이트들 = 심부름 센터가 방문하는 장소들.

옛날 배달앱은 "지도를 드릴 테니 직접 찾아가세요" 하는 내비게이션이었어요. 오퍼레이터는 "제가 갔다 올게요" 하는 대행자입니다. 구조가 다르죠.

구체적 조건 — 가격과 제약

2025년 1월 기준 Operator의 조건입니다. 앞으로 풀리겠지만 지금은 이렇습니다.

항목	조건
사용 모델	CUA (Computer Using Agent)
공개 범위	미국 내 ChatGPT Pro 유저만
가격	월 $200 (ChatGPT Pro 요금)
상태	Research Preview
지원 사이트	OpenTable, Instacart, StubHub 등 일부
결제 직전	반드시 사용자 확인

ChatGPT 일반 요금이 월 $20인데 Pro는 10배입니다. 왜 이렇게 비쌀까요. AI가 화면을 실시간으로 보면서 판단해야 하기 때문입니다. 스크린샷 분석 + 추론 + 클릭 + 다시 분석, 이 과정이 한 작업에 수백 번 반복돼요. 텍스트 답변만 주는 것보다 계산 비용이 수십 배입니다.

여기서 첫 번째 아하 모멘트가 옵니다.

AI 에이전트 시대의 진짜 변화는 가격이 아니라, 사람의 역할이 발신자에서 기획자로 이동한다는 것입니다.

예전엔 내가 손가락으로 눌렀습니다. 이제는 AI가 대신 누릅니다. 그럼 나는 뭘 하냐면 — 무엇을, 왜, 어떤 조건으로 할지를 설계합니다. 클릭하는 사람에서 지시하는 사람으로 올라가는 거예요.

적용 — 질문 하나

그럼 이 변화가 당신에게 지금 어떻게 닿을까요. 질문 하나만 스스로 던져보시면 됩니다.

"내가 매일 반복해서 화면을 클릭하는 일이 뭐지?"

이 답이 Operator가 가장 먼저 덤벼들 영역입니다. 세 가지로 나뉩니다.

매일 반복되는 주문 — 식재료 장보기, 식당 예약, 커피 정기 배송. 여기서부터 풀립니다.
정보를 모으는 작업 — 호텔 가격 비교, 항공권 검색, 여러 사이트 견적 받기. 이건 두 번째 단계예요.
개인 취향이 필요한 결정 — 옷 쇼핑, 선물 고르기, 전시 예약. 학습이 필요해서 마지막 단계에 도달합니다.

지금 당장 내 삶에서 첫 번째에 속하는 일이 몇 개인지 세어보세요. 그게 곧 자동화될 영역입니다.

실제 예제 — 저녁 식사 예약

구체적으로 해보겠습니다. 과제: 이번 주 토요일 저녁에 연남동에서 4인 이탈리안 레스토랑을 예약해야 합니다.

옛날 방식 — 손으로

네이버 지도 열기 → 연남동 검색 → 이탈리안 필터 → 평점순 정렬 → 네 군데 비교 → 각각 예약 링크 찾기 → 날짜 입력 → 인원 입력 → 전화번호 입력 → 확인 문자 받기. 약 20분입니다.

새 방식 — 문장 하나

"이번 주 토요일 저녁 7시에 연남동 이탈리안 레스토랑, 4인, 조용한 자리. 웨이팅 없는 곳으로 예약해줘. 1인당 5만원 이하."

Operator가 스스로 지도 앱을 열고, 조건에 맞는 곳을 찾고, 예약 페이지로 이동하고, 양식을 채우고, 결제 직전에 "이 식당으로 예약할까요?"라고 물어봅니다. 내가 보는 건 마지막 한 번의 확인 창 뿐이에요.

체감 시간 약 2분. 옛날 방식의 10분의 1입니다. 그리고 더 중요한 건 — 그 18분 동안 내가 딴 일을 할 수 있다는 점이에요. 교환원이 연결해주는 동안 나는 다른 전화를 준비하던 그 시절로 돌아간 거죠.

구체적으로 어떻게 시작하나요

지금 쓸 수 있는 방법은 세 가지입니다.

1. ChatGPT Pro ($200/월) — Operator 직접 사용 (미국)
2. Claude Computer Use — API 기반 자동화 구축
3. Mech, Zapier, Make — 기존 자동화 솔루션 + AI 에이전트 연결

한국에 계시면 지금은 VPN으로 우회 접속하시거나, 배달의민족·쿠팡 같은 국내 플랫폼이 자체 AI 에이전트를 붙이실 때까지 기다리시면 됩니다. 2025-2026년 사이에 한국 서비스에도 이런 기능이 붙을 가능성이 큽니다.

그전에 해두실 일은 하나입니다. "구체적으로 요청하는 연습". "저녁 예약해줘"는 안 됩니다. "토요일 저녁 7시 연남동 이탈리안 4인 조용한 자리"라고 해야 합니다. AI가 똑똑해진다고 당신의 요청이 저절로 선명해지는 건 아니에요. Operator와 잘 일하시려면 명확한 기획자의 언어를 익히셔야 합니다.

정리

오늘 하신 일을 정리해볼까요.

1920년대에 전화 교환원이 있었습니다. 2025년에 AI 오퍼레이터가 돌아왔습니다. 100년 간격을 두고 중계자가 다시 등장한 거예요. 그 사이에 인터넷과 앱이 잠시 "직접 누르는 시대"를 만들었지만, 결국 구조는 다시 중계자로 돌아왔습니다. 왜냐하면 사람은 클릭을 위해 태어나지 않았기 때문입니다. 사람은 기획하기 위해, 결정하기 위해, 음미하기 위해 태어났습니다.

질문 하나만 몸에 붙이세요 — "내가 매일 반복해서 클릭하는 일이 뭐지?" 그게 가장 먼저 AI에게 넘길 일입니다. 클릭은 AI에게, 결정은 나에게. 이 분업이 몸에 붙으면 시간이 열배 넓어집니다.

구체적인 제품 이름은 바뀝니다. OpenAI Operator가 영원히 Operator가 아닐 겁니다. Claude Computer Use, Gemini Agent, 어떤 새 이름이 나와도 중계자의 귀환이라는 원리는 그대로 갑니다. 옛날엔 교환원 언니들이 전화를 이어주었습니다. 이제는 AI 오퍼레이터가 웹을 이어줍니다. 기술은 바뀝니다. 중계자는 안 바뀝니다.

요청. 중계. 확인.

The Switchboard Operator Returns

1920년대엔 사람이 교환원이었습니다

예시 — OpenAI Operator

비유 — 동네 심부름 센터

구체적 조건 — 가격과 제약

적용 — 질문 하나

실제 예제 — 저녁 식사 예약

옛날 방식 — 손으로

새 방식 — 문장 하나

구체적으로 어떻게 시작하나요

정리

Read the full story

Edit Section

The Switchboard Operator Returns

1920년대엔 사람이 교환원이었습니다

예시 — OpenAI Operator

비유 — 동네 심부름 센터

구체적 조건 — 가격과 제약

적용 — 질문 하나

실제 예제 — 저녁 식사 예약

옛날 방식 — 손으로

새 방식 — 문장 하나

구체적으로 어떻게 시작하나요

정리

Related YouTube Videos

Read the full story

Edit Section