교육 세션 · 생각하게 하라Learning session · let it think

RULE No.4

생각하게
하라.Let it
think.

체스 선수는 어려운 수 앞에서 더 오래 읽습니다. 오늘은 Claude에게 답 전에 ‘생각할 공간’을 주는 법 — Extended Thinking. 어려울수록, 생각하는 시간이 답을 바꿉니다. A chess player reads longer before a hard move. Today: how to give Claude a ‘thinking space’ before the answer — Extended Thinking. The harder it is, the more thinking time changes the answer.

As of 2026-06 · 파라미터·가격은 바뀝니다 — 메커니즘이 진실. [추론]As of 2026-06 · parameters & prices change — the mechanism is the truth. [inference]

생각 · THINKINGTHINKING

1막 — 즉답 vs 숙고Act 1 — Snap answer vs deliberation

바로 답하게 할까, 생각하게 할까?Answer now, or think first?

> 이 다단계 논리 퍼즐의 답은?

즉답 — 바로 한 줄로 답해 버린다 (자주 틀림)

생각 후 — 단계를 펼쳐 검토하고 답한다 > what's the answer to this multi-step logic puzzle?

Snap — fires one line right away (often wrong)

After thinking — unfolds the steps, then answers

지난 시간엔 Claude를 해부했죠 — 확률로 답을 잇는다고. 그렇다면, 잇기 전에 생각할 공간을 주면?Last time we dissected Claude — it strings the answer by probability. So what if we give it space to think before it strings?

[추론] s03 콜백 — Claude는 토큰을 확률로 잇는 ‘패턴 생성기’. → 답 전에 ‘생각 공간’을 주면 더 긴 추론 사슬을 펼칠 수 있다.[inference] s03 callback — Claude is a ‘pattern generator’ that strings tokens by probability. → Give it a ‘thinking space’ before the answer and it can unfold a longer reasoning chain.

2막 — ‘생각 공간’이란Act 2 — What the ‘thinking space’ is

답을 내기 전에, ‘생각하는 칸’을 연다.It opens a ‘scratchpad’ before answering.

질문Question입력input

→

🧠 생각 블록Thinking block…단계적 추론……step-by-step reasoning…

→

최종 답Final answer출력output

보통은 질문 → 답. Thinking은 그 사이에 ‘생각 칸’을 끼운다.Usually Question → Answer. Thinking inserts a ‘scratchpad’ between them.

[출처: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] Extended thinking은 복잡한 작업에 단계적 추론 능력을 부여하고 그 사고 과정에 부분 투명성을 준다. → 답이 아니라 ‘답에 이르는 과정’을 위한 공간.[source: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] Extended thinking gives complex tasks step-by-step reasoning and partial transparency into that process. → A space for the path to the answer, not the answer itself.

2막 — 왜 더 정확해지나Act 2 — Why it gets more accurate

수를 읽을 시간이 있으면, 더 멀리 본다.Given time to read the board, it sees further.

체스 비유Chess analogy

즉수는 직감, 수읽기는 검토. 어려운 판일수록 더 많은 수를 읽는다. Claude도 중간 단계를 펼쳐 스스로 점검한다.A snap move is instinct, reading is review. Harder boards, more moves read. Claude too unfolds the middle steps and checks itself.

효과가 큰 곳Where it pays off

다단계 추론multi-step reasoning 수학/논리math/logic 복잡한 코딩 설계complex code design

[출처: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] 복잡한 작업(수학적 추론·다단계 도구 사용·복잡한 문제 분해)에서 더 큰 예산이 더 철저한 분석으로 응답 품질을 높일 수 있다. → 복잡할수록 생각 공간의 값어치가 커진다.[source: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] On complex work (math reasoning · multi-step tool use · breaking down hard problems), a larger budget can lift quality through more thorough analysis. → The more complex, the more the thinking space is worth.

2막 — 켜는 법Act 2 — How to turn it on

딱 한 줄 — `thinking` 객체를 켠다.One line — enable the `thinking` object.

thinking: { type: "enabled", budget_tokens: 4000 }

budget_tokens = 추론에 쓸 최대 토큰budget_tokens = max tokens for reasoning budget_tokens < max_tokens (반드시 작아야)budget_tokens < max_tokens (must be smaller)

[출처: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] thinking 객체에 type을 enabled로, budget_tokens를 지정해 켠다. budget_tokens는 max_tokens보다 작아야 한다. → 예산 = 생각의 상한선. (코드는 2026-06 기준 형식, 발표 시점 공식 문서가 진실)[source: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] Set the thinking object's type to enabled and specify budget_tokens. budget_tokens must be smaller than max_tokens. → Budget = the ceiling for thought. (Format is as of 2026-06; the official docs at present time are the truth.)

2막 — 생각에도 값이 붙는다Act 2 — Thinking has a price tag

생각 토큰은 ‘무료’가 아니다 — 출력으로 과금된다.Thinking tokens aren't ‘free’ — they bill as output.

항목Item	사실Fact
생각 토큰 과금Thinking-token billing	전부 출력 토큰으로 청구Charged in full, as output tokens
보이는 양What you see	청구 토큰 ≠ 응답에 보이는 토큰 (요약만 표시)Billed tokens ≠ tokens shown in the reply (summary only)
그래서So	예산을 무한정 키우지 마라Don't grow the budget without limit

[출처: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] “You're charged for the full thinking tokens generated.” 생각 토큰은 전부 출력 토큰으로 과금되며, 청구 토큰 수는 응답에 보이는 토큰 수와 일치하지 않는다(요약 표시). → 비용 = 어려운 문제에만 예산을 배정하는 이유. (2026-06 기준)[source: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] “You're charged for the full thinking tokens generated.” Thinking tokens bill in full as output tokens, and the billed count doesn't match what's shown in the reply (summary). → Cost = why you assign budget only to hard problems. (As of 2026-06)

3막 — 켤 때 / 끌 때Act 3 — When to turn it on / off

어려운 추론엔 켜고, 단순 작업엔 끈다.On for hard reasoning, off for simple work.

✅ 켜라Turn on	⚠️ 오버킬Overkill
다단계 추론Multi-step reasoning	단순 조회Simple lookup
수학/논리 증명Math/logic proofs	포맷 변환Format conversion
복잡한 디버깅Complex debugging	짧은 요약Short summaries
설계 트레이드오프 비교Comparing design trade-offs	정형 반복작업Routine repetitive work

[추론] 경계선은 ‘추론의 깊이’. 한 번에 그럴듯하게 이으면 되는 일엔 생각 예산이 곧 낭비(비용+지연). → 이 한 장이 Rule No.4의 핵심 판단표.[inference] The dividing line is ‘depth of reasoning’. For work you can string plausibly in one pass, a thinking budget is pure waste (cost + latency). → This one page is the core decision table for Rule No.4.

3막 — 얼마나 줄까Act 3 — How much to give

예산은 ‘문제 난이도’에 맞춘다.Match the budget to the problem's difficulty.

난이도에 맞춰match the difficulty

작은 예산Small budget

빠르고 싸다fast and cheap

큰 예산Large budget

더 철저하지만 비싸고 느리다more thorough, but costly and slow

큰 예산이라도 모델이 전부 안 쓸 수 있다. 예산은 의무가 아니라 상한선.Even a large budget may not be fully used. The budget is a ceiling, not an obligation.

[출처: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] 더 큰 예산은 복잡한 문제 품질을 높일 수 있으나, 모델이 예산 전부를 쓰지 않을 수 있다(특히 32k 이상 범위). → 예산은 ‘상한’이지 ‘의무’가 아니다. 작게 시작해 올린다. (2026-06 기준)[source: platform.claude.com/docs/en/docs/build-with-claude/extended-thinking] A larger budget can lift quality on complex problems, but the model may not spend all of it (especially above the 32k range). → Budget is a ‘ceiling’, not an ‘obligation’. Start small, raise it. (As of 2026-06)

3막 — 같은 모델, 다른 모드Act 3 — Same model, two modes

끄면 빠른 직관, 켜면 느린 검토.Off is fast intuition; on is slow review.

Thinking OFF

즉답 · 저비용 · 저지연Snap answer · low cost · low latency

→ 단순 작업 최적→ best for simple work

→

Thinking ON

단계적 추론 · 고비용 · 고지연Step-by-step reasoning · high cost · high latency

→ 복잡 작업 최적→ best for complex work

[추론] Thinking은 모델을 바꾸지 않는다 — 같은 모델에 ‘추론 시간’을 더 줄 뿐. → 그래서 ‘체급(s03 S09: Opus·Sonnet·Haiku)’과 ‘생각 시간’은 별개의 레버. 더 똑똑한 모델이 아니라, 더 오래 생각하는 같은 모델.[inference] Thinking doesn't swap the model — it just gives the same model more ‘reasoning time’. → So ‘weight class (s03 S09: Opus·Sonnet·Haiku)’ and ‘thinking time’ are separate levers. Not a smarter model — the same model thinking longer.

4막 — 켜기 전 4가지 점검Act 4 — 4 checks before you enable it

이 4가지면, 켤지 말지 즉시 판단된다.These 4 settle whether to turn it on.

1
다단계 추론을 요구하나?Does it need multi-step reasoning? → 아니면 끈다→ if not, turn it off
2
예산을 난이도에 맞췄나?Did you match budget to difficulty? → 작게 시작해 올린다→ start small, raise it
3
비용·지연을 감당할 만한가?Can you absorb the cost·latency? → 생각도 과금이다→ thinking is billed too
4
사실 정확성이 핵심인가?Is factual accuracy the crux? → 그럼 검증·근거가 답→ then verify·sources, not thinking

[추론] 4항목은 각각 S04·S08·S06·S11과 1:1 대응. → 메커니즘을 알면 체크리스트가 외워진다.[inference] The 4 items map one-to-one onto S04·S08·S06·S11. → Know the mechanism and the checklist memorizes itself.

4막 — 생각해도 못 푸는 것Act 4 — What thinking can't fix

더 생각해도,
모르는 사실은 모른다.More thinking won't
conjure facts it lacks.

학습에 없는 최신 사실 — 생각해도 못 만든다Recent facts not in training — thinking can't create them 잘못된 전제 — 생각이 오답을 더 정교하게 만든다A wrong premise — thinking just refines the wrong answer 그래서 — 사실은 여전히 Rule No.1(검증)So — facts still fall under Rule No.1 (verify)

[추론] Thinking은 추론을 깊게 할 뿐 새로운 사실을 만들지 못한다 — 틀린 전제는 더 그럴듯하게 틀릴 수도. → 사실 확인 문제엔 Thinking이 아니라 근거·검증(Rule No.1).[inference] Thinking only deepens reasoning — it can't manufacture new facts; a wrong premise can get more plausibly wrong. → For fact-checking, the answer is sources·verification (Rule No.1), not thinking.

4막 — 한 줄로Act 4 — In one line

그래서 — 어려우면 생각시키고, 사실은 검증하라.So — let it think when it's hard, verify the facts.

Thinking 켤까? = [다단계 추론인가?] + [예산 맞췄나?] + [비용 OK?]
  → 켠다
  단, [사실 정확성]은 → 검증으로 Turn thinking on? = [Multi-step reasoning?] + [Budget matched?] + [Cost OK?]
  → on
  But [factual accuracy] → handle by verifying

[추론] Rule No.4(생각하게 하라)와 Rule No.1(결과를 믿지 마라)이 만나는 지점 — 생각은 깊이, 검증은 사실.[inference] Where Rule No.4 (let it think) and Rule No.1 (don't trust the result) meet — thinking owns depth, verification owns facts.

닫으며 — 생각하게 하라Closing — let it think

급할수록,
생각할 시간을 줘라.The harder it is,
the more time to think.

한 단어로: 어려우면 → 생각 공간.In one word: when it's hard → a thinking space.

다음 강 예고 — 지시의 기술: 명령하지 말고 설계하라.Next up — the craft of instruction: don't command, design.

RULE No.4

As of 2026-06 · 파라미터는 바뀌어도 ‘어려운 문제엔 생각 시간’이라는 원리는 남습니다. [추론]As of 2026-06 · parameters change, but the principle — ‘hard problems get thinking time’ — remains. [inference]

RULE No.4

생각하게하라.Let itthink.

바로 답하게 할까, 생각하게 할까?Answer now, or think first?

답을 내기 전에, ‘생각하는 칸’을 연다.It opens a ‘scratchpad’ before answering.

수를 읽을 시간이 있으면, 더 멀리 본다.Given time to read the board, it sees further.

딱 한 줄 — thinking 객체를 켠다.One line — enable the thinking object.

생각 토큰은 ‘무료’가 아니다 — 출력으로 과금된다.Thinking tokens aren't ‘free’ — they bill as output.

어려운 추론엔 켜고, 단순 작업엔 끈다.On for hard reasoning, off for simple work.

예산은 ‘문제 난이도’에 맞춘다.Match the budget to the problem's difficulty.

끄면 빠른 직관, 켜면 느린 검토.Off is fast intuition; on is slow review.

이 4가지면, 켤지 말지 즉시 판단된다.These 4 settle whether to turn it on.

더 생각해도,모르는 사실은 모른다.More thinking won'tconjure facts it lacks.

그래서 — 어려우면 생각시키고, 사실은 검증하라.So — let it think when it's hard, verify the facts.

급할수록,생각할 시간을 줘라.The harder it is,the more time to think.

생각하게
하라.Let it
think.

딱 한 줄 — `thinking` 객체를 켠다.One line — enable the `thinking` object.

더 생각해도,
모르는 사실은 모른다.More thinking won't
conjure facts it lacks.

급할수록,
생각할 시간을 줘라.The harder it is,
the more time to think.