Agentic AI: 단일 LLM에서 에이전트 팀으로

Agentic AI: From Single LLM to Agent Teams

AI의 진화, Agentic AI의 핵심 원리, 그리고 실무 적용

The Evolution of AI, Core Principles of Agentic AI, and Practical Applications

2026-04-16 개념 가이드 Conceptual Guide

Agentic AI 개념 (현재) Agentic AI Concepts (current) MCP 생태계 MCP Ecosystem AI 개발 시스템 AI Dev System

1. AI의 진화: 규칙에서 자율로

1. The Evolution of AI: From Rules to Autonomy

AI는 네 단계를 거쳐 진화했다. 각 단계는 이전 단계의 한계를 극복하며, 점점 더 높은 수준의 자율성을 갖추었다.

AI has evolved through four stages. Each stage overcomes the limitations of the previous one, achieving progressively higher levels of autonomy.

1950s–

규칙 기반

Rule-Based

사람이 모든 규칙을 직접 작성. if-else 분기로 동작

Humans write all rules. Operates via if-else branches

2010s–

머신러닝

Machine Learning

데이터에서 패턴을 학습. 규칙을 직접 쓰지 않아도 됨

Learns patterns from data. No manual rules needed

2022–

LLM

자연어를 이해하고 생성. 범용적이지만 수동적

Understands and generates language. Versatile but passive

2024–

Agentic AI

스스로 계획하고 도구를 쓰고 결과를 검증하는 자율 루프

Autonomous loops that plan, use tools, and verify results

💡

핵심 차이: LLM은 "질문하면 답한다." Agentic AI는 "목표를 주면 스스로 해결한다." 수동적 응답에서 능동적 행동으로의 전환이 핵심이다.

Key difference: LLMs "answer when asked." Agentic AI "solves when given a goal." The shift from passive response to proactive action is the core change.

2. Agentic AI란?

2. What is Agentic AI?

Agentic AI는 LLM에 자율성(autonomy)을 부여한 시스템이다. 단순한 질의응답이 아니라, 목표를 받으면 스스로 계획을 세우고, 도구를 사용하고, 결과를 평가하고, 필요하면 수정하는 완전한 루프를 수행한다.

Agentic AI adds autonomy to LLMs. Instead of simple Q&A, it receives a goal and autonomously plans, uses tools, evaluates results, and iterates — a complete loop.

기존 LLM

Traditional LLM

사람이 프롬프트를 작성
Human writes the prompt
LLM이 한 번에 응답
LLM responds in one shot
결과 검증은 사람 몫
Human verifies the result
틀리면 다시 프롬프트 수정
If wrong, revise and re-prompt
외부 도구 사용 불가
Cannot use external tools

Agentic AI

사람은 목표만 제시
Human provides the goal only
에이전트가 계획 → 실행 → 검증
Agent plans → executes → verifies
자동으로 결과 평가 후 수정
Auto-evaluates and corrects
실패 시 다른 접근법 시도
Tries alternative approaches on failure
파일, DB, API, 브라우저 등 도구 활용
Uses files, DB, APIs, browsers as tools

3. Agentic AI의 네 가지 원리

3. Four Principles of Agentic AI

Agentic AI를 구성하는 네 가지 핵심 능력이 있다. 이 중 하나라도 빠지면 진정한 "에이전트"라 부르기 어렵다.

Four core capabilities define Agentic AI. Missing any one of them means it's not a true "agent."

🎯

계획 (Planning)

Planning

목표를 하위 작업으로 분해하고 실행 순서를 결정한다. "암호화 구현" → "라이브러리 선택 → 키 관리 설계 → 코드 작성 → 테스트"

Decomposes goals into subtasks and determines execution order. "Implement encryption" → "Choose library → Design key management → Write code → Test"

🔧

도구 사용 (Tool Use)

Tool Use

파일 읽기/쓰기, 코드 실행, API 호출, 웹 검색 등 외부 도구를 직접 사용한다. MCP 프로토콜로 500개 이상의 도구에 연결 가능.

Directly uses external tools: file I/O, code execution, API calls, web search. MCP protocol enables connection to 500+ tools.

🔄

자기 검증 (Self-Verification)

Self-Verification

자신의 출력을 스스로 평가하고, 문제가 있으면 수정한다. 테스트 실행, 코드 린트, 결과 비교 등을 자동으로 수행.

Evaluates its own output and corrects issues. Automatically runs tests, linting, and result comparison.

🧠

기억 (Memory)

Memory

세션 간 맥락을 유지한다. 이전에 무엇을 했고, 무엇이 실패했는지 기억해 같은 실수를 반복하지 않는다.

Maintains context across sessions. Remembers what was done, what failed, and avoids repeating the same mistakes.

4. 에이전트는 어떻게 동작하는가?

4. How Does an Agent Work?

대부분의 에이전트는 Observe → Plan → Act → Reflect 루프를 반복한다. 이 루프가 한 번 돌 때마다 목표에 한 걸음씩 가까워진다.

Most agents repeat an Observe → Plan → Act → Reflect loop. Each iteration moves one step closer to the goal.

관찰

Observe

현재 상태 파악
파일, 에러, 환경 확인

Assess current state
Check files, errors, env

계획

Plan

다음 행동 결정
하위 작업 분해

Decide next action
Decompose subtasks

실행

Act

도구 호출, 코드 실행
파일 수정, API 호출

Call tools, execute code
Modify files, call APIs

성찰

Reflect

결과 평가
성공? 다음 단계 / 실패? 수정

Evaluate result
Success? Next step / Fail? Revise

⚠️

무한 루프 방지: 에이전트에는 항상 최대 반복 횟수, 타임아웃, 비용 한도 같은 안전장치가 필요하다. 제한 없는 자율성은 리소스 낭비와 예측 불가능한 행동으로 이어진다.

Preventing infinite loops: Agents always need safety guardrails — max iterations, timeouts, cost limits. Unconstrained autonomy leads to resource waste and unpredictable behavior.

5. 왜 하나가 아닌 여럿인가: 멀티 에이전트

5. Why Multiple, Not One: Multi-Agent Systems

하나의 에이전트가 모든 것을 하는 것보다, 역할을 나눈 여러 에이전트가 협력하는 것이 더 효과적이다. 이유는 세 가지다.

Instead of one agent doing everything, multiple agents with divided roles collaborating is more effective. Three reasons:

이유	Reason	단일 에이전트	Single Agent	멀티 에이전트	Multi-Agent
편향 검증	Bias Check	자기가 쓴 코드를 자기가 리뷰 — 결함 발견율 낮음	Reviews its own code — low defect detection	다른 에이전트가 리뷰 — 독립적 검증	Different agent reviews — independent verification
비용 효율	Cost Efficiency	모든 작업에 최고급 모델 사용	Uses premium model for all tasks	단순 작업은 저비용 모델에 위임	Delegates simple tasks to cheaper models
전문화	Specialization	하나의 프롬프트로 모든 역할 수행	One prompt for all roles	각 에이전트가 특화된 프롬프트와 도구 사용	Each agent uses specialized prompts and tools

대표 패턴

Common Patterns

패턴	Pattern	흐름	Flow	적합한 상황
Plan-Do-Review	Planner → Worker → Reviewer	분해 가능한 작업, 코드 작성	Decomposable tasks, code writing
ReAct	Reason → Act → Observe → repeat	탐색적 작업, 디버깅	Exploratory tasks, debugging
Supervisor	감독자가 하위 에이전트에 작업 분배	Supervisor distributes tasks to sub-agents	대규모 병렬 작업	Large-scale parallel tasks
Debate	두 에이전트가 서로 반박하며 결론 도출	Two agents debate and reach conclusion	의사결정, 분석	Decision-making, analysis

6. 에이전트의 기억 문제

6. The Agent Memory Problem

LLM 에이전트의 가장 큰 약점 중 하나는 세션 간 맥락 소실이다. 대화가 끝나면 모든 것을 잊는다. 이를 해결하는 구조적 접근이 필요하다.

One of the biggest weaknesses of LLM agents is context loss between sessions. Everything is forgotten when the conversation ends. Structural approaches are needed.

기억 유형	Memory Type	범위	Scope	해결 방법	Solution
단기 기억	Short-term	현재 대화 안	Within current conversation	컨텍스트 윈도우 (자동)	Context window (automatic)
작업 기억	Working	현재 작업 진행 상태	Current task progress	TODO 리스트, 체크포인트 파일	TODO lists, checkpoint files
장기 기억	Long-term	세션 간 지속	Persists across sessions	핸드오프 파일, 지식 베이스, 벡터 DB	Handoff files, knowledge base, vector DB
실패 기억	Failure	같은 실수 반복 방지	Prevent repeating mistakes	"하지 말 것" 목록, lessons learned	"Do NOT retry" lists, lessons learned

💡

실패 기억이 가장 중요하다. 에이전트는 같은 실수를 반복하는 경향이 있다. "이 방법은 실패했고, 이유는 X" 같은 기록이 없으면 매 세션마다 같은 삽질을 반복한다.

Failure memory is the most important. Agents tend to repeat the same mistakes. Without records like "this approach failed because X," they waste each session retrying the same dead ends.

7. 실제 적용 분야

7. Real-World Applications

분야	Domain	에이전트가 하는 일	What the Agent Does	예시 도구
소프트웨어 개발	Software Development	코드 작성, 테스트, 디버깅, PR 리뷰를 자율적으로 수행	Autonomously writes code, tests, debugs, reviews PRs	Claude Code, Cursor, Copilot
자율 디버깅	Autonomous Debugging	에러 감지 → 원인 분석 → 코드 수정 → 배포 → 검증을 성공까지 반복	Detect error → analyze root cause → patch → deploy → verify, loops until success	`/autodebug`, `/inspect`
지속적 개선	Continuous Improvement	프로젝트를 스캔해 불필요 코드·중복·성능 병목을 찾고, 개선 Blueprint를 자동 생성	Scans projects for dead weight, redundancy, and perf bottlenecks; auto-generates improvement Blueprints	`/ci`, `/harness`
데이터 분석	Data Analysis	데이터 탐색 → 시각화 → 인사이트 도출을 반복	Iterates: data exploration → visualization → insight extraction	MATLAB, Jupyter, Python
DevOps	DevOps	배포, 모니터링, 인시던트 대응을 자동화	Automates deployment, monitoring, incident response	Docker, AWS, Vercel
연구	Research	논문 검색, 요약, 가설 검증 실험을 자율적으로 실행	Autonomously searches papers, summarizes, runs hypothesis tests	Web Search, arXiv, Python
업무 자동화	Business Automation	이메일 분류, 일정 관리, CRM 업데이트를 에이전트가 처리	Agent handles email triage, scheduling, CRM updates	Gmail, Calendar, Notion, Slack

실전 에이전트 루프 예시: /autodebug

Real Agent Loop Example: /autodebug

Step 1

Audit

13카테고리로 버그 탐지
Tier 2/3

Detect bugs via 13 categories
Tier 2/3

Step 2

Fix

CRITICAL 코드 수정
Claude

Patch CRITICAL code
Claude

Step 3

Deploy

변경 배포 + 빌드 확인

Push changes + verify build

Step 4

Verify

CRITICAL=0? 완료 : Step 1
최대 5라운드

CRITICAL=0? Done : Step 1
Max 5 rounds

실전 에이전트 루프 예시: /ci (Continuous Improvement)

Real Agent Loop Example: /ci (Continuous Improvement)

Step 1

Scan

프로젝트 구조·의존성 분석
Tier 3

Analyze structure + deps
Tier 3

Step 2

Assess

7가지 기준으로 개선점 평가
Tier 2

Evaluate against 7 criteria
Tier 2

Step 3

Report

영향도 순 랭킹 + 요약
Claude

Rank by impact + summarize
Claude

Step 4

Blueprint

/harness로 실행 계획 생성
나중에 적용

Generate plan via /harness
Apply later

✅

핵심: 두 에이전트 모두 3-Tier 라우팅으로 비용을 최소화한다. 무거운 분석은 Gemini/Groq에 위임, Claude는 판단과 코드 수정만 담당. AI 개발 시스템 가이드에서 상세 구조 확인.

Key: Both agents minimize cost via 3-Tier routing. Heavy analysis delegated to Gemini/Groq; Claude handles judgment and code changes only. See AI Dev System Guide for details.

8. 다음 단계

8. Next Steps

Agentic AI를 실제로 구축하려면 두 가지를 더 이해해야 한다:

To actually build Agentic AI, you need to understand two more things:

MCP: 도구 연결

MCP: Tool Connectivity

에이전트가 외부 시스템을 호출하는 표준 프로토콜
Standard protocol for agents to call external systems
하나의 인터페이스로 500+ 도구 연결
One interface to connect 500+ tools
Claude Code에서 로컬 도구 바로 사용
Use local tools directly from Claude Code
MCP 생태계 가이드 →MCP Ecosystem Guide →

LightHarness: 실전 구현

LightHarness: Working Implementation

이 가이드의 개념을 실제로 구현한 오케스트레이터
Orchestrator implementing concepts from this guide
4-Tier 라우팅, Blueprint 설계, 품질 게이트
4-Tier routing, Blueprint design, quality gates
Claude Pro 하나로 동작하는 에이전트 팀
Agent team running on a single Claude Pro
AI 개발 시스템 가이드 →AI Dev System Guide →