10 kịch bản Multi-Agent: Code review, debug, test song song với Claude Code

Tăng 90% độ chính xác với multi-agent workflow trong Claude Code. 10 kịch bản thực tế: từ code review song song đến debug production incident.

Định nghĩa

Subagents (agents) trong Claude Code là các phiên bản AI chuyên biệt chạy song song trong context window riêng biệt, cho phép ủy thác tác vụ cụ thể (review, debug, test) thay vì gánh nặng tất cả trên một agent đa năng duy nhất.

Giải thích chi tiết

Vấn đề của "Monolithic Agent"

Khi yêu cầu một agent đa năng vừa viết code, vừa review, vừa debug, bạn tạo ra hiện tượng "ô nhiễm ngữ cảnh" (context pollution). Quá trình thử-nghiệm khi debug (50 lần đọc file, sai lầm, stack trace) sẽ đẩy kiến trúc hệ thống ra khỏi context window, khiến agent quên mất yêu cầu nghiệp vụ ban đầu. Đây là lý do single agent chỉ giải quyết được khoảng 40% bài toán review thực tế (theo benchmark SWE-bench).

Metaphor "Thần đồng bịt mắt"

LLM agents giống như lập trình viên thiên tài bị bịt mắt và cướp đi terminal. Họ viết code xuất sắc nhưng không thể nhìn stack trace, đọc test output, hay grep codebase. Agent feedback loops là kỹ năng 10x — không phải prompt hay, mà là visibility. Khi agent hallucinate fix, thường là vì không thấy auth middleware ba file xa, không phải vì thiếu kiến thức.

10 vai trò chuyên biệt trong workflow thực tế

Dựa trên nghiên cứu từ SWE-bench và Anthropic, workflow tối ưu tách biệt software engineering thành 10 vai trò có thể ủy thác cho subagents:

Code Reviewer: Dùng semantic context engine (không chỉ diff grep) để truy xuất auth middleware, service dependencies, và "tribal knowledge" (ví dụ: "không log field nhạy cảm loại X trong hệ thống ngân hàng")
Debugger: Interactive step-through với quyền truy cập stack traces, container logs, API responses; tách biệt investigation khỏi implementation
Test Engineer: Sinh verification tests từ human review comments, tạo held-out test suite làm quality gate cho CI/CD
Architect/Planner: Chỉ đọc, không viết — mapping dependencies và technical constraints trước khi chạm code (đặc biệt quan trọng với legacy codebase PHP/Java tại VN)
Security Auditor: Chuyên trách injection vulnerabilities (XSS, SQLi) và auth bypasses, chạy Haiku cho read-only codebase scan trước khi lên production
Feedback Loop Agent: Chạy pytest, eslint, E2E scripts tự động, iterate trên lỗi không cần human copy-paste
Refactoring Agent: Xử lý migrations lớn (Laravel → Node.js, Java 8 → 17, Jest → Vitest) trong worktree riêng
Documentation Agent: Theo dõi API changes và cập nhật docs/CHANGELOG tự động cho team product
Performance Optimizer: Phát hiện N+1 queries, memory leaks qua static analysis + profiling data từ production
Orchestrator: Điều phối multi-agent workflows, tổng hợp kết quả từ các subagent vào quyết định cuối cùng

Kiến trúc Context Isolation

Mỗi subagent spawn với system prompt và tool permissions riêng:

Explore Agent (Haiku): Chỉ đọc, dùng cho codebase navigation nhanh, rẻ
Implement Agent (Sonnet): Quyền write, edit file
Audit Agent (Opus): Phân tích sâu security critical paths

Kết quả từ subagent được nén thành summary 2-5k tokens trước khi trả về parent, loại bỏ "cognitive debris" (intermediate reasoning steps).

Multi-Agent Feedback Loop

Pattern "Reviewer + Builder" đạt 90% accuracy trên SWE-bench so với 80% của single agent. Cơ chế:

Builder agent viết code
Reviewer agent critique trong context window riêng (không bị implementation bias)
Orchestrator tổng hợp và yêu cầu iteration
Test Engineer verify bằng automated tests

Đây là lossy compression — thay vì parent agent chứng kiến toàn bộ quá trình debug (grep → read → edit → test → error), nó chỉ nhận kết luận cuối.

Ví dụ thực tế

Song song Review PR VietQR và viết tính năng hoàn tiền

Bạn là tech lead tại startup fintech ở TP.HCM đang phát triển tính năng VietQR cho app ví điện tử. Truyền thống: phải dừng code để review PR của junior dev về module tạo mã QR. Với subagents:

Spawn Reviewer Agent trong background với context riêng: "Review PR #234 về VietQR generator, check SQL injection và xử lý Unicode trong tên merchant"
Tiếp tục code tính năng "hoàn tiền tự động" qua Momo/ZaloPay trong session chính
Reviewer trả về report: 2 lỗi không sanitize input merchant_name, 1 missing edge case khi VietQR timeout
Fix lỗi bằng cách spawn Refactor Agent thực hiện suggested changes trong worktree riêng
Cả quá trình song song, bạn không bị context-switching giữa business logic hoàn tiền và chi tiết kỹ thuật encode QR

Debug incident thanh toán giờ cao điểm 11/11

Hệ thống e-commerce (giống Shopee/Tiki) báo lỗi 500 trên /api/payments trong đợt flash sale:

Spawn Debugger Agent với quyền truy cập CloudWatch logs (region ap-southeast-1) và database connection (read-only)
Agent đọc stack trace, trace error qua 5 microservices, phát hiện race condition trong Redis cache khi xử lý concurrent VietQR requests
Parent agent (bạn) chỉ nhận báo cáo: "Lỗi ở payment_service.py:142, do concurrent access chưa lock khi query số dư ví. Đề xuất fix: thêm asyncio.Lock() hoặc dùng Redis SETNX"
Không cần tự mò log hay làm pollute context window chính bằng hàng trăm dòng log trace từ hệ thống phức tạp

Migration từ Laravel monolith sang microservices logistics

Công ty logistics (mô hình Grab) đang tách hệ thống đặt xe từ Laravel monolith sang microservices Node.js:

Architect Agent phân tích dependencies, vẽ boundary contexts giữa module "tính giá cước" (tích hợp Google Maps/VietMap) và "tracking tài xế" (WebSocket)
2 Refactoring Agent song song xử lý: một migrate module VietMap integration sang Node.js, một xử lý module SMS Gateway (Twilio/Viettel)
Test Engineer Agent sinh integration tests cho API gateway mới với kịch bản "đặt xe khi tài xế ở vùng kém sóng"
Security Agent audit API boundaries cho potential data leaks (PII của tài xế/khách hàng)
Tổng thời gian giảm 60-70% so với single agent xử lý tuần tự do parallel execution và context isolation.

Audit security cho ngân hàng số trước release

Ngân hàng số cần audit auth flow JWT trước khi lên production:

Security Auditor Agent (dùng Claude Opus) scan codebase Java Spring Boot trong chế độ read-only
Kiểm tra secret rotation, SQL injection trong native query thường thấy ở legacy bank systems
Song song, Performance Agent scan N+1 query patterns trong module báo cáo giao dịch
Cả hai chạy độc lập không block team dev đang fix UI ở branch release/v2.3

Ứng dụng

Developer độc lập (Freelancer VN)

Dùng 3-4 agent đồng thời để "gia tăng bandwidth": trong khi bạn tập trung phát triển business logic cho khách hàng US/EU, Reviewer Agent scan security vulnerabilities, Test Agent viết unit tests coverage. Phù hợp với freelancer Việt Nam làm việc một mình nhưng cần chất lượng deliverable như team lớn.

Tech Lead tại công ty product (Tiki, Shopee VN, Momo)

Ủy thác code review hàng ngày cho Code Reviewer Agent để giảm "approval bottleneck". Agent kiểm tra tribal knowledge (coding standards trong CLAUDE.md, quy định về xử lý PII theo NĐ13/2023/NĐ-CP) và common anti-patterns, chỉ escalate lên bạn khi phát hiện architectural mismatch hoặc security risk thực sự.

Startup fintech Series A (Hà Nội/TP.HCM)

Triển khai Agent Teams cho CI/CD pipeline: mỗi PR tự động trigger parallel analysis (security + performance + compliance với quy định NHNN) trước khi vào tay human reviewer. Benchmark cho thấy coverage tăng từ 3% (manual sampling) lên 100% interactions với consistency score 0.92.

So sánh

Tiêu chí	Single Generalist Agent	Multi-Agent Workflow
Context Window	Bị pollute bởi intermediate steps	Mỗi agent giữ context sạch, chuyên môn hóa
Accuracy (SWE-bench)	~80%	~90% (Reviewer + Builder pattern)
Parallel Processing	Tuần tự, serialized	Song song 3-5 agents, 2-4x nhanh hơn
Chi phí Token	Thấp hơn một chút (1 context)	Cao hơn (N contexts) nhưng hiệu quả hơn do tránh regeneration
Debugging	Khó trace vì "lộn xộn" suy nghĩ	Dễ trace, mỗi agent có vai trò rõ ràng
Phù hợp	Task đơn giản, file đơn lẻ	Task phức tạp, multi-file, cần verification

Lưu ý quan trọng: Agent khác với Skill và MCP. Skill là "khả năng" (prompt + tools), MCP là "cổng kết nối dữ liệu". Agent là "quy trình tự chủ" (autonomous sub-process) có thể sử dụng skills và MCP bên trong context window riêng.