Hiagent: Hierarchical working memory management for solving long-horizon agent tasks with large language model

์ €์ž: Mengkang Hu, Tianxing Chen, Qiguang Chen, Yi Mu, Wenqi Shao, Ping Luo | ๋‚ ์งœ: 2024 | URL: https://arxiv.org/abs/2408.09559 📄 PDF


Essence

Figure 2

Figure 2: An overview of the process of HIAGENT.

HIAGENT๋Š” LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ์˜ ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ๋ฅผ subgoal์„ ์ค‘์‹ฌ์œผ๋กœ ๊ณ„์ธต์ ์œผ๋กœ ๊ด€๋ฆฌํ•˜์—ฌ ์žฅ๊ธฐ ์ž‘์—…์—์„œ ๋งฅ๋ฝ ์ค‘๋ณต์„ฑ์„ ์ค„์ด๊ณ  ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Top right: A commonly adopted paradigm STAN-

How

Figure 2

Figure 2: An overview of the process of HIAGENT.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: HIAGENT๋Š” ์ธ์ง€ ๊ณผํ•™ ์›๋ฆฌ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ LLM ์—์ด์ „ํŠธ์˜ ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ ๋ฌธ์ œ๋ฅผ ์ฐฝ์˜์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋ฉฐ, ์‹คํ—˜ ๊ฒฐ๊ณผ๊ฐ€ ๋›ฐ์–ด๋‚˜๊ณ  ์‹ค์šฉ์  ๊ฐ€์น˜๊ฐ€ ๋†’๋‹ค. ๋‹ค๋งŒ subgoal ์ƒ์„ฑ ๋ฉ”์ปค๋‹ˆ์ฆ˜์˜ ๊ฐ•๊ฑด์„ฑ๊ณผ ๋‹ค์–‘ํ•œ ํ™˜๊ฒฝ์—์„œ์˜ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ์— ๋Œ€ํ•œ ์ถ”๊ฐ€ ๊ฒ€์ฆ์ด ํ•„์š”ํ•˜๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๊ณ„์ธต์  ์—์ด์ „ํŠธ์™€ ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ๋ฅผ ๋‹ค๋ฃจ๋Š” ๊ธฐ๋ณธ ๊ฐœ๋…์„ ์ œ์‹œํ•˜๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
From Human Memory to AI Memory ๋…ผ๋ฌธ์€ ๊ณ„์ธต์ /๋‹จ๊ธฐ-์žฅ๊ธฐ ๊ธฐ์–ต ๋ฉ”์ปค๋‹ˆ์ฆ˜์˜ ์ด๋ก ์„ ์ƒ์„ธํžˆ ๋ถ„์„ํ•˜์—ฌ HIAGENT์˜ ๊ตฌ์กฐ์  ์„ค๊ณ„ ๊ทผ๊ฑฐ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
A-MEM: Agentic Memory for LLM Agents ๋…ผ๋ฌธ์€ LLM ์—์ด์ „ํŠธ์˜ ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ ๋ฉ”์ปค๋‹ˆ์ฆ˜(๊ณ„์ธต์ /์ƒํ™ฉ์  ๊ด€๋ฆฌ)์˜ ์ด๋ก ์  ํ”„๋ ˆ์ž„์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
HIAGENT๋Š” ์„œ๋ธŒ๊ณจ ์ค‘์‹ฌ์˜ ๊ณ„์ธต์  ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ ์ ‘๊ทผ์„ ํ†ตํ•ด ์žฅ๊ธฐ ํ”Œ๋ž˜๋‹๊ณผ ์—์ด์ „ํŠธ ์ผ๊ด€์„ฑ ์—ฐ๊ตฌ์˜ ์ด๋ก ์  ํ† ๋Œ€๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
822๋ฒˆ ๋…ผ๋ฌธ์€ AI agent ์‹ ๋ขฐ์„ฑ ํ‰๊ฐ€์˜ ๊ณผํ•™์  ์ ‘๊ทผ ๋ฐฉ๋ฒ•๋ก ์„ ๋‹ค๋ฃจ๋ฏ€๋กœ, 400๋ฒˆ์—์„œ ๊ณ„์ธต์  ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ๋กœ ์‹ ๋ขฐ๋„๋ฅผ ๋†’์ด๋Š” ์ „๋žต์ด ๊ฐ–๋Š” ์˜์˜์™€ ํ•œ๊ณ„๋ฅผ ์ดํ•ดํ•˜๋Š” ๋ฐ ๊ธฐ์ดˆ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
HuggingGPT ๋…ผ๋ฌธ์ฒ˜๋Ÿผ ์—ฌ๋Ÿฌ AI๋ชจ๋ธ์„ ์ปจํŠธ๋กคํ•˜๋Š” LLM ์—์ด์ „ํŠธ๊ฐ€, ๊ณ„์ธต์  ์ž‘์—…๊ณ„ํš๊ณผ ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ๋ฅผ ์–ด๋–ป๊ฒŒ ์„ค๊ณ„ํ•  ์ˆ˜ ์žˆ๋Š”์ง€ ๋Œ€์กฐ์ ์œผ๋กœ ์กฐ๋งํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
400๋ฒˆ Hiagent๋Š” ์ž‘์—… ๋ถ„ํ•ด ๋ฐ ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ ์ค‘์‹œ์˜ LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ๋กœ, 813๋ฒˆ Toolformer์˜ ๋„๊ตฌ ํ™œ์šฉ ํ•™์Šต๊ณผ ๋‹ค๋ฅธ ์—์ด์ „ํŠธ ์„ค๊ณ„ ์ „๋žต์„ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
400์€ ์—์ด์ „ํŠธ ๊ธฐ๋ฐ˜ ๋Œ€์šฉ๋Ÿ‰ ๋ฌธ์ œ ํ•ด๊ฒฐ์—์„œ ๊ณ„์ธต์  ๋ฉ”๋ชจ๋ฆฌ ๋งค๋‹ˆ์ง€๋จผํŠธ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ ์šฉํ•ด, AnyTool๊ณผ ๋‹ค๋ฅธ ๋ฐฉ์‹์„ ์ œ์•ˆํ•ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
395๋ฒˆ ๋…ผ๋ฌธ์€ ๊ฐ•ํ™”ํ•™์Šต ์—์ด์ „ํŠธ์˜ ์•ˆ์ „์„ฑ ๋ฌธ์ œ๋ฅผ ์ œ์–ด ์žฅ๋ฒฝ ํ•จ์ˆ˜๋กœ ๋‹ค๋ฃจ๊ณ , 400๋ฒˆ์€ ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ ๊ตฌ์กฐ ํ˜์‹ ์— ์ดˆ์ ์„ ๋‘์–ด, ์žฅ๊ธฐ reasoning๊ณผ ์•ˆ์ „์„ฑ ๋‹ค์–‘ํ•œ ๋ฐฉ์‹์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ์‚ดํ•„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Hiagent(400)์€ ์‹คํ—˜์‹ค ์›Œํฌํ”Œ๋กœ์šฐ๋ฅผ ์ž๋™ํ™”ํ•˜๋Š” ์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์œผ๋กœ, 432์˜ ์‹ค์‹œ๊ฐ„ ์‹คํ—˜ ์ž๋™ํ™”์™€ ์ƒํ˜ธ๋ณด์™„์  ์ ‘๊ทผ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Tree-planner ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ๊ฐ€ ๊ณ„์ธต์ /ํ๋ฃจํ”„์  ํ”Œ๋ž˜๋‹์„ ์ˆ˜ํ–‰ํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋กœ, HIAGENT์˜ ์žฅ๊ธฐ ๋งฅ๋ฝ ๊ด€๋ฆฌ ๊ตฌ์กฐ๋ฅผ ์‹ค์ œ ํ”Œ๋ž˜๋‹์œผ๋กœ ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
400์€ ์—์ด์ „ํŠธ์˜ ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ์™€ ๋ฉ€ํ‹ฐ์Šคํ… ๋ฌธ์ œ ํ•ด๊ฒฐ์„ ๊ฐ•์กฐํ•˜์—ฌ 061์˜ ๋ฉ”๋ชจ๋ฆฌ/๊ณ„ํš ์‹œ์Šคํ…œ์„ ๋‹ค์–‘ํ•œ ๋ฌธ์ œ์— ์ ์šฉํ•˜๋Š” ํ™•์žฅ์  ์˜ˆ์‹œ๋ฅผ ์ œ๊ณตํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
400์€ ๊ณ„์ธต์  ์ž‘์—… ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ๋กœ 180์˜ foundation model memory ๋ฌธ์ œ๋ฅผ ํ™•์žฅ ์—ฐ๊ตฌํ•œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์˜๋ฃŒ/๋ณ‘๋ฆฌํ•™ ๋ถ„์•ผ ๋‚ด LLM ๋ฐ AI ๋„๊ตฌ์˜ ๊ณ„์ธต์  ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ, ์ง„๋‹จ ๋ณด์กฐํ”„๋ ˆ์ž„์›Œํฌ๊ฐ€ human pathology copilot์˜ ํ™•์žฅ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค€๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
๊ณ„์ธต์  ์ž‘์—…๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ ๊ธฐ๋ฒ• ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด, A-MEM์˜ ๋™์  ๋ฉ”๋ชจ๋ฆฌ ์ง„ํ™” ์ „๋žต์„ ์‹ฌํ™”ยทํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
HIAGENT๋Š” LLM ๊ธฐ๋ฐ˜ ์—์ด์ „ํŠธ์˜ ์ž‘๋™ ๋ฉ”๋ชจ๋ฆฌ ๊ตฌ์กฐ ๊ตฌ์ฒด์  ๊ตฌํ˜„ ์‚ฌ๋ก€๋กœ, AI Memory Survey์˜ ์ด๋ก ์„ ์‹ค์ œ๋กœ ์—ฐ๊ฒฐํ•ฉ๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •