Building machines that learn and think with people

์ €์ž: Katherine M. Collins, Ilia Sucholutsky, Umang Bhatt, Kartik Chandra, Lionel Wong, Mina Lee, Cedegao E. Zhang, Tan Zhi-Xuan, Mark Ho, Vikash Mansinghka, Adrian Weller, Joshua B. Tenenbaum, Thomas L. Griffiths | ๋‚ ์งœ: 2024-10 | DOI: 10.1038/s41562-024-01991-9 📄 PDF


Essence

์ธ๊ณต์ง€๋Šฅ์ด ๋‹จ์ˆœํ•œ ์ƒ๊ฐ์˜ ๋„๊ตฌ๋ฅผ ๋„˜์–ด ์ธ๊ฐ„๊ณผ ํ•จ๊ป˜ ์‚ฌ๊ณ ํ•˜๋Š” ํŒŒํŠธ๋„ˆ(์‚ฌ๊ณ  ํŒŒํŠธ๋„ˆ, thought partner)๋กœ ๋ฐœ์ „ํ•ด์•ผ ํ•œ๋‹ค๋Š” ๊ด€์ ์—์„œ, ํ˜‘๋ ฅ์  ์ธ์ง€(collaborative cognition)์˜ ์›๋ฆฌ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์„ค๊ณ„๋œ AI ์‹œ์Šคํ…œ์˜ ํ•„์š”์„ฑ๊ณผ ๊ตฌํ˜„ ๋ฐฉ์•ˆ์„ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

  1. ํ˜‘๋ ฅ์  ์‚ฌ๊ณ ์˜ ๋ชจ๋“œ ์ฒด๊ณ„ํ™”: ํ˜‘๋ ฅ์  ๊ณ„ํš(collaborative planning), ํ˜‘๋ ฅ์  ํ•™์Šต(collaborative learning), ํ˜‘๋ ฅ์  ์ˆ™์˜(collaborative deliberation), ํ˜‘๋ ฅ์  ์˜๋ฏธ ํŒŒ์•…(collaborative sense-making), ํ˜‘๋ ฅ์  ์ฐฝ์ž‘(collaborative creation)์˜ 5๊ฐ€์ง€ ์ฃผ์š” ๋ชจ๋“œ๋ฅผ ์ •์˜ํ•˜๊ณ , ๊ฐ ๋ชจ๋“œ๋ณ„ ํ•ต์‹ฌ ๊ณผ์ œ(์˜ˆ: ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š” ๋ชฉํ‘œ ์ถ”๋ก , ๊ฐœ์ธํ™”๋œ ํ•™์Šต ์†๋„, ๊ฒ€์ฆ ๊ฐ€๋Šฅํ•œ ์ถ”๋ก )๋ฅผ ๋ช…ํ™•ํžˆ ์ œ์‹œ
  2. ๋‹ค์˜์—ญ ์‹ค์ œ ์‚ฌ๋ก€ ๋ถ„์„: ํ”„๋กœ๊ทธ๋ž˜๋ฐ ์–ด์‹œ์Šคํ„ดํŠธ, ๊ตฌ์ฒดํ™”๋œ ๋ณด์กฐ ๋กœ๋ด‡(embodied assistive robots), ์ฐฝ์ž‘ ์ง€์›(storytelling), ์˜๋ฃŒ ์ง„๋‹จ ๋“ฑ 4๊ฐœ ๋„๋ฉ”์ธ์—์„œ ํ˜„์žฌ ๊ธฐ์ˆ ์˜ ํ•œ๊ณ„์™€ ์‚ฌ๊ณ  ํŒŒํŠธ๋„ˆ์‹ญ์˜ ์š”๊ตฌ์‚ฌํ•ญ์„ ๊ตฌ์ฒด์ ์œผ๋กœ ๋ถ„์„
  3. ์„ค๊ณ„ ์›์น™(desiderata) ์ œ์•ˆ: ํšจ๊ณผ์ ์ธ ์ธ๊ฐ„-ํ˜ธํ™˜ ์‚ฌ๊ณ  ํŒŒํŠธ๋„ˆ์‹ญ์„ ์œ„ํ•œ 3๊ฐ€์ง€ ํ•„์ˆ˜ ์š”๊ฑด ์ œ์‹œ:
    • ์ธ๊ฐ„์„ ์ดํ•ดํ•  ์ˆ˜ ์žˆ๋Š” ๋Šฅ๋ ฅ(understand us)
    • ์ธ๊ฐ„์ด ์ดํ•ด ๊ฐ€๋Šฅํ•œ ์„ค๋ช…์„ฑ(we can understand)
    • ๊ณตํ†ต์˜ ๊ธฐ๋ฐ˜์ด ๋˜๋Š” ์„ธ๊ณ„ ์ดํ•ด(understanding of world)

How

Fig. 2 | Case study depictions

WatChat์ด ์‚ฌ์šฉ์ž์˜ ์˜ค๋ฅ˜๊ฐ€ ์žˆ๋Š” ์ •์‹  ๋ชจ๋ธ์„ ์ถ”๋ก ํ•˜๋Š” ์‚ฌ๋ก€

Originality

Limitation & Further Study

Evaluation

์ดํ‰: ์ด ๋…ผ๋ฌธ์€ ์Šค์ผ€์ผ๋ง ์ค‘์‹ฌ์˜ ํ˜„๋Œ€ AI ๊ฐœ๋ฐœ ํŒจ๋Ÿฌ๋‹ค์ž„์— ๋Œ€ํ•œ ์ค‘์š”ํ•œ ์„ฑ์ฐฐ์„ ์ œ์‹œํ•˜๋ฉฐ, ์ธ๊ฐ„๊ณผ ๊ธฐ๊ณ„๊ฐ€ ์ง„์ •ํ•œ ํ˜‘๋ ฅ์ž๋กœ์„œ ํ•จ๊ป˜ ์‚ฌ๊ณ ํ•  ์ˆ˜ ์žˆ๋Š” ์‹œ์Šคํ…œ์˜ ์„ค๊ณ„ ์›์น™์„ ์ œ์•ˆํ•œ๋‹ค๋Š” ์ ์—์„œ ํ•™์ˆ ์ ยท์‹ค๋ฌด์  ๊ฐ€์น˜๊ฐ€ ๋†’๋‹ค. ๋‹ค๋งŒ ์ œ์‹œ๋œ ๋ฒ ์ด์ง€์•ˆ ๊ตฌ์กฐ์  ์ ‘๊ทผ์˜ ๊ตฌ์ฒด์  ๊ตฌํ˜„๊ณผ ์‹ค์ œ ๋„๋ฉ”์ธ์—์„œ์˜ ์„ฑ๋Šฅ ๊ฒ€์ฆ์ด ํ›„์† ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด ๋ณด์ถฉ๋˜์–ด์•ผ ํ•  ๊ฒƒ์œผ๋กœ ๋ณด์ธ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Building machines that learn and think with people ๋…ผ๋ฌธ์€ ํ˜‘๋ ฅ์  ์ธ์ง€ ๊ธฐ๋ฐ˜ ์„ค๊ณ„ ์›๋ฆฌ๋ฅผ AutoGen์˜ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ํ˜‘์—… ํ”„๋ ˆ์ž„์›Œํฌ์— ์ด๋ก ์ ์œผ๋กœ ์ œ๊ณตํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Building machines that learn and think with people ๋…ผ๋ฌธ์€ ์ธ๊ฐ„-๊ธฐ๊ณ„ ํ˜‘์—…์ด ์‹คํ—˜ยท๋ฐœ๊ฒฌ ์ž๋™ํ™”์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์ด๋ก ์ ์œผ๋กœ ๋ถ„์„ํ•˜์—ฌ, AI ๊ธฐ๋ฐ˜ ํ˜„๋ฏธ๊ฒฝ ์‹คํ—˜์‹ค ๊ตฌ์ถ•์˜ ๊ธฐ์ˆ ์ ยท์ธ๊ฐ„์  ํ•œ๊ณ„๋ฅผ ๊ณ ์ฐฐํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
์ธ๊ฐ„๊ณผ AI๊ฐ€ ํ•จ๊ป˜ ๋ฐฐ์šฐ๊ณ  ์‚ฌ๊ณ ํ•˜๋Š” ์ธ์ง€์  ๋ฉ”์ปค๋‹ˆ์ฆ˜(ํ˜‘๋™์ง€๋Šฅ)์— ๋Œ€ํ•œ ๊ทผ๋ณธ ์ด๋ก ์  ํ† ๋Œ€ ์—ญํ• ์„ ํ•œ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
AI์™€ ์ธ๊ฐ„์ด ํ•จ๊ป˜ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ์ธ๊ฐ„-AI ํ˜‘๋ ฅ์  ์ธ๊ณต์ง€๋Šฅ(co-learning)์˜ ์ด๋ก ์ด ์ „์ œ๋œ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ธ๊ฐ„๊ณผ AI์˜ ํ˜‘๋ ฅ์  ์ธ์ง€์™€ ์‚ฌ๊ณ  ํŒŒํŠธ๋„ˆ์‹ญ ์‹คํ˜„์„ ์œ„ํ•œ ์‹œ์Šคํ…œ ์„ค๊ณ„๋ฅผ ์—ฐ๊ตฌํ•˜๋Š” ์œ ์‚ฌํ•œ ๋…ผ๋ฌธ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ธ๊ฐ„ ์ค‘์‹ฌ์  AI ์„ค๊ณ„์™€ ํ˜‘๋ ฅ์  ์ธ์ง€ ์‹œ์Šคํ…œ์˜ ๊ตฌํ˜„ ์›๋ฆฌ๋ฅผ ํƒ๊ตฌํ•˜๋Š” ์œ ์‚ฌํ•œ ์—ฐ๊ตฌ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
ํ˜‘๋ ฅ์  AI ์‹œ์Šคํ…œ ์„ค๊ณ„์—์„œ ์ธ๊ฐ„ ์ธ์ง€ ์›๋ฆฌ๋ฅผ ์ ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก ์„ ์—ฐ๊ตฌํ•˜๋Š” ์œ ์‚ฌํ•œ ๋…ผ๋ฌธ์ด๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
์ธ๊ฐ„๊ณผ AI์˜ ๊ณต๋™ ์‚ฌ๊ณ  ๋ฐ ํ˜‘๋ ฅ์  ๋ฌธ์ œ ํ•ด๊ฒฐ ๋Šฅ๋ ฅ ํ–ฅ์ƒ์„ ์œ„ํ•œ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•˜๋Š” ๊ด€๋ จ ์—ฐ๊ตฌ์ด๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Exploring collaboration mechanisms ๋…ผ๋ฌธ์€ LLM ์—์ด์ „ํŠธ ๊ฐ„ ํ˜‘์—… ๋ฐ ์‚ฌํšŒ์  ๊ธฐ์ž‘(mechanism)์„ ๋ถ„์„ํ•˜๋ฉฐ, ์ธ๊ฐ„-AI ํ˜‘๋ ฅ์  ์ธ์ง€ ์„ค๊ณ„์— ์‹ค์šฉ์  ํ†ต์ฐฐ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
์ธ๊ฐ„-AI ํŒ€ ๊ตฌ์„ฑ์˜ ํšจ๊ณผ์™€ ์‹ค์ œ ๊ตฌํ˜„ ์‚ฌ๋ก€๋ฅผ ์ถ”๊ฐ€๋กœ ์ œ๊ณตํ•ด ํ˜‘๋ ฅ์  ์ธ์ง€ ์—ฐ๊ตฌ์— ๊นŠ์ด๋ฅผ ๋”ํ•ฉ๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
From Labor to Collaboration ๋…ผ๋ฌธ์€ ์ธ๊ฐ„-LLM ํ˜‘๋™ ์ž‘์—…์˜ ๋ฐฉ๋ฒ•๋ก ์  ์‹คํ—˜์„ ํ†ตํ•ด ํ˜‘๋ ฅ์  ์ธ์ง€์˜ ์‹ค์งˆ์  ํŒŒ๊ธ‰ ํšจ๊ณผ๋ฅผ ํƒ์ƒ‰ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
Many Heads Are Better Than One ๋…ผ๋ฌธ์€ ํ˜‘๋ ฅ์  ์•„์ด๋””์–ด ์ƒ์„ฑ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ LLM์— ์ ์šฉํ•œ ์‹ค์ œ ์‚ฌ๋ก€๋กœ, ์‚ฌ๊ณ  ํŒŒํŠธ๋„ˆ๋กœ์„œ ํ˜‘๋ ฅ์  AI ์—ฐ๊ตฌ์™€ ์ง์ ‘์ ์œผ๋กœ ์—ฐ๊ฒฐ๋ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
Prototypical human-ai collaboration behaviors ๋…ผ๋ฌธ์€ LLM ๊ธฐ๋ฐ˜ ํ•™์ˆ  ํ˜‘์—…์˜ ์‹ค์ œ์  ํ–‰๋™ ํŒจํ„ด์„ ๋ถ„์„ํ•˜์—ฌ, ์ธ๊ฐ„-AI ํ˜‘๋ ฅ์  ์‚ฌ๊ณ  ์‹œ์Šคํ…œ์ด ์‚ฌํšŒ์ ์œผ๋กœ ์–ด๋–ป๊ฒŒ ๊ตฌํ˜„๋˜๋Š”์ง€ ์‚ฌ๋ก€๋ฅผ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
โ€˜AI-์ธ๊ฐ„ ํ˜‘๋ ฅ ๊ธฐ๋ฐ˜ ๊ณผํ•™์—ฐ๊ตฌโ€™์˜ ์ฒ ํ•™์  ๋…ผ์˜๊ฐ€ ์‹ค์ œ๊ณผํ•™ ํ˜์‹  ๊ตฌํ˜„ ๋…ผ๋ฌธ(824)์— ์ ์šฉ๋˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •