A survey of large language models

Authors: Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Yang Chen, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu | Date: 2023 | URL: https://arxiv.org/abs/2303.18223


Essence

Figure 2

Fig. 2: An evolution process of the four generations of language models (LM) from the perspective of task solving capaci

A comprehensive survey paper that systematically traces the development of large language models (LLMs), from statistical language models through neural language models and pre-trained language models to today's generative large-scale models.

Motivation

Achievement

Figure 1

Fig. 1: The trends of the cumulative numbers of arXiv papers that contain the keyphrases “language model” (since June 20

How

Figure 3

Fig. 3: A timeline of representative LLMs released in recent years. Models with publicly available checkpoints are

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 4/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

Overall assessment: This survey is a very timely, comprehensive resource that systematically organizes the development history and core technologies of large language models. It is an essential reference for both researchers and practitioners who want to understand the current state of LLMs.

Related papers worth reading

Foundational work
387, the GPT-4 Technical Report, can serve as evidence for the latest LLM advances mentioned in 026.
Foundational work
Surveys the history of LLM development from statistical models onward, along with generative models in general, and supports the discussion of reasoning ability in foundation models.
Foundational work
A survey covering the evaluation of LLMs in general; it provides historical and theoretical background for discussions of core-capability frameworks.
Foundational work
A study that provides the retrieval and generation methodology underlying RAG.
Foundational work
The paper A Survey of Large Language Models provides comprehensive theoretical background on LLM techniques and how they can be applied.
Foundational work
026 covers recent performance evaluations and comparisons of large language models and agents, which helps in analyzing the current state of the machine learning and language model strategies integrated in 3274.
Alternative approach
A similar survey study that analyzes the capabilities and limitations of large language models.
Alternative approach
A related survey covering the development of generative large language models.
Alternative approach
A similar approach that mitigates LLM bias without access to model parameters.
Alternative approach
Focuses on surveying LLMs for scientific domains, which makes it easy to compare domain-specific LLM development against 026's survey of general-purpose LLM history.
Alternative approach
A related study covering pre-training and fine-tuning methodology for language models.
Alternative approach
The paper A survey of large language models covers the overall history of LLM development and can be used as a complement to NLP-specific applications.
Alternative approach
A similar survey or framework study covering AI agent systems for automating scientific research.
Follow-up work
Comprehensively organizes the history of reasoning ability, methodology, and benchmarks in LLMs, narrowing the focus of 026's LLM survey and enabling a deeper understanding.
Follow-up work
475 is also a comprehensive survey of LLMs; it extends 026's historical summary with a closer look at the role and performance of LLMs within NLP.
Follow-up work
377 covers the overall trajectory of the generative AI and foundation model era, adding recent trends to 026's LLM survey framework.
Application example
Covers the Gemini family, presenting a concrete case of the architecture and performance progress of a practical LLM system, which ties in with the large language model survey.
Application example
Provides examples of how generative LLMs are applied in actual research, such as discovering research ideas, bridging theory and practice.