Crosslingual capabilities and knowledge barriers in multilingual large language models

Authors: Lynn Chua, Badih Ghazi, Yangsibo Huang, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chulin Xie, Chiyuan Zhang | Date: 2024 | DOI: arXiv:2406.16135


Essence

Figure 1

Figure 1: Multilingual LLMs show strong crosslingual ability on explicit tasks such as machine translation, but fail to close the gap between languages on knowledge-intensive tasks that rely on knowledge stored implicitly in the model weights.

This paper is the first to systematically characterize the "crosslingual knowledge barrier": multilingual large language models (LLMs) perform well on explicit crosslingual tasks (machine translation) but suffer severe performance degradation when parametric knowledge has to be used implicitly across languages.

Motivation

Achievement

Figure 2

Figure 2: Visualization showing that embeddings of English text and of its mixed-language translations are better aligned than the baseline.

  1. Dual nature of crosslingual ability: Across 15 models, 16 languages, and 3 datasets, the paper shows that LLMs have strong explicit crosslingual ability in machine translation (COMET scores around 87, competitive with Google Translate) and in embedding similarity, yet suffer a marked performance drop at the stage where knowledge actually has to be used.
  2. Systematic identification of the crosslingual knowledge barrier: On the MMLU benchmark (general knowledge) and on the Harry Potter quiz and TOFU benchmark (domain-specific knowledge), the paper is the first to document a significant performance gap when knowledge learned in English is queried in another language. The barrier is confirmed at both the pretraining (§3.1) and fine-tuning (§3.2) stages.
  3. Effectiveness of mixed-language fine-tuning: Fine-tuning on mixed-language data (§4.2) mitigates the barrier more effectively than simple prompt engineering (§4.1). It (1) works even with out-of-domain datasets such as WikiText and (2) also improves generalization to languages not included in the fine-tuning data.
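To make the idea of mixed-language fine-tuning data concrete, here is a minimal Python sketch. It assumes sentence-level mixing, a switching probability `p`, and a placeholder translator; none of these details are given in this review, and `stub_translate` and `mix_languages` are hypothetical names used only for illustration. A real pipeline would replace the stub with an actual machine translation system.

```python
import random
import re
from typing import Callable, List, Sequence


def stub_translate(sentence: str, lang: str) -> str:
    """Hypothetical stand-in for a real MT system: it only tags the sentence."""
    return f"[{lang}] {sentence}"


def mix_languages(
    text: str,
    languages: Sequence[str],
    translate: Callable[[str, str], str] = stub_translate,
    p: float = 0.5,
    seed: int = 0,
) -> List[str]:
    """Split text into sentences and translate each one with probability p.

    Assumption: mixing happens per sentence; the target language of each
    translated sentence is drawn uniformly from `languages`.
    """
    rng = random.Random(seed)  # local RNG so results are reproducible
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    mixed = []
    for sentence in sentences:
        if languages and rng.random() < p:
            mixed.append(translate(sentence, rng.choice(languages)))
        else:
            mixed.append(sentence)  # keep the original (English) sentence
    return mixed


if __name__ == "__main__":
    sample = "Cats sleep a lot. Dogs bark. Birds sing."
    for line in mix_languages(sample, ["fr", "de", "ko"], p=0.5, seed=1):
        print(line)
```

The mixed sentences would then be joined back into documents and used as ordinary language-modeling data for fine-tuning. The per-sentence granularity is a design choice of this sketch, not a claim about the paper's setup.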

How

Figure 4

Figure 4: Visualization of the crosslingual knowledge barrier across 16 languages in the mixed-language MCQ evaluation on MMLU.
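One simple way to quantify the barrier pictured in such an evaluation is the accuracy drop from the English setting to each mixed-language setting. The sketch below assumes per-question correctness flags are already available; the function names and the toy data are hypothetical, and the paper may report the gap differently.

```python
from typing import Dict, List


def accuracy(correct: List[bool]) -> float:
    """Fraction of correctly answered questions (0.0 for an empty list)."""
    return sum(correct) / len(correct) if correct else 0.0


def barrier_gaps(english: List[bool], mixed: Dict[str, List[bool]]) -> Dict[str, float]:
    """Accuracy drop relative to English for each mixed-language setting.

    A positive value means the model does worse when the same questions
    are posed in the mixed-language format for that language.
    """
    base = accuracy(english)
    return {lang: base - accuracy(results) for lang, results in mixed.items()}


if __name__ == "__main__":
    # Toy correctness flags, made up purely for illustration.
    english_results = [True, True, True, False]
    mixed_results = {
        "fr": [True, True, False, False],
        "de": [True, True, True, False],
    }
    print(barrier_gaps(english_results, mixed_results))
```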

Originality

Limitation & Further Study

Evaluation

Overall assessment: This is a meaningful study. It systematically demonstrates the important finding that multilingual LLMs have surface-level crosslingual ability but face a marked barrier when knowledge must be used in depth, and it proposes a practical mitigation through mixed-language fine-tuning. Extending the analysis to low-resource languages and adding interpretability analysis at the neural-network level remain as future work.

Related papers worth reading

Foundational work
Paper 858 examines large-scale unsupervised crosslingual representation learning, which may help explain the root causes of the crosslingual knowledge barrier raised in paper 245.
Alternative approach
A study that proposes an alternative methodology for improving the ability to use knowledge across languages.
Alternative approach
Both works focus on transfer of language or representations, parallel processing, and parameter efficiency, but they differ in the detailed mechanisms.
Alternative approach
The paper "A smack of all neighbouring languages" evaluates the multilingual ability of LLMs on real data, providing empirical evidence that can be compared with the barrier problem raised in the crosslingual knowledge barrier paper.
Follow-up work
Paper 245 analyzes knowledge barriers and crosslingual ability in multilingual settings in depth, methodologically extending the influence of paper 858's XLM-RoBERTa-based work on crosslingual representations.
Follow-up work
Paper 190 experimentally analyzes how crosslingual issues affect causality-centered generation of academic documents, extending the crosslingual barrier analysis of paper 245 to a practical use case.
Follow-up work
Paper 245, which covers knowledge barriers and crosslingual evaluation in multilingual LLMs, extends the observations of paper 690 to a broader problem of linguistic knowledge.
Follow-up work
Paper 245 diagnoses the crosslingual knowledge barrier of multilingual LLMs in depth, extending the motivation behind paper 119's AUTOCAP framework from the problem-analysis side.
Application
Paper 119's AUTOCAP framework experimentally attempts to overcome the implicit crosslingual knowledge barrier described in paper 245, so the relationship between the two studies is clear.