Robots Enact Malignant Stereotypes

์ €์ž: Andrew Hundt, William Agnew, Vicky Zeng, Severin Kacianka, Matthew Gombolay | ๋‚ ์งœ: 2022-07-23 | URL: https://arxiv.org/abs/2207.11569 📄 PDF


Essence

Figure 1

Fig. 1. An example trial showing harmful robot behavior that is, in aggregate, racially stratified like White supremacis

๋ณธ ๋…ผ๋ฌธ์€ CLIP ๊ฐ™์€ ๋Œ€๊ทœ๋ชจ ๊ธฐ์ดˆ ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜๋Š” ๋กœ๋ด‡ ์กฐ์ž‘ ์‹œ์Šคํ…œ์ด ์‹ค์ œ ๋ฌผ๋ฆฌ์  ํ™˜๊ฒฝ์—์„œ ์ธ์ข…, ์„ฑ๋ณ„ ๊ณ ์ •๊ด€๋…๊ณผ ๊ณผํ•™์ ์œผ๋กœ ์ž…์ฆ๋˜์ง€ ์•Š์€ ๊ณจ์ƒํ•™์„ ์ฒด๊ณ„์ ์œผ๋กœ ์žฌํ˜„ํ•˜๋Š” ๊ฒƒ์„ ์ฒ˜์Œ์œผ๋กœ ์‹ค์ฆ์ ์œผ๋กœ ์ž…์ฆํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Fig. 1. An example trial showing harmful robot behavior that is, in aggregate, racially stratified like White supremacis

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ ๋กœ๋ด‡๊ณตํ•™์—์„œ ๊ธฐ์ดˆ ๋ชจ๋ธ์˜ ํŽธํ–ฅ์ด ๋ฌผ๋ฆฌ์  ์„ธ๊ณ„์—์„œ ์‹ค์ œ๋กœ ์žฌํ˜„๋˜๋Š” ํ˜„์ƒ์„ ์ฒ˜์Œ์œผ๋กœ ์‹ค์ฆ์ ์œผ๋กœ ์ž…์ฆํ•˜๋ฉฐ, ๋กœ๋ด‡ ์ž์œจ์„ฑ์˜ ์œ„ํ—˜์„ฑ์„ ๊ฐ•์กฐํ•˜๋Š” ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋‹ค. ํ•™์ œ ๊ฐ„ ์ ‘๊ทผ๊ณผ ๋ช…ํ™•ํ•œ ์ •์ฑ… ์ œ์–ธ์œผ๋กœ ๋กœ๋ด‡๊ณตํ•™ ๊ณต๋™์ฒด์˜ ์šฐ์„ ์  ํ–‰๋™ ๋ณ€ํ™”๋ฅผ ์ด‰๊ตฌํ•˜๋Š” ์˜๋ฏธ ์žˆ๋Š” ์ž‘์—…์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •