RoboArena: Distributed Real-World Evaluation of Generalist Robot Policies

์ €์ž: Pranav Atreya, Karl Pertsch, Tony Lee, Moo Jin Kim, Arhan Jain, Artur Kuramshin, Clemens Eppner, Cyrus Neary, Edward Hu, Fabio Ramos, Jonathan Tremblay, Kanav Arora, Kirsty Ellis, Luca Macesanu, Marcel Torne Villasevil, Matthew Leonard, Meedeum Cho, Ozgur Aslan, Shivin Dass, Jie Wang, William Reger, Xingfang Yuan, Xuning Yang, Abhishek Gupta, Dinesh Jayaraman, Glen Berseth, Kostas Daniilidis, Roberto Martin-Martin, Youngwoon Lee, Percy Liang, Chelsea Finn, Sergey Levine | ๋‚ ์งœ: 2025-06-22 | URL: https://arxiv.org/abs/2506.18123 📄 PDF


Essence

Figure 1: We present RoboArena, a distributed real-world evaluation framework for generalist robot

RoboArena is a crowdsourcing-based evaluation framework in which a distributed network of evaluators runs pairwise comparisons of generalist robot policies in real-world environments; the comparisons are then aggregated into a policy ranking. Across more than 600 real-robot evaluations, the authors show that this yields more accurate policy rankings than centralized evaluation.
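To make the "pairwise comparisons aggregated into a ranking" step concrete, here is a minimal sketch that uses a plain Bradley-Terry model, a standard way to turn win/loss outcomes into strength scores. This is an illustrative stand-in, not necessarily the aggregation RoboArena itself uses; the policy names, the comparison counts, and the `bradley_terry` helper are all hypothetical.

```python
from collections import defaultdict


def bradley_terry(comparisons, n_iters=200, tol=1e-9):
    """Fit Bradley-Terry strengths from (winner, loser) pairs.

    Illustrative only: assumes no ties and ignores evaluator or task effects.
    Returns a dict mapping policy name to a strength, normalized to sum to 1.
    """
    wins = defaultdict(float)         # total wins per policy
    pair_counts = defaultdict(float)  # number of comparisons per unordered pair
    policies = set()
    for winner, loser in comparisons:
        wins[winner] += 1.0
        pair_counts[frozenset((winner, loser))] += 1.0
        policies.update((winner, loser))
    if not policies:
        return {}

    policies = sorted(policies)
    strength = {p: 1.0 / len(policies) for p in policies}

    # Standard minorization-maximization update for Bradley-Terry.
    for _ in range(n_iters):
        updated = {}
        for p in policies:
            denom = 0.0
            for q in policies:
                if q == p:
                    continue
                n_pq = pair_counts.get(frozenset((p, q)), 0.0)
                if n_pq:
                    denom += n_pq / (strength[p] + strength[q])
            updated[p] = wins[p] / denom if denom else strength[p]
        total = sum(updated.values())
        updated = {p: s / total for p, s in updated.items()}
        delta = max(abs(updated[p] - strength[p]) for p in policies)
        strength = updated
        if delta < tol:
            break
    return strength


# Hypothetical outcomes pooled from several evaluators: (winner, loser).
comparisons = (
    [("policy_a", "policy_b")] * 7 + [("policy_b", "policy_a")] * 3
    + [("policy_a", "policy_c")] * 8 + [("policy_c", "policy_a")] * 2
    + [("policy_b", "policy_c")] * 6 + [("policy_c", "policy_b")] * 4
)

strengths = bradley_terry(comparisons)
ranking = sorted(strengths, key=strengths.get, reverse=True)
print(ranking)
```

Because every evaluator only needs to report which of two policies did better on their own task and setup, the comparisons can be collected independently and pooled afterwards, which is what makes a distributed evaluation of this kind workable.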

Motivation

Achievement

How

Figure 2: Pipeline for extracting qualitative policy characteristics from RoboArena's rich evaluation

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 | Technical Soundness: 3/5 | Significance: 4/5 | Clarity: 4/5 | Overall: 4/5

์ดํ‰: RoboArena๋Š” ์ผ๋ฐ˜ํ™” ๋กœ๋ด‡ ์ •์ฑ…์˜ ํ‰๊ฐ€๋ผ๋Š” ์ค‘์š”ํ•œ ๋ฌธ์ œ์— ๋Œ€ํ•ด ํ˜์‹ ์ ์ธ ๋ถ„์‚ฐ ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ ์ ‘๊ทผ๋ฒ•์„ ์ œ์‹œํ•˜๋ฉฐ, 600ํšŒ์˜ ์‹ค์ œ ๋กœ๋ด‡ ํ‰๊ฐ€๋ฅผ ํ†ตํ•ด ๋ฐฉ๋ฒ•์˜ ํšจ๊ณผ์„ฑ์„ ์ž…์ฆํ–ˆ๋‹ค. ์˜คํ”ˆ ์ปค๋ฎค๋‹ˆํ‹ฐ ํ”Œ๋žซํผ์œผ๋กœ์„œ ๋กœ๋ด‡ ์ •์ฑ… ๋ฒค์น˜๋งˆํ‚น ์ƒํƒœ๊ณ„์— ์ƒ๋‹นํ•œ ๊ธฐ์—ฌ๋ฅผ ํ•  ์ˆ˜ ์žˆ๋Š” ํš๊ธฐ์ ์ธ ์—ฐ๊ตฌ์ด๋‹ค.
