ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

์ €์ž: Wenlong Huang, Chen Wang, Yunzhu Li, Ruohan Zhang, Li Fei-Fei | ๋‚ ์งœ: 2024-09-03 | URL: https://arxiv.org/abs/2409.01652 📄 PDF


Essence

Figure 1

Figure 1: Relational Keypoint Constraints (ReKep) specify diverse manipulation behaviors as an opti-

ReKep๋Š” ๋กœ๋ด‡ ์กฐ์ž‘ ์ž‘์—…์„ 3D ํ‚คํฌ์ธํŠธ๋ฅผ ์ž…๋ ฅ์œผ๋กœ ํ•˜๋Š” Python ํ•จ์ˆ˜ ํ˜•ํƒœ์˜ ์ œ์•ฝ ์กฐ๊ฑด์œผ๋กœ ํ‘œํ˜„ํ•˜๋ฉฐ, ๋Œ€๊ทœ๋ชจ ๋น„์ „ ๋ชจ๋ธ๊ณผ ๋น„์ „-์–ธ์–ด ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜์—ฌ ์ž๋™์œผ๋กœ ์ด๋Ÿฌํ•œ ์ œ์•ฝ์„ ์ƒ์„ฑํ•˜๊ณ  ๊ณ„์ธต์  ์ตœ์ ํ™”๋กœ ์‹ค์‹œ๊ฐ„ ๋กœ๋ด‡ ์ œ์–ด๋ฅผ ์‹คํ˜„ํ•œ๋‹ค.

Motivation

Achievement

Figure 3

Figure 3: Experiment tasks and visualization of optimization results. Seven tasks are designed to validate

How

Figure 2

Figure 2: Overview of ReKep. DINOv2 [5] first proposes keypoints in the scene, which are overlaid on

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ReKep์€ keypoint ๊ธฐ๋ฐ˜ ์ œ์•ฝ ํ‘œํ˜„๊ณผ foundation model ๊ธฐ๋ฐ˜ ์ž๋™ ์ƒ์„ฑ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅํ•˜๊ณ  ์‹ค์‹œ๊ฐ„ ์‹คํ–‰ ๊ฐ€๋Šฅํ•œ ๋กœ๋ด‡ ์กฐ์ž‘ ์‹œ์Šคํ…œ์„ ์ œ์‹œํ•œ๋‹ค. ๋‹ค์–‘ํ•œ ์ž‘์—… ์‹ค์—ฐ๊ณผ ๋‘ ๊ฐ€์ง€ ๋กœ๋ด‡ ํ”Œ๋žซํผ ๊ตฌํ˜„์œผ๋กœ ์‹ค์งˆ์  ๊ฐ€์น˜๋ฅผ ์ž…์ฆํ•˜๋ฉฐ, ์ž‘์—…๋ณ„ ๋ฐ์ดํ„ฐ ์—†๋Š” generalist ๋กœ๋ด‡ ๊ตฌํ˜„์˜ ์ค‘์š”ํ•œ ์ง„์ „์„ ๋‚˜ํƒ€๋‚ธ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •