A Rapid Deployment Pipeline for Autonomous Humanoid Grasping Based on Foundation Models

์ €์ž: Yifei Yan, Yankai Liao, Linqi Ye | ๋‚ ์งœ: 2026-04-19 | URL: https://arxiv.org/abs/2604.17258 📄 PDF


Essence

Figure 1

Fig. 1. The three-stage pipeline for rapid deployment of humanoid grasping.

Foundation model๋“ค(YOLOv8, SAM 3D, FoundationPose)์„ ํ†ตํ•ฉํ•˜์—ฌ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ์ƒˆ๋กœ์šด ๋ฌผ์ฒด ์กฐ์ž‘ ๋ฐฐํฌ ์‹œ๊ฐ„์„ 1-2์ผ์—์„œ ์•ฝ 30๋ถ„์œผ๋กœ ๋‹จ์ถ•ํ•˜๋Š” end-to-end ํŒŒ์ดํ”„๋ผ์ธ์„ ์ œ์‹œํ•œ๋‹ค.

Motivation

Achievement

How

Figure 1

Fig. 1. The three-stage pipeline for rapid deployment of humanoid grasping.

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: Foundation model๋“ค์˜ ํšจ๊ณผ์  ํ†ตํ•ฉ์œผ๋กœ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡ ๋ฐฐํฌ ์‹œ๊ฐ„์„ ํš๊ธฐ์ ์œผ๋กœ ๋‹จ์ถ•ํ•œ ์‹ค์šฉ์ ์ด๊ณ  ์šฐ์ˆ˜ํ•œ ๋…ผ๋ฌธ์ด๋ฉฐ, ์ž๋™ ์ฃผ์„, zero-shot 3D ์žฌ๊ตฌ์„ฑ, zero-shot pose tracking์„ ์—ฐ๊ณ„ํ•œ modular ์„ค๊ณ„๊ฐ€ ์‚ฐ์—… ์ ์šฉ์„ฑ์„ ๋†’์ธ๋‹ค. ๋‹ค๋งŒ ์ œํ•œ๋œ ๋ฌผ์ฒด ์œ ํ˜•๊ณผ ํ™˜๊ฒฝ ์กฐ๊ฑด์—์„œ์˜ ๊ฒ€์ฆ์ด ์ผ๋ฐ˜ํ™” ๊ฐ€๋Šฅ์„ฑ์„ ํŒ๋‹จํ•˜๊ธฐ ์œ„ํ•ด ์ถ”๊ฐ€ ํ•„์š”ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •