NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance

์ €์ž: Wenzhe Cai, Jiaqi Peng, Yuqiang Yang, Yujian Zhang, Meng Wei, Hanqing Wang, Yilun Chen, Tai Wang, Jiangmiao Pang | ๋‚ ์งœ: 2025-05-13 | URL: https://arxiv.org/abs/2505.08712 📄 PDF


Essence

Figure 1

Fig. 1: NavDP is solely trained with simulation data but can achieve zero-shot sim-to-real transfer to different types o

NavDP๋Š” ์‹œ๋ฎฌ๋ ˆ์ด์…˜์—์„œ๋งŒ ํ•™์Šตํ•œ unified transformer ๊ธฐ๋ฐ˜ diffusion policy๋กœ, privileged information์„ ํ™œ์šฉํ•œ trajectory generation๊ณผ critic value prediction์„ ํ†ตํ•ด zero-shot sim-to-real transfer๋ฅผ ๋‹ฌ์„ฑํ•œ๋‹ค.

Motivation

Achievement

Figure 4

Fig. 4: Visualization of comparison among navigation approaches. Two common failure mode of baselines are displayed:

How

Figure 2

Fig. 2: Overview of the network architecture. NavDP is con-

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: NavDP๋Š” ์‹œ๋ฎฌ๋ ˆ์ด์…˜์˜ privileged information์„ ํšจ๊ณผ์ ์œผ๋กœ ํ™œ์šฉํ•˜๋Š” unified transformer ์•„ํ‚คํ…์ฒ˜์™€ ๋Œ€๊ทœ๋ชจ ํšจ์œจ์  ๋ฐ์ดํ„ฐ ์—”์ง„์œผ๋กœ navigation ๋ถ„์•ผ์—์„œ significant advance๋ฅผ ๋‹ฌ์„ฑํ–ˆ์œผ๋ฉฐ, zero-shot sim-to-real transfer์™€ cross-embodiment ์ผ๋ฐ˜ํ™” ์ธก๋ฉด์—์„œ ๊ฐ•๋ ฅํ•œ empirical ๊ฒฐ๊ณผ๋ฅผ ๋ณด์—ฌ์ค€๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •