BeliefMapNav: 3D Voxel-Based Belief Map for Zero-Shot Object Navigation

์ €์ž: Zibo Zhou, Yue Hu, Lingkai Zhang, Zonglin Li, Siheng Chen | ๋‚ ์งœ: 2025-05-27 | URL: https://arxiv.org/abs/2506.06487 📄 PDF


Essence

Figure 2

Figure 2: BeliefMapNav pipeline: The agent initializes with a 360ยฐ rotation. During exploration,

๋ณธ ๋…ผ๋ฌธ์€ 3D voxel ๊ธฐ๋ฐ˜ belief map์„ ํ™œ์šฉํ•˜์—ฌ zero-shot object navigation์—์„œ LLM์˜ ์˜๋ฏธ๋ก ์  ์ถ”๋ก ๊ณผ ๊ณ„์ธต์  ๊ณต๊ฐ„ ์ •๋ณด๋ฅผ ํ†ตํ•ฉํ•จ์œผ๋กœ์จ ๋กœ๋ด‡์ด ์‚ฌ์ „ ํ•™์Šต์ด๋‚˜ ์‚ฌ์ „ ๊ตฌ์ถ• ๋งต ์—†์ด ์ž์—ฐ์–ด๋กœ ์ง€์ •๋œ ๋Œ€์ƒ์„ ๋ฏธ์ง€์˜ ํ™˜๊ฒฝ์—์„œ ์ฐพ์„ ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค.

Motivation

Achievement

Figure 4

Figure 4: Visualization of the prior belief map, visibility map, and the posterior belief map, with an

How

Figure 2

Figure 2: BeliefMapNav pipeline: The agent initializes with a 360ยฐ rotation. During exploration,

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: ๋ณธ ๋…ผ๋ฌธ์€ 3D voxel-based belief map์„ ํ†ตํ•ด LLM ์˜๋ฏธ๋ก ๊ณผ ๊ณต๊ฐ„ ๊ตฌ์กฐ๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ํ†ตํ•ฉํ•˜๊ณ  ํ™•๋ฅ  ๊ธฐ๋ฐ˜ ๊ฒฝ๋กœ ๊ณ„ํš์œผ๋กœ zero-shot object navigation ์„ฑ๋Šฅ์„ ๋Œ€ํญ ํ–ฅ์ƒ์‹œํ‚จ ์šฐ์ˆ˜ํ•œ ๊ธฐ์—ฌ์ด๋‹ค. ๋‹ค๋งŒ ์‹ค์ œ ๋กœ๋ด‡ ๋ฐฐ์น˜ ์‹œ ๊ณ„์‚ฐ ๋ณต์žก๋„์™€ LLM ์˜ค๋ฅ˜์— ๋Œ€ํ•œ ๊ฐ•๊ฑด์„ฑ ๊ฒ€ํ† ๊ฐ€ ํ•„์š”ํ•˜๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •