์ ์: Yunpeng Gao, Chenhui Li, Zhongrui You, Junli Liu, Zhen Li, Pengan Chen, Qizhi Chen, Zhonghan Tang, Liansheng Wang, Penghui Yang, Yiwen Tang, Yuhang Tang, Shuai Liang, Songyi Zhu, Ziqin Xiong, Yifei Su, Xinyi Ye, Jianan Li, Yan Ding, Dong Wang, Xuelong Li, Zhigang Wang, Bin Zhao | ๋ ์ง: 2025-02-25 | URL: https://arxiv.org/abs/2502.18041 📄 PDF
Figure 1: Overview of OpenFly. This work consists of (1) the integration of 4 rendering engines, significantly
OpenFly๋ ํญ๊ณต Vision-Language Navigation์ ์ํ ์ข ํฉ ํ๋ซํผ์ผ๋ก, 4๊ฐ ๋ ๋๋ง ์์ง, ์๋ํ๋ ๋ฐ์ดํฐ ์์ฑ ํด์ฒด์ธ, 100k ๊ถค์ ์ ๋๊ท๋ชจ ๋ฐ์ดํฐ์ , ๊ทธ๋ฆฌ๊ณ keyframe-aware VLN ๋ชจ๋ธ์ ์ ๊ณตํ๋ค.
Figure 1: Overview of OpenFly. This work consists of (1) the integration of 4 rendering engines, significantly
Figure 2: Framework of the automatic data generation. Multiple rendering engines are integrated
์ดํ: OpenFly๋ ํญ๊ณต VLN ์ฐ๊ตฌ์ ๋ฐ์ดํฐ ๋ถ์กฑ ๋ฌธ์ ๋ฅผ ํ๊ธฐ์ ์ผ๋ก ํด๊ฒฐํ ์ข ํฉ ํ๋ซํผ์ผ๋ก, ๋ค์ค ๋ ๋๋ง ์์ง ํตํฉ, ์์ ์๋ํ ํ์ดํ๋ผ์ธ, 100k ๊ท๋ชจ ๋ฒค์น๋งํฌ๋ฅผ ํตํด embodied AI ๋ถ์ผ์ ์ค์ํ ๊ธฐ์ฌ๋ฅผ ํ๋ค. ์ ์๋ keyframe-aware ๋ชจ๋ธ๋ ํญ๊ณต VLN์ ํน์์ฑ์ ๋ฐ์ํ ํจ๊ณผ์ ์ธ ์ ๊ทผ๋ฒ์ด๋ค.