Real-Time Execution of Action Chunking Flow Policies

์ €์ž: Kevin Black, Manuel Y. Galliker, Sergey Levine | ๋‚ ์งœ: 2025-06-09 | URL: https://arxiv.org/abs/2506.07339 📄 PDF


Essence

Real-time chunking (RTC)์€ diffusion ๋˜๋Š” flow ๊ธฐ๋ฐ˜ VLA์˜ inference ์‹œ๊ฐ„์— action chunking ์ •์ฑ…์„ ๋น„๋™๊ธฐ์ ์œผ๋กœ ์‹คํ–‰ํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ, ํ˜„์žฌ chunk ์‹คํ–‰ ์ค‘ ๋‹ค์Œ chunk๋ฅผ ์ƒ์„ฑํ•˜๋ฉด์„œ inference ์ง€์—ฐ์œผ๋กœ ์ธํ•œ ๋ถˆ์—ฐ์†์„ฑ์„ ์ œ๊ฑฐํ•œ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Top: Real-time chunking (RTC) enables the robot to perform highly dexterous and dynamic tasks,

How

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 3/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: RTC๋Š” modern VLA์˜ inference latency ๋ฌธ์ œ๋ฅผ ์‹ค์šฉ์ ์œผ๋กœ ํ•ด๊ฒฐํ•˜๋Š” ์˜๋ฆฌํ•œ inference-time ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ, flow matching์˜ ๊ตฌ์กฐ๋ฅผ ์ฐฝ์˜์ ์œผ๋กœ ํ™œ์šฉํ•˜๋ฉด์„œ๋„ ๊ธฐ์กด ๋ชจ๋ธ์— ๋Œ€ํ•œ ์žฌํ•™์Šต์„ ์š”๊ตฌํ•˜์ง€ ์•Š์•„ ์ฆ‰์‹œ ์ ์šฉ ๊ฐ€๋Šฅํ•˜๋‹ค. ์‹ค์ œ ๋กœ๋ด‡ ์ž‘์—…์—์„œ์˜ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ๊ณผ latency robustness๋Š” embodied AI ์‹œ์Šคํ…œ์˜ ์‹ค์šฉํ™”์— ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋ฅผ ์ œ์‹œํ•œ๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •