CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation

์ €์ž: Sung-Wook Lee, Xuhui Kang, Brandon Yang, Yen-Ling Kuo | ๋‚ ์งœ: 2025-08-03 | URL: https://arxiv.org/abs/2508.01600 📄 PDF


Essence

Figure 1

Figure 1: Comparison between Behavior Cloning (BC) and Contrastive Learning via Action

CLASS๋Š” ํ–‰๋™ ์‹œํ€€์Šค ์œ ์‚ฌ์„ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” supervised contrastive learning์„ ํ†ตํ•ด ๋กœ๋ด‡ ์กฐ์ž‘ ํƒœ์Šคํฌ์—์„œ robustํ•œ ์‹œ๊ฐ์  ํ‘œํ˜„์„ ํ•™์Šตํ•˜๋Š” ๋ฐฉ๋ฒ•์ด๋‹ค. DTW๋กœ ์ธก์ •๋œ action sequence ์œ ์‚ฌ์„ฑ์„ ์•ฝํ•œ ๊ฐ๋… ์‹ ํ˜ธ๋กœ ํ™œ์šฉํ•˜์—ฌ heterogeneous ๋ฐ์ดํ„ฐ์…‹์—์„œ์˜ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚จ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Comparison between Behavior Cloning (BC) and Contrastive Learning via Action

How

Figure 2

Figure 2: Training pipeline. The inner block corre-

Originality

Limitation & Further Study

Evaluation

Novelty: 4/5 Technical Soundness: 4/5 Significance: 4/5 Clarity: 4/5 Overall: 4/5

์ดํ‰: CLASS๋Š” action sequence ์œ ์‚ฌ์„ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ์ƒˆ๋กœ์šด ์•ฝํ•œ ๊ฐ๋… ์‹ ํ˜ธ๋ฅผ ์ œ์•ˆํ•˜์—ฌ ๋กœ๋ด‡ ์กฐ์ž‘์—์„œ heterogeneous ์‹œ๊ฐ ์กฐ๊ฑด์— robustํ•œ ํ‘œํ˜„ ํ•™์Šต์„ ํšจ๊ณผ์ ์œผ๋กœ ๋‹ฌ์„ฑํ•œ๋‹ค. Comprehensive ํ‰๊ฐ€์™€ ์‹ค์šฉ์  ์„ฑ๋Šฅ ํ–ฅ์ƒ์œผ๋กœ ๋กœ๋ด‡ ํ•™์Šต ๋ถ„์•ผ์— significant contribution์„ ์ œ๊ณตํ•˜๋Š” ์šฐ์ˆ˜ํ•œ ๋…ผ๋ฌธ์ด๋‹ค.

← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •