Differential analysis of genomics count data with edgePython

์ €์ž: | ๋‚ ์งœ: 2026-02-16 | URL: https://www.biorxiv.org/content/10.64898/2026.02.16.706223v1 📄 PDF


Essence

Figure 1

Figure 1: Validation of edgePython. Each panel shows a scatter plot comparing outputs from identical analyses

๋…ผ๋ฌธ์€ ์ธ๊ธฐ ์žˆ๋Š” R ๊ธฐ๋ฐ˜ edgeR ํŒจํ‚ค์ง€๋ฅผ Python์œผ๋กœ ํฌํŒ…ํ•˜์—ฌ edgePython ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ๊ฐœ๋ฐœํ•˜๊ณ , ๋‹จ์ผ์„ธํฌ RNA-seq ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•ด ์Œ์ดํ•ญ ๋ถ„ํฌ-๊ฐ๋งˆ ํ˜ผํ•ฉ ๋ชจํ˜•๊ณผ ๊ฒฝํ—˜์  ๋ฒ ์ด์ฆˆ ์ถ•์†Œ๋ฅผ ์ถ”๊ฐ€ ๊ตฌํ˜„ํ–ˆ๋‹ค. ์ด๋ฅผ ํ†ตํ•ด Python ์ค‘์‹ฌ์˜ ๋‹จ์ผ์„ธํฌ ์ƒ๋ฌผ์ •๋ณด ์ƒํƒœ๊ณ„(scverse, AnnData)์™€์˜ ํ†ตํ•ฉ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ–ˆ๋‹ค.

Motivation

Achievement

Figure 1

Figure 1: Validation of edgePython. Each panel shows a scatter plot comparing outputs from identical analyses

Python ํฌํŒ…์˜ ์™„์„ฑ๋„: edgeR์˜ 86๊ฐœ ๊ณต๊ฐœ ํ•จ์ˆ˜/ํด๋ž˜์Šค๋ฅผ 24๊ฐœ ๋ชจ๋“ˆ๋กœ ์กฐ์งํ™”ํ•˜์—ฌ ์ •๊ทœํ™”, ๋ถ„์‚ฐ ์ถ”์ •, GLM ์ ํ•ฉ, 4๊ฐ€์ง€ ๊ฐ€์„ค๊ฒ€์ • ๋ฐฉ์‹(exact test, LRT, QL F-test, TREAT), ์œ ์ „์ž ์ง‘ํ•ฉ ๊ฒ€์ •(camera, fry, roast, mroast, romer)์„ ๋ชจ๋‘ ๊ตฌํ˜„. ๊ฒ€์ฆ: 87๊ฐœ ๋‹จ์œ„ ํ…Œ์ŠคํŠธ(4,344์ค„)๋กœ ๋ชจ๋“  ์ฃผ์š” ์„ฑ๋ถ„์„ ๊ฒ€์ฆํ•˜์—ฌ R ๊ฒฐ๊ณผ์™€ ์ƒ๋Œ€ ์˜ค์ฐจ 10โปยณ ์ด๋‚ด๋กœ ์ผ์น˜.

์‹ค์ œ ๋ฐ์ดํ„ฐ ๊ฒ€์ฆ: HOXA1 knockdown ๋ฐ์ดํ„ฐ(207,175 transcript)์™€ GSE60450 ๋งˆ์šฐ์Šค ์œ ์„  ๋ฐ์ดํ„ฐ(15,804 gene)์—์„œ TMM ์ •๊ทœํ™”, ๋ถ„์‚ฐ ์ถ”์ •, GLM ๊ณ„์ˆ˜, ์œ ์ „์ž ์ง‘ํ•ฉ ๊ฒ€์ • ๊ฒฐ๊ณผ๊ฐ€ ๋™์ผํ•˜๊ฒŒ ๋‚˜ํƒ€๋‚จ(Figure 1 a-p์˜ ์‚ฐ์ ๋„์—์„œ ๋Œ€๊ฐ์„ ์ƒ ์œ„์น˜).

์ƒˆ๋กœ์šด ๋ฐฉ๋ฒ•๋ก : NEBULA-LN ๊ธฐ๋ฐ˜ ์Œ์ดํ•ญ-๊ฐ๋งˆ ํ˜ผํ•ฉ ๋ชจํ˜•๊ณผ empirical Bayes ์„ธํฌ ๋ ˆ๋ฒจ ๋ถ„์‚ฐ ์ˆ˜์ถ•์„ edgeR์— ์ฒ˜์Œ ๊ตฌํ˜„ํ•˜์—ฌ ๋‹ค์ค‘ ๋Œ€์ƒ์ž ๋‹จ์ผ์„ธํฌ ๋ถ„์„ ๊ธฐ๋Šฅ ์ถ”๊ฐ€. AnnData์™€ ์–‘๋ฐฉํ–ฅ ๋ณ€ํ™˜ ์ง€์›์œผ๋กœ scverse ์ƒํƒœ๊ณ„ ์™„์ „ ํ†ตํ•ฉ.

How

Figure 1

Figure 1: Validation of edgePython. Each panel shows a scatter plot comparing outputs from identical analyses

๊ตฌํ˜„ ์ „๋žต:

๊ฒ€์ฆ ํ”„๋กœ์„ธ์Šค:

Originality

Limitation & Further Study

ํ•œ๊ณ„์ :

ํ–ฅํ›„ ์—ฐ๊ตฌ:

Evaluation

Novelty: 4/5 Technical Soundness: 5/5 Significance: 5/5 Clarity: 4/5 Overall: 5/5

์ดํ‰: edgePython์€ ๋‹จ์ผ์„ธํฌ ์ƒ๋ฌผ์ •๋ณดํ•™ ๋ถ„์•ผ์—์„œ ํ˜„์ €ํ•œ ์‹ค์ œ ๊ฐ€์น˜๋ฅผ ๊ฐ–๋Š”๋‹ค. R ์ „์šฉ edgeR์„ Python์œผ๋กœ ์™„์ „ํžˆ ํฌํŒ…ํ•˜๊ณ  ์ƒˆ๋กœ์šด ํ˜ผํ•ฉ ๋ชจํ˜•์„ ์ถ”๊ฐ€ํ•˜์—ฌ Python ์ค‘์‹ฌ scverse ์ƒํƒœ๊ณ„์™€์˜ ์™„์ „ํ•œ ํ†ตํ•ฉ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•จ์œผ๋กœ์จ, ์ˆ˜์ฒœ ๊ฐœ์˜ ์ž ์žฌ์  ์‚ฌ์šฉ์ž๊ฐ€ R/Python ๊ฐ„ ๋ฒˆ๊ฑฐ๋กœ์šด ๋ณ€ํ™˜ ์—†์ด ๊ฐ•๋ ฅํ•œ ํ†ต๊ณ„ ๋ฐฉ๋ฒ•๋ก ์„ ์ง์ ‘ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ–ˆ๋‹ค. ๊ด‘๋ฒ”์œ„ํ•œ ๊ฒ€์ฆ๊ณผ ๊ธฐ์ˆ ์  ์—„๋ฐ€์„ฑ์„ ๊ฐ–์ถ˜ ๊ณ ํ’ˆ์งˆ ํฌํŒ…์œผ๋กœ, ๋‹จ์ผ์„ธํฌ RNA-seq ๋ถ„์„ ๋ฐฉ๋ฒ•๋ก ์˜ ์‹ค์งˆ์  ์ง„์ „์„ ์ด๋ฃฌ ์ค‘์š”ํ•œ ๊ธฐ์—ฌ๋‹ค.

๊ฐ™์ด ๋ณด๋ฉด ์ข‹์€ ๋…ผ๋ฌธ

๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
๋‹จ์ผ์„ธํฌ ์œ ์ „์ž ๋ฐœํ˜„ ๋ฐ์ดํ„ฐ ๋ถ„์„์—์„œ ๊ธฐ๋ณธ์ด ๋˜๋Š” ๋Œ€๊ทœ๋ชจ ํŒŒ์ด์ฌ ๊ธฐ๋ฐ˜ ๋„๊ตฌ๋กœ, edgePython ๊ตฌํ˜„๊ณผ ์ง์ ‘์ ์œผ๋กœ ๊ด€๋ จ์ด ์žˆ์Šต๋‹ˆ๋‹ค.
๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ
Integrated analysis of multimodal single-cell data ๋…ผ๋ฌธ์€ ๋‹จ์ผ์„ธํฌ ๋ฐ์ดํ„ฐ ํ†ตํ•ฉ ๋ถ„์„์˜ ์ƒˆ๋กœ์šด ์ ‘๊ทผ์„ ์†Œ๊ฐœํ•˜์—ฌ, edgePython์˜ ๋‹จ์ผ์„ธํฌ ์œ ์ „์ž ๋ฐœํ˜„ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ™•์žฅ์— ๊ธฐ์ดˆ๊ฐ€ ๋ฉ๋‹ˆ๋‹ค.
๋‹ค๋ฅธ ์ ‘๊ทผ
Differential analysis of genomics count data with edgePython์€ ์œ ์ „์žยท๋‹จ๋ฐฑ์งˆ ๋ณ€์ด ๋ถ„์„์„ ์ˆ˜ํ–‰ํ•˜์ง€๋งŒ, ๋‹ค์ค‘์„œ์—ด ์ •๋ณด๋ฅผ ๊นŠ๊ฒŒ ํ™œ์šฉํ•ด ๋‹จ์ผ์„œ์—ด ๊ธฐ๋ฐ˜ ์˜ˆ์ธก์ธ 3109์™€ ๋น„๊ต๋œ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
Effective gene expression prediction from sequence ๋…ผ๋ฌธ์€ ์‹œํ€€์Šค ๊ธฐ๋ฐ˜ ์œ ์ „์ž ๋ฐœํ˜„ ์˜ˆ์ธก์„ ๋‹ค๋ค„, edgePython์˜ ๋‹ค์–‘ํ•œ ์˜ˆ์ธก ๋ฐ ๋ถ„์„ ๋ฐฉ๋ฒ•๋ก ์„ ์ตœ์‹  ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฒ•๊ณผ ์—ฐ๊ฒฐํ•ด์ค๋‹ˆ๋‹ค.
ํ›„์† ์—ฐ๊ตฌ
LLM ๊ธฐ๋ฐ˜์˜ ์ ๊ทน์  ํƒ๊ตฌ(active inquiry)๋ฅผ ํ†ตํ•ด ๋‹จ์ผ์„ธํฌ ๋ฐ์ดํ„ฐ ํ•ด์„ ์„ฑ๋Šฅ ํ–ฅ์ƒ ๋ฐ ์ž๋™ํ™” ๊ฐ€๋Šฅ์„ฑ์„ ํƒ๊ตฌํ•œ ๋…ผ๋ฌธ์ž…๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
edgePython ๋…ผ๋ฌธ์€ ๋‹จ์ผ์„ธํฌ RNA-seq ๋ฐ์ดํ„ฐ์— ํŠนํ™”๋œ ํŒŒ์ด์ฌ ๊ธฐ๋ฐ˜ ๋ถ„์„๋„๊ตฌ๋ฅผ ๊ฐœ๋ฐœํ•˜์—ฌ, ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋‹จ์ผ์„ธํฌ ๋ฐ์ดํ„ฐ ์‹ค์ œ ๋ถ„์„์—์„œ WNN์ฒ˜๋Ÿผ ๋ฐ์ดํ„ฐ ํ†ตํ•ฉ ์‹ค์šฉํ™” ์˜ˆ์‹œ๋ฅผ ๋ณด์—ฌ์ค๋‹ˆ๋‹ค.
์‘์šฉ ์‚ฌ๋ก€
์ž์œจ๋กœ๋ด‡ ์‹คํ—˜์‹ค์—์„œ ์ƒ์„ฑ๋œ ๋‹จ์ผ์„ธํฌ RNA-seq ๋“ฑ ๋Œ€๊ทœ๋ชจ ์ƒ๋ฌผ์ •๋ณด ๋ฐ์ดํ„ฐ ๋ถ„์„์— Python๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ edgePython์ด ์‹ค์งˆ์ ์œผ๋กœ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค.
← ๋ชฉ๋ก์œผ๋กœ ๋Œ์•„๊ฐ€๊ธฐ

๐ŸŽง Audio Overview

์ด ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ๋ฅผ ํŒŸ์บ์ŠคํŠธํ˜• ์˜ค๋””์˜ค๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. (Gemini ยท ํ‚ค๋Š” ๋ธŒ๋ผ์šฐ์ €์—๋งŒ ์ €์žฅ ยท ์™„์„ฑ๋ณธ์€ ์ด๋ฉ”์ผ๋กœ๋„ ์ „์†ก)
โ–ธ ๊ณ ๊ธ‰: ๊ตฌ์„ฑ ๋ฐฉํ–ฅ(๋Œ€๋ณธ ์ž‘์„ฑ ์ง€์นจ) ์ง์ ‘ ์ˆ˜์ •