
[Notable] Differentiable image parameterization, DIP

by SolaKim 2025. 1. 21.

https://distill.pub/2018/differentiable-parameterizations/

 

Differentiable Image Parameterizations

A powerful, under-explored tool for neural network visualizations and art.

distill.pub

 

 

Differentiable image parameterization is a technique used in image generation and optimization: the image is expressed through a set of parameters, and the mapping from those parameters to the image is kept differentiable.

This makes it possible to run gradient-based optimization throughout the process of generating or modifying an image.

It is mainly used with neural network models for tasks such as image generation, transformation, and restoration, where gradients from the model are used to update the parameters that describe the image.
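
The idea can be written compactly. This is a sketch in notation introduced here, not taken from the post: $\theta$ denotes the parameters, $f$ the differentiable mapping from parameters to image, $\mathcal{L}$ a scalar objective evaluated on the image, and $\eta$ a step size. The chain rule carries the gradient from the image back to the parameters, and gradient descent then updates them:

```latex
x = f(\theta), \qquad
\nabla_\theta \mathcal{L}
  = \left( \frac{\partial f}{\partial \theta} \right)^{\top} \nabla_x \mathcal{L}, \qquad
\theta \leftarrow \theta - \eta \, \nabla_\theta \mathcal{L}
```

When $f$ is the identity, this reduces to optimizing the pixel values directly.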

 

 

  • Image parameterization:
    • Instead of handling the image directly as pixel values, the image is represented by parameters that describe its characteristics (e.g., texture, shape, color, structure). These parameters may be network weights or values learned in a latent space.
    • For example, in a model such as NeRF (Neural Radiance Fields), the 3D structure of a scene is parameterized by the weights of a neural network, and images are rendered from it. Each part of the image is produced from the values that the network outputs.
  • Differentiability:
    • Being differentiable means that gradients can be computed during optimization. The gradient indicates how a small change in each parameter changes the result of the image generation process, so the parameters can be adjusted in small steps in the direction that improves the result.
    • With a differentiable parameterization, an optimization algorithm such as gradient descent can be used to train the model and to keep updating the image's parameters until a better result is obtained.
  • Use in image generation:
    • In networks that generate images (e.g., GANs, VAEs, diffusion models), the generator can itself be viewed as a differentiable parameterization: the image is a differentiable function of a latent code and the network weights, so either can be tuned by gradients to adjust fine details of the output.
    • For example, a diffusion model produces an image gradually from initial noise. Because each denoising step is differentiable, gradients of a conditioning objective can steer the intermediate images (classifier guidance is one such method), so that an image matching the condition emerges step by step.
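
The points above can be sketched in a few lines of plain Python. This is a minimal illustration under assumptions of my own, not code from the post or from the Distill article: the "image" is a single row of 8 pixels, the parameterization is a sigmoid that maps unconstrained parameters to pixel values in (0, 1), and the objective is the mean squared error to a toy target. The function names (`render`, `loss_and_grad`, `optimize`) and the learning rate are hypothetical choices for this example.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def render(theta):
    """Differentiable parameterization: unconstrained parameters -> pixels in (0, 1)."""
    return [sigmoid(t) for t in theta]


def loss_and_grad(theta, target):
    """Mean squared error to the target and its gradient with respect to theta."""
    image = render(theta)
    n = len(theta)
    loss = sum((p - t) ** 2 for p, t in zip(image, target)) / n
    # Chain rule: dL/dtheta = dL/dpixel * dpixel/dtheta, where
    # dL/dpixel = 2 * (pixel - target) / n and dpixel/dtheta = pixel * (1 - pixel).
    grad = [2.0 * (p - t) / n * p * (1.0 - p) for p, t in zip(image, target)]
    return loss, grad


def optimize(target, steps=500, lr=6.0, seed=0):
    """Plain gradient descent on theta; returns the final image and the loss history."""
    rng = random.Random(seed)
    theta = [rng.gauss(0.0, 0.1) for _ in target]  # start near mid-gray
    losses = []
    for _ in range(steps):
        loss, grad = loss_and_grad(theta, target)
        losses.append(loss)
        # The step size looks large because the mean divides the gradient by n.
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return render(theta), losses


# Toy target: a horizontal ramp from 0.1 to 0.9.
target = [0.1 + 0.8 * i / 7 for i in range(8)]
image, losses = optimize(target)
print("first loss:", losses[0], "last loss:", losses[-1])
```

The optimizer never touches pixel values directly. It only updates `theta`, and the sigmoid guarantees that every rendered pixel stays in a valid range. The Distill article linked at the top explores richer parameterizations of the same kind, such as Fourier bases and CPPNs.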

 

 

This post was written using ChatGPT.