Generative Model (3)

[Paper Review] High-Resolution Image Synthesis with Latent Diffusion Models (a.k.a. Stable Diffusion) https://arxiv.org/abs/2112.10752 Abstract excerpt: "By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a guiding mechanism .." This post reviews the very well-known Stable Diffusion paper.. 2025. 2. 4.
[Notable] Key Problems of GANs: Mode Collapse and Training Instability ✅ 1. Mode Collapse 🚩 What is mode collapse? Mode collapse is the phenomenon in which a GAN's generator fails to learn the diverse patterns in the data and repeatedly produces only a limited set of patterns. Example: a generator trained on a dataset of cat photos should produce a variety of cat images, but once mode collapse occurs it keeps producing only "one type of cat." 🔍 Why does it happen? A GAN is built on competition between a generator and a discriminator. In this process: the generator happens to find a particular pattern that fools the discriminator well. It learns that reusing this pattern keeps fooling the discriminator. Eventually the diversity of the data disappears and a particular mo.. 2025. 2. 4.
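The collapse described in the excerpt above can be made concrete with a small diagnostic. The following is a minimal sketch, not taken from the post: it assumes a hypothetical 1-D data distribution with three known modes and uses hand-written numbers as stand-ins for generator outputs (no GAN is trained). The helper `mode_coverage` and the `radius` threshold are illustrative names and choices.

```python
def mode_coverage(samples, modes, radius=0.5):
    """Fraction of modes that have at least one sample within `radius`."""
    covered = sum(
        1 for m in modes if any(abs(s - m) <= radius for s in samples)
    )
    return covered / len(modes)


# Hypothetical data distribution with three modes (think: three "cat types").
modes = [-2.0, 0.0, 2.0]

# Hand-written stand-ins for generator outputs; these are assumptions,
# not the output of a real model.
healthy_samples = [-2.1, -1.9, 0.1, -0.2, 2.05, 1.8]
collapsed_samples = [0.1, -0.05, 0.02, 0.08, -0.1, 0.0]

healthy = mode_coverage(healthy_samples, modes)
collapsed = mode_coverage(collapsed_samples, modes)

print(f"healthy coverage:   {healthy:.2f}")
print(f"collapsed coverage: {collapsed:.2f}")
```

A coverage well below 1.0 is one possible sign that the generator has settled on a subset of the modes. With real image data the modes are not known in advance, so a check like this would need a proxy such as cluster or class assignments.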
Generative Model, 1st Generation: Autoencoder (AE) A model with the ability to generate new data from given input data. An AE is mainly used in an unsupervised learning setting: it compresses the input data into a latent representation and then reconstructs it, producing data similar to the input. An AE basically consists of two parts, an encoder and a decoder. Encoder: converts the input data into a low-dimensional latent representation, typically a low-dimensional dense vector. The encoder is trained to compress the input data into a low-dimensional space. Decoder: reconstructs the latent representation back into the original data space to generate new data. The decoder.. 2023. 7. 11.
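The encoder/decoder split described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the post's own code: the data are toy 2-D points on a single line, the latent is one number, the encoder and decoder are linear and share one weight vector `w` (tied weights), and training is plain gradient descent on the squared reconstruction error. All names here are hypothetical.

```python
# Toy data: 2-D points that all lie on one line, so a 1-D latent suffices.
direction = (0.6, 0.8)
data = [(t * direction[0], t * direction[1]) for t in (-1.0, -0.5, 0.5, 1.0)]


def encode(x, w):
    """Encoder: compress a 2-D input into a single latent number z."""
    return w[0] * x[0] + w[1] * x[1]


def decode(z, w):
    """Decoder: map the latent z back into the 2-D data space."""
    return (z * w[0], z * w[1])


def mean_loss(w):
    """Mean squared reconstruction error over the toy data."""
    total = 0.0
    for x in data:
        x_hat = decode(encode(x, w), w)
        total += (x_hat[0] - x[0]) ** 2 + (x_hat[1] - x[1]) ** 2
    return total / len(data)


def train(w, lr=0.1, steps=500):
    """Gradient descent on the reconstruction error (tied weights)."""
    w = list(w)
    for _ in range(steps):
        grad = [0.0, 0.0]
        for x in data:
            z = encode(x, w)
            x_hat = decode(z, w)
            r = (x_hat[0] - x[0], x_hat[1] - x[1])  # reconstruction residual
            r_dot_w = r[0] * w[0] + r[1] * w[1]
            for k in range(2):
                # d/dw_k of ||z*w - x||^2 with z = w.x
                grad[k] += 2 * r_dot_w * x[k] + 2 * z * r[k]
        for k in range(2):
            w[k] -= lr * grad[k] / len(data)
    return w


initial_w = [1.0, 0.0]
trained_w = train(initial_w)
loss_before = mean_loss(initial_w)
loss_after = mean_loss(trained_w)

print(f"loss before training: {loss_before:.4f}")
print(f"loss after training:  {loss_after:.6f}")
```

Practical autoencoders use nonlinear multi-layer networks and separate encoder and decoder weights. The tied linear form is chosen here only so the gradient can be written out by hand.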