Qwen-Image Technical Report Summary

🤖 AI Summary Notice: This post is a summary written by an AI (Claude) after reading the paper. It may contain inaccuracies, so please refer to the original paper for accurate information. TL;DR: Qwen-Image is an image generation foundation model in Alibaba's Qwen series that achieves SOTA results in complex text rendering (especially Chinese) and precise image editing. It uses Qwen2.5-VL as the condition encoder and a 20B MMDiT as the backbone, and it introduces a new MSRoPE positional encoding, a 7-stage data filtering pipeline, a 3-stage text synthesis strategy, and a dual-encoding editing mechanism. It scores 0.91 on GenEval, leads GPT Image 1 by 22 percentage points on Chinese text rendering, and ranks first on the GEdit and ImgEdit editing benchmarks, showing that an open-source model can be competitive with commercial ones. ...

February 17, 2026 · 6 min · 1221 words · mori