Mori’s blog

Hi, my name’s mori, and I’m going to write posts about AI and all fun stuffs.

Attention Is All You Need ์š”์•ฝ

๐Ÿค– AI Summary Notice ์ด ๊ธ€์€ AI(Claude)๊ฐ€ ๋…ผ๋ฌธ์„ ์ฝ๊ณ  ์ž‘์„ฑํ•œ ์š”์•ฝ์ž…๋‹ˆ๋‹ค. ๋ถ€์ •ํ™•ํ•œ ๋‚ด์šฉ์ด ์žˆ์„ ์ˆ˜ ์žˆ์œผ๋‹ˆ, ์ •ํ™•ํ•œ ์ •๋ณด๋Š” ์›๋ฌธ์„ ์ฐธ๊ณ ํ•ด์ฃผ์„ธ์š”. ...

February 17, 2026 ยท 6 min ยท 1191 words ยท mori

Proximal Policy Optimization Algorithms ์š”์•ฝ

๐Ÿค– AI Summary Notice ์ด ๊ธ€์€ AI๊ฐ€ ๋…ผ๋ฌธ์„ ์ฝ๊ณ  ์ž‘์„ฑํ•œ ์š”์•ฝ์ž…๋‹ˆ๋‹ค. ๋ถ€์ •ํ™•ํ•œ ๋‚ด์šฉ์ด ์žˆ์„ ์ˆ˜ ์žˆ์œผ๋‹ˆ, ์ •ํ™•ํ•œ ์ •๋ณด๋Š” ์›๋ฌธ์„ ์ฐธ๊ณ ํ•ด์ฃผ์„ธ์š”. ...

February 17, 2026 ยท 2 min ยท 369 words ยท mori

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning ์š”์•ฝ

๐Ÿค– AI Summary Notice ์ด ๊ธ€์€ AI(Claude)๊ฐ€ ๋…ผ๋ฌธ์„ ์ฝ๊ณ  ์ž‘์„ฑํ•œ ์š”์•ฝ์ž…๋‹ˆ๋‹ค. ๋ถ€์ •ํ™•ํ•œ ๋‚ด์šฉ์ด ์žˆ์„ ์ˆ˜ ์žˆ์œผ๋‹ˆ, ์ •ํ™•ํ•œ ์ •๋ณด๋Š” ์›๋ฌธ์„ ์ฐธ๊ณ ํ•ด์ฃผ์„ธ์š”. ...

February 17, 2026 ยท 5 min ยท 972 words ยท mori

Recursive Language Models ์š”์•ฝ

๐Ÿค– AI Summary Notice ์ด ๊ธ€์€ AI(Claude)๊ฐ€ ๋…ผ๋ฌธ์„ ์ฝ๊ณ  ์ž‘์„ฑํ•œ ์š”์•ฝ์ž…๋‹ˆ๋‹ค. ๋ถ€์ •ํ™•ํ•œ ๋‚ด์šฉ์ด ์žˆ์„ ์ˆ˜ ์žˆ์œผ๋‹ˆ, ์ •ํ™•ํ•œ ์ •๋ณด๋Š” ์›๋ฌธ์„ ์ฐธ๊ณ ํ•ด์ฃผ์„ธ์š”. TL;DR Recursive Language Models(RLM)์€ ๊ธด ํ”„๋กฌํ”„ํŠธ๋ฅผ LLM์— ์ง์ ‘ ๋„ฃ๋Š” ๋Œ€์‹ , ์™ธ๋ถ€ ํ™˜๊ฒฝ์˜ ์ผ๋ถ€๋กœ ์ทจ๊ธ‰ํ•˜์—ฌ LLM์ด ํ”„๋กœ๊ทธ๋ž˜๋ฐ์ ์œผ๋กœ ํ”„๋กฌํ”„ํŠธ๋ฅผ ํƒ์ƒ‰ยท๋ถ„ํ•ดํ•˜๊ณ  ์ž๊ธฐ ์ž์‹ ์„ ์žฌ๊ท€์ ์œผ๋กœ ํ˜ธ์ถœํ•˜๋Š” ์ถ”๋ก  ํ”„๋ ˆ์ž„์›Œํฌ์ด๋‹ค. RLM์€ ๊ธฐ์กด ์ปจํ…์ŠคํŠธ ์œˆ๋„์šฐ๋ณด๋‹ค ๋‘ ์ž๋ฆฟ์ˆ˜(100๋ฐฐ) ์ด์ƒ ๊ธด ์ž…๋ ฅ์„ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ์†Œ๊ทœ๋ชจ ๋ชจ๋ธ(RLM-Qwen3-8B)์ด ๊ธฐ๋ณธ ๋ชจ๋ธ ๋Œ€๋น„ ํ‰๊ท  28.3% ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋‹ฌ์„ฑํ•˜๊ณ  ์ผ๋ถ€ ๋ฒค์น˜๋งˆํฌ์—์„œ GPT-5์— ๊ทผ์ ‘ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์˜€๋‹ค. ...

February 16, 2026 ยท 4 min ยท 797 words ยท mori

Vibe Coding์œผ๋กœ ๋งŒ๋“œ๋Š” ์›น ๊ฒŒ์ž„

Number Game: ๋ฐ”์ด๋ธŒ ์ฝ”๋”ฉ์œผ๋กœ ๋งŒ๋“  ์ˆซ์ž ํผ์ฆ ๊ฒŒ์ž„...

July 15, 2025 ยท 6 min ยท 1103 words ยท mori

์ด๊ฑฐ ์–ด๋–ป๊ฒŒ ๋งŒ๋“  ๊ฑฐ์—์š”?

Hugo ๋ธ”๋กœ๊ทธ๋ฅผ ๋งŒ๋“ค๊ณ  ์‹ถ์–ด์„œ ๋‚ด๊ฐ€ ๋ช‡ ๋…„ ๋™์•ˆ ๋งŽ์ด ์ฝ์—ˆ๋˜ Lilian Wengโ€™s Blog๋ฅผ ์ฐธ๊ณ ํ•˜๊ธฐ๋กœ ํ–ˆ๋‹ค. ...

June 22, 2025 ยท 3 min ยท 590 words ยท mori

Sutton์˜ RL ์ฑ… ํ•œ ์žฅ ์š”์•ฝ

Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew Barto: link ...

June 15, 2025 ยท 5 min ยท 1028 words ยท mori