The following papers have been accepted to COLM 2025.
- Kazuki Yano, Sho Takase, Sosuke Kobayashi, Shun Kiyono and Jun Suzuki.
  “Efficient Construction of Model Family through Progressive Training Using Model Expansion”
- Wataru Ikeda, Kazuki Yano, Ryosuke Takahashi, Jaesung Lee, Keigo Shibata and Jun Suzuki.
  “Layer-wise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models”
- Sho Takase, Shun Kiyono, Sosuke Kobayashi and Jun Suzuki.
  “Spike No More: Stabilizing the Pre-training of Large Language Models”