iconLogo
Published:2025/12/3 19:00:00

OmniJet-a爆誕!分類も得意な最強AI✨

超要約: ジェット物理学AI、分類能力UP!ビジネスも加速🚀

✨ ギャル的キラキラポイント ✨ ● ジェット(素粒子)分析AIをさらに進化させたってコト💖 ● 生成AIの得意技はそのままに、分類も得意になった最強AIなの😍 ● データ分析、ビジネス、いろんなとこで活躍しそうじゃん?😎

詳細解説いくねー!

背景 LHC(世界最大の加速器)の実験データとかを分析する「ジェット物理学」っていう分野で、OmniJet-aっていうAIモデルが活躍してたのね!でも、分類(データをカテゴリー分けする事)はちょっと苦手だったみたい😢

続きは「らくらく論文」アプリで

Enhancing next token prediction based pre-training for jet foundation models

Joschka Birk / Anna Hallin / Gregor Kasieczka / Nikol Madzharova / Ian Pang / David Shih

Next token prediction is an attractive pre-training task for jet foundation models, in that it is simulation free and enables excellent generative capabilities that can transfer across datasets. Here we study multiple improvements to next token prediction, building on the initial work of OmniJet-$\alpha$. Instead of tokenizing particles and subsequently only using the token-ID as the model input for both the generative and the classification task, we adopt a hybrid setup, which allows us to use continuous feature vectors as model input while only using token-IDs in the next token prediction target. Secondly, we explore a combined pre-training strategy that combines masked particle modeling and generative learning objectives. Taken together, these changes greatly improve the performance in downstream classification tasks without any loss in generative performance.

cs / hep-ph / cs.LG / hep-ex / physics.data-an