テキストから高品質な3Dモーション(動き)を生成するスゴい技術!VRとかゲームとか、色んな分野で役立つよ!✨
● テキストから自然な動きを生成できるって、マジ神じゃん? ● 全身の動きがめっちゃ自然で、多様な表現ができるんだって! ● モーション(動き)の長さを気にしなくていいのが、超便利!
• 背景 テキストから人間の動きを生成する技術は、色んな分野で求められてたけど、難しかったんだよね~。BiPOは、その問題を解決するために開発されたんだって!アニメとかゲームとか、VRとか、色んなコンテンツ制作がめっちゃ楽になる予感♪
• 方法 BiPOは、「Part-based」と「Bidirectional」っていう2つの技術を組み合わせてるの!体のパーツごとに動きを生成して、全体で自然な動きになるように調整してるんだって。さらに、部分的な情報を隠す「Partial Occlusion (PO)」技術も使って、動きのバリエーションを増やしてるんだって!
続きは「らくらく論文」アプリで
Generating natural and expressive human motions from textual descriptions is challenging due to the complexity of coordinating full-body dynamics and capturing nuanced motion patterns over extended sequences that accurately reflect the given text. To address this, we introduce BiPO, Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis, a novel model that enhances text-to-motion synthesis by integrating part-based generation with a bidirectional autoregressive architecture. This integration allows BiPO to consider both past and future contexts during generation while enhancing detailed control over individual body parts without requiring ground-truth motion length. To relax the interdependency among body parts caused by the integration, we devise the Partial Occlusion technique, which probabilistically occludes the certain motion part information during training. In our comprehensive experiments, BiPO achieves state-of-the-art performance on the HumanML3D dataset, outperforming recent methods such as ParCo, MoMask, and BAMM in terms of FID scores and overall motion quality. Notably, BiPO excels not only in the text-to-motion generation task but also in motion editing tasks that synthesize motion based on partially generated motion sequences and textual descriptions. These results reveal the BiPO's effectiveness in advancing text-to-motion synthesis and its potential for practical applications.