Title & ultra-quick summary: CVC levels up image editing ✨ Say goodbye to distortion & quality loss!
Detailed explanation
Background: Recent AI image editing is impressive, but editing can distort the image, degrade its quality, or just not come out the way you wanted… so frustrating 😭
Method: They developed a new method called CVC! It edits in the image's latent space (the hidden part), and uses special calculations to fix distortion and improve image quality! ✨
Results: Even after editing, distortion is reduced and the image stays clean! Tell it "pink hair" in text and the hair really turns pink, amazing 🤩
Read the rest in the 「らくらく論文」 app
Recent methods in flow-based diffusion editing have enabled direct transformations between the source and target image distributions without explicit inversion. However, the latent trajectories in these methods often accumulate velocity errors, leading to semantic inconsistency and loss of structural fidelity. We propose Conditioned Velocity Correction (CVC), a principled framework that reformulates flow-based editing as a distribution-transformation problem driven by a known source prior. CVC rethinks the role of velocity in inter-distribution transformation by introducing a dual-perspective velocity conversion mechanism. This mechanism explicitly decomposes the latent evolution into two components: a structure-preserving branch that remains consistent with the source trajectory, and a semantically guided branch that drives a controlled deviation toward the target distribution. The conditional velocity field exhibits an absolute velocity error relative to the true underlying distribution trajectory, which introduces potential instability and trajectory drift in the latent space. To compensate for this quantifiable deviation and maintain fidelity to the true flow, we apply a posterior-consistent update to the resulting conditional velocity field. This update, derived from empirical Bayes inference and the Tweedie correction, provides mathematically grounded error compensation over time. Our method yields stable and interpretable latent dynamics, achieving faithful reconstruction alongside smooth local semantic conversion. Comprehensive experiments demonstrate that CVC consistently achieves superior fidelity, better semantic alignment, and more reliable editing behavior across diverse tasks.
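The abstract does not give CVC's actual equations, but the two ingredients it names have standard textbook forms: a velocity field blended from a structure-preserving branch and a semantically guided branch, and Tweedie's formula for a posterior-mean (denoised) estimate. The sketch below is a minimal illustration under those standard conventions, not the paper's implementation; the function names and the `guidance` weight are hypothetical.

```python
import numpy as np

def combined_velocity(v_src, v_tgt, x, t, guidance=0.5):
    """Blend two velocity branches at latent state x and time t.

    v_src : structure-preserving branch (consistent with the source trajectory)
    v_tgt : semantically guided branch (deviation toward the target distribution)
    guidance : hypothetical scalar controlling how far the trajectory
               deviates from the source flow (0 = pure source, 1 = pure target).
    """
    return (1.0 - guidance) * v_src(x, t) + guidance * v_tgt(x, t)

def tweedie_correction(x, score, sigma):
    """Tweedie's formula: posterior mean of the clean signal given a noisy
    observation x with Gaussian noise level sigma and score = grad log p(x).

    E[x0 | x] = x + sigma^2 * score(x)
    """
    return x + sigma**2 * score
```

As a sanity check of Tweedie's formula: if `x` is drawn around a clean point `mu` with noise level `sigma`, the exact Gaussian score is `(mu - x) / sigma**2`, and plugging it in recovers `mu` exactly. In CVC-style pipelines such a posterior-consistent estimate would be used to pull drifting latents back toward the true flow at each integration step.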