PhysDepth shrugs off bad weather! A big boost for monocular depth estimation (MDE) 🚀
1. Title & Super-Summary PhysDepth handles bad weather too! A physics-based framework that improves monocular depth estimation ✨
2. Sparkly Highlight Points ✨ ● Being able to measure distance from images even in bad weather is seriously amazing! 😳 ● Baking physical principles (like Rayleigh scattering and the Beer-Lambert law) into the AI feels clever and cool 💖 ● It's also plug-and-play, so you can just bolt it onto existing AI models, which is super handy 🎶
3. Detailed Explanation ● Background For autonomous driving and AR/VR (augmented/virtual reality), accurately measuring the distance to your surroundings is crucial, right? But when rain or snow degrades visibility, the AI struggles to estimate distances correctly — that was the problem 😢
● Method Enter PhysDepth! It estimates distance by accounting for color shifts in the image and the physical behavior of light. Concretely, it uses two clever components: PPM (which extracts features from the red color channel) and RCA (a loss function that models light attenuation) ✨
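The Beer-Lambert law mentioned above says that light traveling a distance d through a scattering medium (rain, fog, water) is attenuated exponentially: I(d) = I0 · exp(−β·d). Here is a minimal sketch of that relation and its inversion, which is the basic physical idea behind recovering distance from attenuation; the function names are hypothetical and this is not PhysDepth's actual PPM/RCA implementation.

```python
import numpy as np

def beer_lambert_attenuation(intensity, depth, beta):
    """Beer-Lambert law: observed intensity after traveling `depth`
    through a medium with attenuation coefficient `beta`:
    I(d) = I0 * exp(-beta * d)."""
    return intensity * np.exp(-beta * depth)

def depth_from_attenuation(observed, source, beta):
    """Invert the Beer-Lambert law to recover distance from the
    ratio of observed to source intensity (assumes beta is known)."""
    return -np.log(observed / source) / beta

# Round trip: attenuate light over 10 m, then recover the distance.
observed = beer_lambert_attenuation(1.0, 10.0, beta=0.05)
recovered = depth_from_attenuation(observed, 1.0, beta=0.05)
```

Because attenuation coefficients are wavelength-dependent, different color channels fade at different rates, which is why a channel-aware module can carry distance information.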
Self-supervised semantic segmentation methods often fail when faced with appearance ambiguities. We argue that this is due to an over-reliance on unstable, appearance-based features such as shadows, glare, and local textures. We propose \textbf{GASeg}, a novel framework that bridges appearance and geometry by leveraging stable topological information. The core of our method is the Differentiable Box-Counting (\textbf{DBC}) module, which quantifies multi-scale topological statistics from two parallel streams: geometric-based features and appearance-based features. To force the model to learn these stable structural representations, we introduce Topological Augmentation (\textbf{TopoAug}), an adversarial strategy that simulates real-world ambiguities by applying morphological operators to the input images. A multi-objective loss, \textbf{GALoss}, then explicitly enforces cross-modal alignment between geometric-based and appearance-based features. Extensive experiments demonstrate that GASeg achieves state-of-the-art performance on four benchmarks, including COCO-Stuff, Cityscapes, and PASCAL, validating our approach of bridging geometry and appearance via topological information.
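The DBC module builds on the classical box-counting statistic: cover a binary pattern with grids of boxes at several scales, count the occupied boxes N(s) at each box size s, and read the fractal dimension off the slope of log N(s) versus log(1/s). The sketch below shows the classical, non-differentiable version of that statistic (the paper's differentiable formulation is not reproduced here, and these function names are illustrative only).

```python
import numpy as np

def box_count(mask, box_size):
    """Count boxes of side `box_size` containing at least one
    foreground pixel of a 2-D binary mask (classical box counting)."""
    h, w = mask.shape
    count = 0
    for i in range(0, h, box_size):
        for j in range(0, w, box_size):
            if mask[i:i + box_size, j:j + box_size].any():
                count += 1
    return count

def box_counting_dimension(mask, box_sizes=(1, 2, 4, 8, 16)):
    """Estimate the box-counting (fractal) dimension as the slope of
    log N(s) against log(1/s) over the given box sizes."""
    sizes = np.array(box_sizes, dtype=float)
    counts = np.array([box_count(mask, int(s)) for s in sizes], dtype=float)
    slope, _ = np.polyfit(np.log(1.0 / sizes), np.log(counts), 1)
    return slope

# A fully filled 32x32 mask is a plain 2-D region, so the estimated
# dimension is 2.
dim = box_counting_dimension(np.ones((32, 32), dtype=bool))
```

Making this counting step soft (e.g. replacing the hard occupancy test with a smooth pooling) is what would let gradients flow through such a statistic during training.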