iconLogo
Published:2025/10/23 8:49:56

合成データと現実の差を測定!🌟

超要約:自動運転データの質を上げる画期的手法✨

● ギャップ(差)を数値化! ● スタイル(見た目)の違いに注目! ● 開発コスト削減にも貢献!

詳細解説いくよ~!

背景 自動運転🚗の訓練(トレ)には、現実世界のデータだけじゃなく、シミュレーションで作った合成データも使うよ! でも、合成データと現実のデータにはズレ(ギャップ)があるから、そのせいで自動運転の性能が落ちちゃうことも…💦

続きは「らくらく論文」アプリで

A Style-Based Profiling Framework for Quantifying the Synthetic-to-Real Gap in Autonomous Driving Datasets

Dingyi Yao / Xinyao Han / Ruibo Ming / Zhihang Song / Lihui Peng / Jianming Hu / Danya Yao / Yi Zhang

Ensuring the reliability of autonomous driving perception systems requires extensive environment-based testing, yet real-world execution is often impractical. Synthetic datasets have therefore emerged as a promising alternative, offering advantages such as cost-effectiveness, bias free labeling, and controllable scenarios. However, the domain gap between synthetic and real-world datasets remains a major obstacle to model generalization. To address this challenge from a data-centric perspective, this paper introduces a profile extraction and discovery framework for characterizing the style profiles underlying both synthetic and real image datasets. We propose Style Embedding Distribution Discrepancy (SEDD) as a novel evaluation metric. Our framework combines Gram matrix-based style extraction with metric learning optimized for intra-class compactness and inter-class separation to extract style embeddings. Furthermore, we establish a benchmark using publicly available datasets. Experiments are conducted on a variety of datasets and sim-to-real methods, and the results show that our method is capable of quantifying the synthetic-to-real gap. This work provides a standardized profiling-based quality control paradigm that enables systematic diagnosis and targeted enhancement of synthetic datasets, advancing future development of data-driven autonomous driving systems.

cs / cs.CV