SpatiaLoc！テキストと3Dデータで場所特定✨

Published：2026/1/7 4:50:39

タイトル & 超要約：SpatiaLoc！テキストと3Dデータで場所特定✨

最強ギャル解説、いくよ～！💕

1. キラキラポイント✨：

● テキスト指示(しじ)で場所を特定（とくてい）できるんだって！🤖💖まるで未来！
● 3Dデータと組み合わせるから、すっごい正確（せいかく）なんだって！✨
● 自動運転（じどううんてん）とか、色んな分野で活躍（かつやく）する予感！🚗💨

2. 詳細解説：

続きは「らくらく論文」アプリで

SpatiaLoc: Leveraging Multi-Level Spatial Enhanced Descriptors for Cross-Modal Localization

Tianyi Shang / Pengjie Xu / Zhaojun Deng / Zhenyu Li / Zhicong Chen / Lijun Wu

Cross-modal localization using text and point clouds enables robots to localize themselves via natural language descriptions, with applications in autonomous navigation and interaction between humans and robots. In this task, objects often recur across text and point clouds, making spatial relationships the most discriminative cues for localization. Given this characteristic, we present SpatiaLoc, a framework utilizing a coarse-to-fine strategy that emphasizes spatial relationships at both the instance and global levels. In the coarse stage, we introduce a Bezier Enhanced Object Spatial Encoder (BEOSE) that models spatial relationships at the instance level using quadratic Bezier curves. Additionally, a Frequency Aware Encoder (FAE) generates spatial representations in the frequency domain at the global level. In the fine stage, an Uncertainty Aware Gaussian Fine Localizer (UGFL) regresses 2D positions by modeling predictions as Gaussian distributions with a loss function aware of uncertainty. Extensive experiments on KITTI360Pose demonstrate that SpatiaLoc significantly outperforms existing state-of-the-art (SOTA) methods.

cs / cs.CV

Arxivで見る