Abstract: In visual-inertial simultaneous localization and mapping (VI-SLAM), visual residuals are typically formulated using multiview geometry, parameterizing both camera poses and scene feature ...
This repo contains the official PyTorch implementation for paper Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding. Look here for 中文解读. conda create -n TSP3D python=3.9 conda activate ...
@inproceedings{steinke2025curbosg, author={Steinke, Tim and Büchner, Martin and Vödisch, Niclas and Valada, Abhinav}, booktitle={2025 IEEE/RSJ International Conference on Intelligent Robots and ...
Abstract: Learning to build 3D scene graphs is essential for real-world perception in a structured and rich fashion. However, previous 3D scene graph generation methods utilize a fully supervised ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results