Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
Paper
•
2502.10392
•
Published
•
6
This repo contains the models for paper Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding. Code is available at: https://github.com/GWxuan/TSP3D
Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
Wenxuan Guo*, Xiuwei Xu*, Ziwei Wang, Jianjiang Feng†, Jie Zhou, Jiwen Lu
* Equal contribution † Corresponding author
In this work, we propose an efficient multi-level convolution architecture for 3D visual grounding. TSP3D achieves superior performance compared to previous approaches in both inference speed and accuracy.