[1] Roboverse RSS Oral: https://roboverseorg.github.io/
[2] A Modality-Invariant Foundation Model for Multimodal Human Sensing, ICLR-25: https://arxiv.org/abs/2410.10167
[3] GERA: Geometric Embedding for Efficient Point Registration Analysis, ICRA-25: https://arxiv.org/abs/2410.00589
[4] Diffusion Model is a Good Pose Estimator from 3D RF-Vision, ECCV-24: https://arxiv.org/abs/2403.16198
[5] Self-supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness, IROS-24: https://arxiv.org/html/2411.06789v1
[6] Multi-Modal Continual Test-Time Adaptation for 3D Semantic Segmentation, ICCV-23: https://arxiv.org/abs/2303.10457