Kalendarium
08
June
Master Thesis Presentation - Albert Mattsson and Pontus Brandin
Albert Mattsson and Pontus Brandin will present their thesis: Explicit model pre-training for 3D occupancy prediction
Examinator: Magnus Oskarsson
Advisors: Viktor Larsson and Johanna Lidholm
Abstract:
Understanding 3D scene structure from camera input is a central challenge in autonomous driving. Recent methods often rely on vision foundation models or annotated occupancy labels to provide strong semantic and geometric supervision, but these dependencies can limit scalability and introduce external biases. This thesis investigates whether an explicit 3D Gaussian scene representation can be pre-trained in a more self-contained manner from multi-view camera data and optional LiDAR supervision. Building on GaussTR, we replace foundation-model feature inputs with a ResNet backbone and replace depth supervision from Metric3D with self-supervised geometric consistency losses, optionally supported by LiDAR depth supervision. The learned representation is evaluated through binary occupancy prediction on Occ3D-nuScenes. Our results show that meaningful explicit 3D representations can be learned without vision foundation models at inference, and that fully self-supervised pre-training remains viable, although with reduced occupancy performance compared to VFM-assisted baselines. When adapted for semantic occupancy prediction, the proposed model achieves competitive results, including stronger mIoU than the original GaussTR baseline in our comparison. The findings suggest that geometric understanding and feature separation are closely linked. Improved geometry facilitates semantic learning, while stronger features support more accurate spatial representations.
Om händelsen
Tid:
2026-06-08 13:15
till
14:00
Plats
MH:Sigma
Kontakt
johanna [dot] lidholm [at] math [dot] lth [dot] se