3D VISION
In a connected world of people, robots and self-driving vehicles, we naturally need to have a good understanding of the 3D world we live in.
Highlights
2021
2 papers at ICCV2021
Co-organizing the 4th Workshop on Long-Term Visual Localization under Changing Conditions at ICCV2021
Invited talk at 3D-DLAD workshop at IEEE IV symposium on ‘Modern methods for visual localization’. Watch the talk
Accepted paper at CVPR:
Large-scale localization datasets in crowded indoor spaces. Comes with dataset below.
Release of world’s biggest indoor localization dataset and a new version of the unified data format kapture!
2020
New Image Retrieval for Visual Localization Benchmark online.
New Kapture toolbox online.
4 outstanding reviewer awards: M. Humenberger (CVPR, 3DV), J. Revaud (CVPR, NeurIPS)
2 papers accepted at 3DV 2020:
- Benchmarking Image Retrieval for Visual Localization
- SMPLy Benchmarking 3D Human Pose Estimation in the Wild
2 papers accepted at NeurIPS 2020:
- SuperLoss: A Generic Loss for Robust Curriculum Learning
- Hard Negative Mixing for Contrastive Learning
IJCV: Volume Sweeping: Learning Photoconsistency for Multi-View Shape Reconstruction
2nd place in Long-Term Visual Localization under Changing Conditions held at ECCV20 (paper).
Kapture: Release of a unified data format and processing pipeline for structure from motion and visual localization (paper).
2 papers ECCV 2020:
- DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild by Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Gregory Rogez
- Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction (various authors, among them Philippe Weinzaepfel, Romain Brégier, Gregory Rogez).
IROS 2020 paper: Self-Supervised Attention Learning for Depth and Ego-motion Estimation
IEEE Access journal: The IPIN 2019 Indoor Localisation Competition – Description and Results by F. Potorti, B. Chidlovskii, L. Antsfeld, et al.
1st, 2nd and 4th in CVPR Long Term Visual Localization Challenge.
New version of popular synthetic dataset Virtual KITTI available.
Related Content
Datasets
In a connected world of people, robots and self-driving vehicles, we naturally need to have a good understanding of the 3D world we live in.
Results in the tasks related to this understanding such as 3D reconstruction, mapping and visual localization have been getting better and better. In reconstructing the geometry of the world as accurately as possible, it’s common practice to use sensors such as LIDAR, radar and, of course, cameras. This is because geometry is pretty well understood and one needs to ‘measure the world’ for many applications. However, progress in only using geometry to solve 3D vision tasks has been declining and the methods that exist today are not sufficiently robust for everyday situations such as changing environments and weather conditions.
One reason for this lack of robustness is that not everything can be measured or described in a way a computer can reliably detect it. Furthermore, even if a scene were to be perfectly reconstructed, there’s no guarantee that a computer would understand, analyse and interpret it correctly.
A popular strategy of the computer vision community to overcome these problems, is to use machine learning techniques rather than hand-crafted approaches and their success has proven it to be a good choice. There have been some outstanding results in topics such as image categorization, image retrieval and object detection.
However, geometric properties constitute a significant part of the world and we believe they should not be neglected entirely in favour of learning. Our strategy therefore combines both approaches.
We want to learn what we cannot measure.
The research focus of the 3D Vision team lies on the design of methods which combine geometry and learning-based approaches to solve specific real-world challenges such as visual localization, camera pose estimation and 3D reconstruction. Examples for our target applications are robot navigation, indoor mapping, augmented reality (AR) and, more generally speaking, systems which enable ambient intelligence in day to day life.