Andrea Simonelli
I am a Research Engineer at Meta in Zurich (Switzerland), where I work on computer vision and machine learning.
I did my PhD at the University of Trento (Italy), with a research grant co-funded by Mapillary Research and Fondazione Bruno Kessler (FBK). My advisors were prof. Elisa Ricci (University of Trento) and dr. Samuel Rota Bulo' (Meta). My research focused on image-based 3D Object Detection, as well as its combination with Neural Radiance Fields (NeRFs).
Email  / 
Google Scholar
|
|
|
AutoRF: Learning 3D Object Radiance Fields from Single View Observations
Norman Müller,
Andrea Simonelli,
Lorenzo Porzi,
Samuel Rota Bulò,
Matthias Nießner,
Peter Kontschieder,
CVPR, 2022
Project page
From just a single view, we learn neural 3D object representations for free novel view synthesis. This setting is in stark contrast to the majority of existing works that leverage multiple views of the same object, employ explicit priors during training, or require pixel-perfect annotations. Our method decouples object geometry, appearance, and pose enabling generalization to unseen objects, even across different datasets of challenging real-world street scenes such as nuScenes, KITTI, and Mapillary Metropolis.
|
|
3D Object Detection from Images for Autonomous Driving: A Survey
Xinzhu Ma,
Wanli Ouyang,
Andrea Simonelli,
Elisa Ricci
Arxiv, 2022
Arxiv
3D object detection from images, one of the fundamental and challenging problems in autonomous driving, has received increasing attention from both industry and academia in recent years. Benefiting from the rapid development of deep learning technologies, image-based 3D detection has achieved remarkable progress. Particularly, more than 200 works have studied this problem from 2015 to 2021, encompassing a broad spectrum of theories, algorithms, and applications. However, to date no recent survey exists to collect and organize this knowledge. In this paper, we fill this gap in the literature and provide the first comprehensive survey of this novel and continuously growing research field, summarizing the most commonly used pipelines for image-based 3D detection and deeply analyzing each of their components.
|
|
Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection?
Andrea Simonelli,
Samuel Rota Bulò,
Lorenzo Porzi,
Peter Kontschieder,
Elisa Ricci
ICCV, 2021
PDF
We closely analyze a novel branch of 3D Object Detection methods from Images i.e. Pseudo-LiDAR based methods. We identify a flaw in their widely popular training protocol, which introduces a distorted perception about their performance. On top of this, we propose to introduce a novel Absolute and Relative 3D Confidence module which is demonstrated to increase 3D Detection performance substantially.
|
|
Towards Generalization Across Depth for Monocular 3D Object Detection
Andrea Simonelli,
Samuel Rota Bulò,
Lorenzo Porzi,
Elisa Ricci,
Peter Kontschieder
ECCV, 2020
arXiv
We propose to perform the detection, instead of on the usual full-resolution image, on a series of Virtual Views. Virtual Views make the appearence of the objects invariant with respect to distance, easing the overall task. We also propose a single-stage, lightweight architecture called MoVi-3D. We achieve state-of-the-art results on the popular KITTI3D benchmark.
|
|
Disentangling Monocular 3D Object Detection: From Single to Multi-Class Recognition
Andrea Simonelli,
Samuel Rota Bulò,
Lorenzo Porzi,
Manuel Lòpez Antequera,
Peter Kontschieder
TPAMI, 2020
IEEExplore
We extend our previous ICCV19 work to the multi-class scenario. In particular, we apply the MonoDIS detector to the challenging nuScenes dataset, achieving comparable results with a LiDAR baseline. In this work we also report updated scores on KITTI3D.
|
|
Disentangling Monocular 3D Object Detection
Andrea Simonelli,
Samuel Rota Bulò,
Lorenzo Porzi,
Manuel Lòpez Antequera,
Peter Kontschieder
ICCV, 2019
PDF
We propose a disentangling transformation which simplifies the training dynamics in the presence of losses with complex interactions of parameters, sidestepping the issue of balancing independent regression terms. We also propose a self-supervised 3D confidence which is very useful to understand the quality of the predicted 3D bounding boxes. In addition, we resolve a flaw in the (now deprecated) KITTI3D metric which affected all the previously published results. Our proposed metric is now the official KITTI3D metric, used to evaluate all methods on the KITTI3D benchmark.
|
|
Increasingly specialized ensemble of convolutional neural networks for fine-grained recognition
Andrea Simonelli,
Francesco De Natale,
Stefano Messelodi,
Samuel Rota Bulò
ICIP, 2018 (Oral presentation)
IEEEXplore
We propose a simple method for fine-grained recognition that exploits a nearly cost-free attention-based focus operation to construct an ensemble of increasingly specialized Convolutional Neural Networks.
|
|
Advanced Algorithms Teaching Assistant, prof. Elisa Ricci and dr. Samuel Rota Bulò, Spring 2020
|
The source code for this website has been taken from this nice repo. Last update: July 2023.
|
|