News


Paper accepted at ICRA 2023 · 02/02/2023
Our paper "Embodied Agents for Efficient Exploration and Smart Scene Description" has been accepted to ICRA 2023 (arxiv).

Best paper award at CBMI 2022 · 08/02/2022
Our paper "Retrieval-Augmented Transformer for Image Captioning" has been selected as best paper at the International Conference on Content-based Multimedia Indexing (CBMI 2022)!

Best Student Paper Award at ICIAP 2021 · 05/27/2022
Our paper "Investigating Bidimensional Downsampling in Vision Transformer Models" by Paolo Bruno, Roberto Amoroso, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, and Rita Cucchiara has been selected for the Best Student Paper Award at ICIAP 2021.

Computational Aspects of Deep Learning (CADL) workshop accepted at ECCV · 03/27/2022
Together with NVAITC, we are organizing the second Workshop on Computational Aspects of Deep Learning (CADL), which will be hosted at ECCV 2022. Check out the website.
From Show to Tell: A Survey on Image Captioning accepted at TPAMI · 01/18/2022
Interested in Image Captioning? Our definitive guide to techniques, datasets and variants has been accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence. Check it out!

ELLIS Scholar · 07/29/2021
I have been elected as an ELLIS Scholar in the ELLIS society, the European Laboratory for Learning and Intelligent Systems.

Interview with La Repubblica · 09/23/2020
I have been interviewed by Jaime D'Alessandro on Rep: Scienze, about Gpt-3 and Transformed-based language models. You can read the article here.

Interview at Smart City on Radio24 · 09/11/2019
I have been interviewed by Maurizio Melis on Radio24. You can hear the podcast of the interview here.

LAMV is being used at Facebook to detect harmful content · 08/05/2019
Our solution for matching and detecting copied videos, published in CVPR 2018, is now being used in production scale at Facebook to detect harmful content.
See the official announcement on the Facebook newsroom website, and the Github repository with the source code.
Older news can be found in the news archive.
Recent pre-prints


Towards Sustainable Video Modeling: Progressive Architecture Shrinkage for Action Recognition
M. Tomei, L. Baraldi, G. Fiameni, S. Bronzin, R. Cucchiara

Tell Me What To Describe: Fully-Attentive Iterative Networks for Region-Controlled Image and Video Captioning
M. Cornia, L. Baraldi, R. Cucchiara
Teaching
Complete list is available in the teaching page.
Scalable AI (2023/2024)
Laurea Magistrale in Ingegneria Informatica
Lorenzo Baraldi, Giuseppe Fiameni
AI for Automotive (2022/2023)
Course material
Advanced Automotive Electronics Engineering, Electronics Engineering
Rita Cucchiara, Lorenzo Baraldi
Computer Vision and Cognitive Systems (2022/2023)
Course material
· Upcoming exams
Laurea Magistrale in Ingegneria Informatica
Rita Cucchiara, Lorenzo Baraldi