Francesco Taioli
PhD Candidate in Artificial Intelligence (2026)
I am a PhD Candidate in the National PhD Programme in Artificial Intelligence at the Polytechnic of Turin, supervised by Prof. Marco Cristani and Prof. Alessandro Farinelli.
My research interests broadly lie in Deep Learning, with a particular focus on enhancing the autonomy of intelligent agents. Currently, I am exploring Foundation Models (VLMs, LLMs & VLAs) and their applications in Embodied AI, with the goal of advancing navigation, embodied reasoning, and human-robot interaction.
News
Paper accepted at ICCV 25 🎉
What do human-agent bidirectional interaction, navigation, vision-language models, and hallucinations have in common? Read our new ICCV paper!
I've been recognized as an Outstanding Reviewer for CVPR25!
🤖 I co-organized the Human-aware Embodied AI workshop at IROS 2025!
Oral Presentation at IROS 2024! 🎉
Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation was accepted at IROS 2024, with an oral presentation!
CVPR 2024!
We had a paper accepted at the 5th Annual Embodied AI Workshop @ CVPR 2024. Moreover, we won the MultiON Challenge @ CVPR 2024!
I attended the International Computer Vision Summer School (ICVSS)!
Education & Work
Applied Scientist Intern
Amazon, Madrid, Spain
Visiting Research Student
Simon Fraser University, Vancouver, Canada
National Ph.D. Programme in Artificial Intelligence
Polytechnic of Turin, Turin, Italy
B.Sc. & M.Sc. in Computer Science & Engineering
University of Verona, Verona, Italy
Publications
Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues
F Taioli, E Zorzi, G Franchi, A Castellini, A Farinelli, M Cristani, Y Wang
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection
L Capogrosso*, F Girella*, F Taioli*, MD Chiara, M Aqeel, F Fummi, F Setti, M Cristani
VISAPP - 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 2024
Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation
F Taioli, S Rosa, A Castellini, L Natale, A Del Bue, A Farinelli, M Cristani, Y Wang
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Unsupervised Active Visual Search With Monte Carlo Planning Under Uncertain Detections
F Taioli, F Giuliari, Y Wang, R Berra, A Castellini, A Del Bue, A Farinelli, M Cristani
IEEE Transactions on Pattern Analysis and Machine Intelligence 46 (12), 10375-10389, 2024
I2EDL: Interactive Instruction Error Detection and Localization
F Taioli, S Rosa, A Castellini, L Natale, A Del Bue, A Farinelli, M Cristani, Y Wang
2024 33rd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)
Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language
F Taioli, F Cunico, F Girella, R Bologna, A Farinelli, M Cristani
IEEE/CVF International Conference on Computer Vision, 4669-4674, 2023
Designing Logic Tensor Networks for Visual Sudoku Puzzle Classification
L Morra, A Azzari, L Bergamasco, M Braga, L Capogrosso, F Delrio, et al.
NeSy - Proceedings of the 17th International Workshop on Neural-Symbolic Learning and Reasoning, 2023
SCENE-pathy: Capturing the Visual Selective Attention of People Towards Scene Elements
A Toaiari, F Cunico, F Taioli, A Caputo, G Menegaz, A Giachetti, GM Farinella, M Cristani
ICIAP - International Conference on Image Analysis and Processing, 352-363, 2023