Sandro Papais

prof_pic.jpg

I’m Sandro Papais, a Ph.D. Candidate at the University of Toronto Robotics and AI Lab with Prof. Steven Waslander, an Affiliate Researcher at the Vector Institute, and a Machine Learning Perception Researcher at Zoox. I build spatiotemporal transformer models that let robots perceive and act in real time.

My current research develops end-to-end spatiotemporal models for perception, prediction, and planning that reason over time, preserve object permanence, and focus on decision-critical content — including a recent patent filing on sparse 3D object detection. I also co-authored a book on motion prediction for robotics in Foundations and Trends in Robotics.

Previously, I developed autonomy software for interplanetary spacecraft at NASA JPL, lunar landers and eVTOLs at NGC Aerospace (some of which has now flown to the Moon), and rovers at the European Space Agency. I also developed course materials for the Self-Driving Cars Specialization on Coursera, used by 120k+ learners.

Recognition includes the Qualcomm Innovation Fellowship for autonomous driving foundation models research (one of 266 proposals) and the Ontario Graduate Scholarship. I’ve taken first place at the AutoDrive Challenge II and the Spaceport America Genesis Cup, and my research on autonomy was featured by U of T News.

Graduating August 2026 and exploring new opportunities — get in touch.

Selected Publications

  1. beyondsight.png
    BeyondSight: Object Permanence for End-to-End Autonomous Driving
    Sandro Papais, Letian Wang, Mudit Jain, and 2 more authors
    2026
    Under Review
  2. store3d.png
    SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection
    Sandro Papais, Lezhou Feng, Charles Cossette, and 1 more author
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
  3. trends_motion_prediction.png
    Trends in Motion Prediction Toward Deployable and Generalizable Autonomy
    Letian Wang, Marc-Antoine Lavoie, Sandro Papais, and 13 more authors
    Foundations and Trends in Robotics, 2026
  4. foresight.png
    ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
    Sandro Papais, Letian Wang, Brian Cheong, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2025
  5. swtrack.png
    SWTrack: Multiple Hypothesis Sliding Window 3D Multi-Object Tracking
    Sandro Papais, Robert Ren, and Steven Waslander
    In 2024 IEEE International Conference on Robotics and Automation (ICRA), Oct 2024

Selected Work

Autonomous Driving ML for Robotaxis
Autonomous Driving ML for Robotaxis
Build real-time autonomy models for Zoox's robotaxi fleet, spanning multimodal perception, prediction, and temporal reasoning.
Self-Driving Vehicle Perception Lead
Self-Driving Vehicle Perception Lead
Led U of T aUToronto's perception stack through deployment, helping the team win first place in AutoDrive Challenge II.
Visual Navigation for Lunar Landers
Visual Navigation for Lunar Landers
Developed visual navigation and guidance software for lunar lander systems at NGC Aerospace, with work flown on Firefly's Blue Ghost mission for NASA's CLPS program.