Heng Yang Harvard University
时间： 2022-11-03 10:00-2022-11-03 11:00
地点：Zoom meeting (https://mit.zoom.us/j/94079983299)
Deep learning has witnessed great success in computer vision, and large-scale data-driven models are becoming prevalent in modern machine perception systems. However, a critical drawback of learned perception modules is that they come with little performance guarantees and are often brittle when deployed in complex environments. Although a growing body of literature is trying to endow deep learning with robustness guarantees, most methods either face scalability limitations or require extra assumptions on the structure of the problem.
In this talk, I advocate conformal prediction as a general and scalable framework to equip black-box perception models with statistical guarantees.
I will first review the statistical machinery of inductive conformal prediction (ICP) that, given a calibration set and a nonconformity function, can convert any learned prediction function into a prediction set that provably covers the true label with a user-specified probability (e.g., 90%) under the mild condition of exchangeability.
I will then present in detail a case study to demonstrate how ICP can be applied to the problem of image-based object detection and pose estimation. In the first stage, with properly designed nonconformity functions, we use ICP to successfully conformalize state-of-the-art keypoint detectors to obtain circular or elliptical prediction sets with guaranteed probabilistic coverage of the true keypoints. In the second stage, we use geometric techniques to propagate the uncertainty in keypoints to derive uncertainty in pose estimation — the result of which is a pose uncertainty set (PURSE) that probabilistically contains the ground truth pose. We demonstrate correct pose estimation and uncertainty quantification on the LineMOD occlusion dataset. I conclude by remarking how this opens up new opportunities in designing perception-based control systems with performance guarantees.
Joint work with Marco Pavone from Stanford/NVIDIA.
Heng Yang is an incoming Assistant Professor of Electrical Engineering in the School of Engineering and Applied Sciences at Harvard University. He is currently a Research Scientist at the NVIDIA Autonomous Vehicle Research Group. He holds a B.S. from Tsinghua University, an S.M. and a Ph.D. from MIT, all in Mechanical Engineering. His research interests include large-scale convex optimization, semidefinite relaxation, statistics, and machine learning, applied to robotics and trustworthy autonomy. Heng Yang is a recipient of the Best Paper Award in Robot Vision at the 2020 IEEE International Conference on Robotics and Automation (ICRA), a Best Paper Award Honorable Mention from the 2020 IEEE Robotics and Automation Letters (RA-L), and a Best Paper Award Finalist at the 2021 Robotics: Science and Systems (RSS) conference. He is a Class of 2021 RSS Pioneer.