Bridging the Gap Between Cloud Infrastructure and Production Data Science

Speaker: Chenggang Wu
Time: 2022-11-25, 14:00-15:00
Location: Tencent Meeting, ID: 362-821-200

Over the last decade, data scientists have mastered the process of building machine learning models. The next frontier is putting these models into production: using them to consistently generate high-quality predictions for use by people, services, and analyses.

Cloud computing infrastructure is crucial to delivering this new frontier. Unfortunately, existing offerings fall short of the challenge of Production Data Science. In this talk, I will cover some of the important promises and weaknesses of current cloud offerings, and describe research from Berkeley's RISELab and the resulting open source Aqueduct system, which are putting Production Data Science at the fingertips of anyone working with data and models.


Chenggang Wu is Co-founder and CTO at Aqueduct, a startup building open source machine learning prediction infrastructure. He earned his Ph.D. from UC Berkeley, advised by Joseph M. Hellerstein. He received best-of-conference citations for research appearing at both VLDB 2019 and ICDE 2018. His Ph.D. dissertation, which focused on scalable cloud infrastructure, won the 2022 ACM SIGMOD Jim Gray Doctoral Dissertation Award.