Wei Xu (徐葳)

Assistant Professor and Ph.D. Advisor

Institute for Interdisciplinary Information Sciences

Tsinghua University

I am an assistant professor at the Institute for Interdisciplinary Information Sciences of Tsinghua University in Beijing. In 2013, I was named a recipient of the National Youth 1000 Program (青年千人计划).

I have broad research interests in distributed system design and big data. My current projects include data center networking, system management and debugging, large-scale systems for machine learning and data mining, and various big data applications.

I received my Ph.D. from UC Berkeley in 2010, where I was a member of the RAD Lab in the EECS Department. My advisors were Prof. David Patterson and Prof. Armando Fox. My dissertation was on analyzing free-text console logs for problem detection. I worked at Google for 2.5 years as a software engineer before joining Tsinghua University.

I am the director of the Open Compute Project (OCP) Certification Lab in China. I am also the director of international partnership for the MOE Research Center for Online Education.

Announcements

Looking for students and postdocs
I am looking for motivated students (undergraduate or master's; sorry, I already have too many Ph.D. students, so I won't be taking any more this year) to work with me on a variety of projects in cloud computing, big data, and distributed systems in general. If you are interested, please drop me an email and we can chat.
I am also looking for postdoc researchers. If you are about to receive a Ph.D. degree in systems or machine learning and would like to get involved in some of our exciting new projects, please get in touch.

Publications

Semi-supervised Learning for Neural Machine Translation
Yong Cheng, Wei Xu, Zhongjun He, Wei He, Hua Wu, Maosong Sun and Yang Liu
In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’16), 2016 [pdf|slides]

Debugging OpenStack Problems Using a State Graph Approach
Yong Xiang, Hu Li, Sen Wang, Charley Peter Chen and Wei Xu
In Proceedings of ACM SIGOPS Asia-Pacific Workshop on Systems (APSys'16) [BEST PAPER AWARD] Hong Kong, China, 2016 [pdf|slides]

Optimizing Hash-based Distributed Storage Using Client Choices
Peilun Li and Wei Xu
In Proceedings of ACM SIGOPS Asia-Pacific Workshop on Systems (APSys'16) Hong Kong, China, 2016 [pdf|slides]

Optimizing Bulk Transfers with Software-Defined Optical WAN
Xin Jin, Yiran Li, Da Wei, Siming Li, Jie Gao, Lei Xu, Guangzhi Li, Wei Xu, Jennifer Rexford
In Proceedings of SIGCOMM 2016, Brazil, August, 2016 [pdf|slides]

A 12-Rack, 180-Server Datacenter Network (DCN) Using Multiwavelength Optical Switching and Full Stack Optimization
Da Wei, Lei Xu, Xin Jin, Yiran Li and Wei Xu
In Optical Fiber Communication Conference (OFC), (Postdeadline Paper PDP) USA, March, 2016 [pdf|slides]

DataLab: A Version Data Management and Analytics System
Yang Zhang, Fangzhou Xu, Erwin Frise, Siqi Wu, Bin Yu and Wei Xu
In Proceedings of the ICSE First International Workshop on BIG Data Software Engineering, Austin, USA, May, 2016 [pdf|slides]

Improving Spark Performance with Zero-copy Buffer Management and RDMA
Hu Li and Wei Xu
In Proceedings of the IEEE INFOCOM First International Workshop on Big Data Sciences, Technologies and Applications (BDSTA 2016), San Francisco, USA, Apr, 2016 [pdf|slides]

cOSPREY: A Cloud-Based Distributed Algorithm for Large-Scale Computational Protein Design
Yuchao Pan, Yuxi Dong, Jingtian Zhou, Mark Hallen, Bruce R. Donald, Jianyang Zeng and Wei Xu
In Journal of Computational Biology (JCB), 2016 [pdf|source code]

Increasing Large-Scale Data Center Capacity by Statistical Power Control
Guosai Wang, Shuhao Wang, Bing Luo, Xin Jin, Yinghang Zhu, Wenjun Yang, Longbo Huang, Weisong Shi, Dianming Hu and Wei Xu
In Proceedings of the European Conference on Computer Systems (EuroSys 2016), London, UK, Apr, 2016 [pdf|slides]

Scalable Kernel TCP Design and Implementation for Short-Lived Connections
Xiaofeng Lin, Yu Chen, Xiaodong Li, Junjie Mao, Wei Xu, Jiaquan He, and Yuanchun Shi
In Proceedings of the 21st ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2016), Atlanta, USA, Apr, 2016 [pdf|slides]

MED: The Monitor-Emulator-Debugger for Software-Defined Networks
Quanquan Zhi, Wei Xu
In Proceedings of the IEEE International Conference on Computer Communications (INFOCOM 2016), San Francisco, USA, Apr, 2016 [pdf|slides]

Inter-Data-Center Network Traffic Prediction with Elephant Flows
Yi Li, Hong Liu, Wenjun Yang, Dianming Hu and Wei Xu
In Proceedings of the IEEE/IFIP Network Operations and Management Symposium (NOMS 2016), Istanbul, Turkey, Apr, 2016 [pdf|slides]

An Efficient Parallel Algorithm for Accelerating Computational Protein Design
Yichao Zhou, Wei Xu, Bruce R. Donald, Jianyang Zeng
In Proceedings of ISMB 2014, Bioinformatics. Boston, Massachusetts, USA, July 2014 [PubMed Link]

Advances and Challenges in Log Analysis
Adam Oliner, Archana Ganapathi, Wei Xu
In Communications of the ACM (CACM) and ACM Queue (invited article), Feb, 2012 [ACM Digital Library Link]

Experience on Mining Google's Production Console Logs
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In the Workshop on Managing Systems via Log Analysis and Machine Learning Techniques (SLAML '10), Vancouver, BC, Oct, 2010 [pdf]

A graphical representation for identifier structure in logs
Ariel Rabkin, Avani Wildani, Randy Katz, Wei Xu, Armando Fox
In the Workshop on Managing Systems via Log Analysis and Machine Learning Techniques (SLAML '10), Vancouver, BC, Oct, 2010 [pdf]

Detecting Large Scale System Problems by Mining Console Logs
Wei Xu
PhD dissertation, UC Berkeley, July, 2010 [pdf]

Using Machine Learning Techniques in Console Log Analysis
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the 27th International Conference on Machine Learning (ICML’10), (Invited application paper) Haifa, Israel, June 2010 [pdf]

Online system problem detection by mining patterns of console logs
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the IEEE International Conference on Data Mining (ICDM’09), Miami, FL, December 2009 [pdf]

Large-scale system problem detection by mining console logs
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the 22nd ACM Symposium on Operating Systems Principles (SOSP’09), Big Sky, MT, October 2009 [pdf]

Mining console logs for large-scale system problem detection
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the 3rd workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML’08), San Diego, CA, December 2008 [pdf]

Regulating workload in J2EE application servers
Wei Xu, Zhangxi Tan, Armando Fox and David Patterson
In Proc. of the 1st International Workshop on Feedback Control Implementation and Design in Computing Systems and Networks (FeBID’06), Vancouver, Canada, April 2006 [pdf]

Predictive control for dynamic resource allocation in enterprise data centers
Wei Xu, Xiaoyun Zhu, Sharad Singhal, and Zhikui Wang
In Proc. of the 10th IEEE/IFIP Network Operations & Management Symposium (NOMS'06), Vancouver, BC, Apr. 2006 [pdf]

Feedback control theory and processing system log streams
Wei Xu
Master's thesis, EECS Department, UC Berkeley, December, 2005 [pdf]

Control considerations for scaling event processing
Wei Xu, Joseph L. Hellerstein, Bill Kramer and David Patterson
In Proc. of the 16th IFIP/IEEE Distributed Systems: Operations and Management (DSOM'05), Barcelona, Spain, October 2005 [pdf]

A flexible framework for statistical learning and data mining from system log streams
Wei Xu, Peter Bodik and David Patterson
In Proc. of Workshop on Temporal Data Mining: Algorithms, Theory and Applications at The Fourth IEEE International Conference on Data Mining (ICDM'04), Brighton, UK, Nov, 2004 [pdf]

Peer-to-Peer support for massively multiplayer games
Bjorn Knutsson, Honghui Lu, Wei Xu and Bryan Hopkins
In Proc. of the 23rd Conference of the IEEE Communications Society (INFOCOM’04), Hong Kong, March 2004 [pdf]

CV

Here is my CV (as of Feb, 2016).

Personal

My vintage technology collection.

Places I have been to.

Photos