Publications

34.

Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng, Ze Zheng. Efficiently Detecting and Optimally Responding Towards Sophisticated Opponents. IJCAI 2019.

33.

Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng, Ze Zheng. Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents. AAMAS 2019.

32.

Guangxiang Zhu, Zichuan Lin, Chongjie Zhang. Episodic Reinforcement Learning with Associated Memory.

31.

Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang. Learning Nearly Decomposable Value Functions Via Communication Minimization.

30.

Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang. Influence-Based Multi-Agent Exploration.

29.

Guangxiang Zhu, Jianhao Wang, Zhizhou Ren, and Chongjie Zhang. Object-Oriented Dynamics Learning through Multi-Level Abstraction. AAAI 2020.

28.

Siyuan Li, Rui Wang, Minxue Tang, and Chongjie Zhang. Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards. NeurIPS 2019.

27.

Siyuan Li, Fangda Gu, Guangxiang Zhu, and Chongjie Zhang. Context-Aware Policy Reuse. AAMAS 2019.

26.

Soo-Jin Moon, Jeffrey Helt, Yifei Yuan, Yves Bieri, Sujata Banerjee, Vyas Sekar, Wenfei Wu, Mihalis Yannakakis, Ying Zhang. Alembic: Automated Model Inference for Stateful Network Functions. NSDI 2019.

25.

Weiran Shen, Binghui Peng, Hanpeng Liu, Michael Zhang, Pingzhong Tang. Reinforcement Mechanism Design: With Applications to Dynamic Pricing in Sponsored Search Auctions. AAAI 2020.

24.

Qingpeng Cai, Ling Pan, Pingzhong Tang. Deterministic Value-Policy Gradients. AAAI 2020.

23.

Feiyang Pan, Qingpeng Cai, Pingzhong Tang, Fuzhen Zhuang, Qing He. Policy Gradients for Contextual Recommendations. WWW 2019.

22.

Feiyang Pan, Shuokai Li, Xiang Ao, Pingzhong Tang, Qing He. Warm Up Cold-start Advertisements: Improving CTR Predictions via Learning to Learn ID Embeddings. SIGIR 2019.

21.

Weiran Shen, Pingzhong Tang, Yulong Zeng. Buyer Signaling Games in Auctions. AAMAS 2019.

20.

Feiyang Pan, Qingpeng Cai, Anxiang Zeng, Chun-Xiang Pan, Qing Da, Hua-Lin He, Qing He, Pingzhong Tang. Policy Optimization with Model-Based Explorations. AAAI 2019.

19.

Shani Alkoby, Zihe Wang, David Sarne, Pingzhong Tang. Making Money from What You Know - How to Sell Information? AAAI 2019.

18.

Binghui Peng, Weiran Shen, Pingzhong Tang, Song Zuo. Learning Optimal Strategies to Commit To. AAAI 2019.

17.

Vahab S. Mirrokni, Renato Paes Leme, Pingzhong Tang, Song Zuo. Optimal Dynamic Auctions Are Virtual Welfare Maximizers. AAAI 2019.