Institute for Interdisciplinary Information Sciences, Tsinghua University

1756. Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu. Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning. Neural Information Processing Systems (NeurIPS), 2022.
1755. Renhao Wang, Hang Zhao, Yang Gao. CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation. European Conference on Computer Vision (ECCV), 2022.
1754. Jinkun Cao, Ruiqian Nai, Qing Yang, Jialei Huang, Yang Gao. An Empirical Study on Disentanglement of Negative-free Contrastive Learning. Neural Information Processing Systems (NeurIPS), 2022.
1753. Zhao-Heng Yin, Weirui Ye, Qifeng Chen, Yang Gao. Planning for Sample Efficient Imitation Learning. Neural Information Processing Systems (NeurIPS), 2022.
1752. T. Hao, J. Zhou, Y. Cheng, L. Huang, H. Wu. A Unified Framework for User Identification Across Online and Offline Data. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022.
1751. J. Huang, Y. Dai, L. Huang. Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits. Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
1750. P. Hu, Y. Chen, L. Huang. Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation. Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
1749. Y. Huang, J. Lin, C. Zhou, H. Yang, L. Huang. Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably). Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
1748. H. Zhou, K. Lv, L. Huang, X. Ma. Quantum Network: Security Assessment and Key Management. IEEE/ACM Transactions on Networking (TON), Volume: 30, Issue: 3, June 2022.
1747. P. Hu, L. Pan, Y. Chen, Z. Fang, L. Huang. Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning. Proceedings of the 23rd ACM International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing (MobiHoc), October 2022.
1746. L. Pan, L. Huang, T. Ma, H. Xu. Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification. Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
1745. Y. Huang, Y. Cheng, Y. Liang, L. Huang. Online Min-max Optimization: Nonconvexity. The 14th International OPT Workshop on Optimization for Machine Learning (NeurIPS-OPT), December 2022.
1744. Y. Cai, C. Zhang, W. Shen, X. He, X. Zhang, L. Huang. Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations. Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM), October 2022.
1743. Y. Huang, Y. Liang, L. Huang. Provable Generalization of Overparameterized Meta-learning Trained with SGD. Proceedings of the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), December 2022.
1742. X. Gu, K. Lyu, L. Huang, S. Arora. Why (and When) does Local SGD Generalize Better than SGD? The 14th International OPT Workshop on Optimization for Machine Learning (NeurIPS-OPT), December 2022.
1741. Shichuan Deng, Jian Li, Yuval Rabani. Approximation algorithms for clustering with dynamic points. Journal of Computer and System Sciences.
1740. Jian Li, Daogao Liu. Multi-token Markov Game with Switching Costs. SODA 2022.
1739. Y. Cai, C. Zhang, W. Shen, X. Zhang, W. Ruan, L. Huang. RePreM: Representation Pre-training with Masked Model for Reinforcement Learning. Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), February 2023.

