- 1756. Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu. Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning. Neural Information Processing Systems (NeurIPS), 2022.
- 1755. Renhao Wang, Hang Zhao, Yang Gao. CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation. European Conference on Computer Vision (ECCV), 2022.
- 1754. Jinkun Cao, Ruiqian Nai, Qing Yang, Jialei Huang, Yang Gao. An Empirical Study on Disentanglement of Negative-free Contrastive Learning. Neural Information Processing Systems (NeurIPS), 2022.
- 1753. Zhao-Heng Yin, Weirui Ye, Qifeng Chen, Yang Gao. Planning for Sample Efficient Imitation Learning. Neural Information Processing Systems (NeurIPS), 2022.
- 1752. T. Hao, J. Zhou, Y. Cheng, L. Huang, H. Wu. A Unified Framework for User Identification Across Online and Offline Data. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022.
- 1751. J. Huang, Y. Dai, L. Huang. Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits. Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
- 1750. P. Hu, Y. Chen, L. Huang. Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation. Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
- 1749. Y. Huang, J. Lin, C. Zhou, H. Yang, L. Huang. Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably). Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
- 1748. H. Zhou, K. Lv, L. Huang, X. Ma. Quantum Network: Security Assessment and Key Management. IEEE/ACM Transactions on Networking (TON), Volume: 30, Issue: 3, June 2022.
- 1747. P. Hu, L. Pan, Y. Chen, Z. Fang, L. Huang. Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning. Proceedings of the 23rd ACM International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing (MobiHoc), October 2022.
- 1746. L. Pan, L. Huang, T. Ma, H. Xu. Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification. Proceedings of the 39th International Conference on Machine Learning (ICML), July 2022.
- 1745. Y. Huang, Y. Cheng, Y. Liang, L. Huang. Online Min-max Optimization: Nonconvexity. The 14th International OPT Workshop on Optimization for Machine Learning (NeurIPS-OPT), December 2022.
- 1744. Y. Cai, C. Zhang, W. Shen, X. He, X. Zhang, L. Huang. Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations. Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM), October 2022.
- 1743. Y. Huang, Y. Liang, and L. Huang. Provable Generalization of Overparameterized Meta-learning Trained with SGD. Proceedings of the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), December 2022.
- 1742. X. Gu, K. Lyu, L. Huang, S. Arora. Why (and When) does Local SGD Generalize Better than SGD? The 14th International OPT Workshop on Optimization for Machine Learning (NeurIPS-OPT), December 2022.
- 1741. Shichuan Deng, Jian Li, Yuval Rabani. Approximation algorithms for clustering with dynamic points. Journal of Computer and System Sciences.
- 1740. Jian Li, Daogao Liu. Multi-token Markov Game with Switching Costs. SODA 2022.
- 1739. Y. Cai, C. Zhang, W. Shen, X. Zhang, W. Ruan, L. Huang. RePreM: Representation Pre-training with Masked Model for Reinforcement Learning. Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), February 2023.