Publications

Preprints

  • Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization. [PDF, arXiv]
    Sijia Chen, Yu-Jie Zhang, Wei-Wei Tu, Peng Zhao, and Lijun Zhang.


Conference Papers

  • Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition. [PDF, arXiv, bibtex]
    Long-Fei Li, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024), Valencia, Spain, 2024. Page: to appear.

  • Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation. [PDF, bibtex]
    Long-Fei Li, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, 2024. Page: 13572-13580.

  • Universal Online Learning with Gradient Variations: A Multi-layer Online Ensemble Approach. [PDF, arXiv, bibtex] (Spotlight)
    Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
    In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 37682-37715.

  • Dynamic Regret of Adversarial Linear Mixture MDPs. [PDF, bibtex]
    Long-Fei Li, Peng Zhao, and Zhi-Hua Zhou.
    In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 60685-60711.

  • Adapting to Continuous Covariate Shift via Online Density Ratio Estimation. [PDF, arXiv, bibtex]
    Yu-Jie Zhang, Zhen-Yu Zhang, Peng Zhao, and Masashi Sugiyama.
    In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 29074-29113.

  • Stochastic Approximation Approaches to Group Distributionally Robust Optimization. [PDF, arXiv, bibtex]
    Lijun Zhang, Peng Zhao, Zhen-Hua Zhuang, Tianbao Yang, and Zhi-Hua Zhou.
    In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 52490-52522.

  • Handling New Class in Online Label Shift. [PDF, bibtex]
    Yu-Yang Qian*, Yong Bai*, Zhen-Yu Zhang, Peng Zhao, and Zhi-Hua Zhou. (* indicates equal contribution)
    In: Proceedings of the 23rd IEEE International Conference on Data Mining (ICDM 2023), Shanghai, China, 2023. Page: 1283-1288.

  • Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization. [PDF, long version, arXiv, bibtex]
    Sijia Chen, Wei-Wei Tu, Peng Zhao, and Lijun Zhang.
    In: Proceedings of the 40th International Conference on Machine Learning (ICML 2023), Hawaii, Honolulu, 2023. Page: 5002-5035.

  • Fast Rates in Time-Varying Strongly Monotone Games. [PDF, bibtex]
    Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 40th International Conference on Machine Learning (ICML 2023), Hawaii, Honolulu, 2023. Page: 39138-39164.

  • Revisiting Weighted Strategy for Non-stationary Parametric Bandits. [PDF, arXiv, bibtex]
    Jing Wang, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023), Valencia, Spain, 2023. Page: 7913-7942.

  • Beyond Performative Prediction: Open-environment Learning with Presence of Corruptions. [PDF, bibtex]
    Jia-Wei Shan, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023), Valencia, Spain, 2023. Page: 7981-7998.

  • Efficient Methods for Non-stationary Online Learning. [PDF, long version, arXiv, bibtex] (Oral)
    Peng Zhao, Yan-Feng Xie, Lijun Zhang, and Zhi-Hua Zhou.
    In: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), New Orleans, Louisiana, 2022. Page: 11573-11585.

  • Adapting to Online Label Shift with Provable Guarantees. [PDF, arXiv, code, bibtex]
    Yong Bai*, Yu-Jie Zhang*, Peng Zhao, Masashi Sugiyama, and Zhi-Hua Zhou. (* indicates equal contribution)
    In: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), New Orleans, Louisiana, 2022. Page: 29960-29974.

  • Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits. [PDF, arXiv, bibtex]
    Haipeng Luo, Mengxiao Zhang, Peng Zhao, and Zhi-Hua Zhou. (alphabetical order)
    In: Proceedings of the 35th Annual Conference on Learning Theory (COLT 2022), London, UK, 2022. Page: 3635-3684.

  • Adaptive Bandit Convex Optimization with Heterogeneous Curvature. [PDF, arXiv, bibtex]
    Haipeng Luo, Mengxiao Zhang, and Peng Zhao. (alphabetical order)
    In: Proceedings of the 35th Annual Conference on Learning Theory (COLT 2022), London, UK, 2022. Page: 1576-1612.

  • No-Regret Learning in Time-Varying Zero-Sum Games. [PDF, arXiv, bibtex]
    Mengxiao Zhang*, Peng Zhao*, Haipeng Luo, and Zhi-Hua Zhou. (* indicates equal contribution)
    In: Proceedings of the 39th International Conference on Machine Learning (ICML 2022), Baltimore, Maryland, 2022. Page: 26772-26808.

  • Dynamic Regret of Online Markov Decision Processes. [PDF, arXiv, full version, bibtex]
    Peng Zhao, Long-Fei Li, and Zhi-Hua Zhou.
    In: Proceedings of the 39th International Conference on Machine Learning (ICML 2022), Baltimore, Maryland, 2022. Page: 26865-26894.

  • Non-stationary Online Learning with Memory and Non-stochastic Control. [PDF, arXiv, bibtex]
    Peng Zhao, Yu-Xiang Wang, and Zhi-Hua Zhou.
    In: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022), online, 2022. Page: 2101-2133.
        ♣ Journal version [PDF] published at JMLR.

  • Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits. [PDF, arXiv, bibtex]
    Youming Tao*, Yulian Wu*, Peng Zhao, and Di Wang. (* indicates equal contribution)
    In: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022), online, 2022. Page: 1546-1574.

  • Improved Analysis for Dynamic Regret of Strongly Convex and Smooth Functions. [PDF, arXiv, bibtex]
    Peng Zhao and Lijun Zhang.
    In: Proceedings of the 3rd Conference on Learning for Dynamics and Control (L4DC 2021), online, 2021. Page: 48-59.

  • Exploratory Machine Learning with Unknown Unknowns. [PDF, code, bibtex]
    Peng Zhao, Yu-Jie Zhang, and Zhi-Hua Zhou.
    In: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), online, 2021. Page: 10999-11006.
        ♣ Journal version [PDF] published at Artificial Intelligence.

  • Towards Enabling Learnware to Handle Unseen Jobs. [PDF, code, bibtex]
    Yu-Jie Zhang, Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), online, 2021. Page: 10964-10972.

  • Storage Fit Learning with Feature Evolvable Streams. [PDF, code, bibtex]
    Bo-Jian Hou, Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), online, 2021. Page: 7729-7736.

  • Dynamic Regret of Convex and Smooth Functions. [PDF, arXiv, bibtex]
    Peng Zhao, Yu-Jie Zhang, Lijun Zhang, and Zhi-Hua Zhou.
    In: Advances in Neural Information Processing Systems 33 (NeurIPS 2020), Vancouver, Canada, 2020. Page: 12510-12520.
        ♣ Journal version [PDF] finally got published at JMLR (with an almost three-year review period..).

  • An Unbiased Risk Estimator for Learning with Augmented Classes. [PDF, arXiv, code, bibtex]
    Yu-Jie Zhang, Peng Zhao, Lanjihong Ma, and Zhi-Hua Zhou.
    In: Advances in Neural Information Processing Systems 33 (NeurIPS 2020), Vancouver, Canada, 2020. Page: 10247-10258.

  • Learning with Feature and Distribution Evolvable Streams. [PDF, code, bibtex]
    Zhen-Yu Zhang, Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
    In: Proceedings of the 37th International Conference on Machine Learning (ICML 2020), Vienna, Austria, 2020. Page: 11317-11327.

  • A Simple Online Algorithm for Competing with Dynamic Comparators. [PDF, bibtex]
    Yu-Jie Zhang, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI 2020), Toronto, Canada, 2020. Page: 390-399.

  • Bandit Convex Optimization in Non-stationary Environments. [PDF, journal, arXiv, bibtex]
    Peng Zhao, Guanghui Wang, Lijun Zhang, and Zhi-Hua Zhou.
    In: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020), Palermo, Italy, 2020. Page: 1508-1518.
        ♣ Journal version [PDF] published at JMLR.

  • A Simple Approach for Non-stationary Linear Bandits. [PDF, arXiv version2, errata, bibtex]
    Peng Zhao, Lijun Zhang, Yuan Jiang, and Zhi-Hua Zhou.
    In: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020), Palermo, Italy, 2020. Page: 746-755.
        ♣ A correct and self-contained version [PDF] is updated at arXiv version2.

  • Optimal Margin Distribution Learning in Dynamic Environments. [PDF, bibtex]
    Teng Zhang, Peng Zhao, and Hai Jin.
    In: Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI 2020), New York, NY, 2020. Page: 6821-6828.

  • Nearest Neighbor Ensembles: An Effective Method for Difficult Problems in Streaming Classification with Emerging New Classes. [PDF, code, bibtex]
    Xin-Qiang Cai, Peng Zhao, Kai Ming Ting, Xin Mu, and Yuan Jiang.
    In: Proceedings of the 19th International Conference on Data Mining (ICDM 2019), Beijing, China, 2019. Page: 970-975.

  • Learning from Incomplete and Inaccurate Supervision. [PDF, code, bibtex]
    Zhen-Yu Zhang, Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
    In: Proceedings of the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2019), Anchorage, AL, 2019. Page: 1017-1025.
        ♣ Journal version [PDF] published at IEEE TKDE.

  • Improving Deep Forest by Confidence Screening. [PDF, code, bibtex]
    Ming Pang, Kai Ming Ting, Peng Zhao, and Zhi-Hua Zhou.
    In: Proceedings of the 18th IEEE International Conference on Data Mining (ICDM 2018), Singapore, 2018. Page: 1194-1199.
        ♣ Journal version [PDF] published at IEEE TKDE.

  • Label Distribution Learning by Optimal Transport. [PDF, supp, code, bibtex]
    Peng Zhao and Zhi-Hua Zhou.
    In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), New Orleans, Louisiana, 2018. Page: 4506-4513.

  • Dual Set Multi-Label Learning. [PDF, supp, code, bibtex]
    Chong Liu, Peng Zhao, Sheng-Jun Huang, Yuan Jiang, and Zhi-Hua Zhou.
    In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), New Orleans, Louisiana, 2018. Page: 3635-3642.

  • Multi-View Matrix Completion for Clustering with Side Information. [PDF, code, bibtex]
    Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
    In: Proceedings of the 21st Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2017), LNCS, Jeju, Korea, 2017. Page: 403-415.

Journal Papers

  • Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization. [PDF, arXiv, bibtex]
    Peng Zhao, Yu-Jie Zhang, Lijun Zhang, and Zhi-Hua Zhou.
    Journal of Machine Learning Research (JMLR), 25(98):1−52, 2024.

  • Learning with Asynchronous Labels. [PDF, code, bibtex]
    Yu-Yang Qian, Zhen-Yu Zhang, Peng Zhao, and Zhi-Hua Zhou.
    ACM Transactions on Knowledge Discovery from Data (TKDD), in press, 2024.

  • Exploratory Machine Learning with Unknown Unknowns. [PDF, code, bibtex]
    Peng Zhao, Jia-Wei Shan, Yu-Jie Zhang, and Zhi-Hua Zhou.
    Artificial Intelligence (AIJ), to appear, 2024.

  • Online Non-stochastic Control with Partial Feedback. [PDF, bibtex]
    Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
    Journal of Machine Learning Research (JMLR), 24(273):1−50, 2023.

  • Non-stationary Online Learning with Memory and Non-stochastic Control. [PDF, arXiv, bibtex]
    Peng Zhao, Yu-Hu Yan, Yu-Xiang Wang, and Zhi-Hua Zhou.
    Journal of Machine Learning Research (JMLR), 24(206):1−70, 2023.

  • Learning from Incomplete and Inaccurate Supervision. [PDF, official version, code, bibtex]
    Zhen-Yu Zhang, Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
    IEEE Transactions on Knowledge and Data Engineering (TKDE), 34(12), 5854-5868, 2022.

  • Improving Deep Forest by Screening. [PDF, official version, bibtex]
    Ming Pang, Kai Ming Ting, Peng Zhao, and Zhi-Hua Zhou.
    IEEE Transactions on Knowledge and Data Engineering (TKDE), 34(9), 4298-4312, 2022.

  • Bandit Convex Optimization in Non-stationary Environments. [PDF, arXiv, bibtex]
    Peng Zhao, Guanghui Wang, Lijun Zhang, and Zhi-Hua Zhou.
    Journal of Machine Learning Research (JMLR), 22(125):1−45, 2021.

  • 基于决策树模型重用的分布变化流数据学习. [PDF]
    赵鹏, 周志华.
    中国科学:信息科学, 2021, 51(1): 1-12.(封面文章)

  • Distribution-Free One-Pass Learning. [PDF, official version, code, bibtex]
    Peng Zhao, Xinqiang Wang, Siyu Xie, Lei Guo, and Zhi-Hua Zhou.
    IEEE Transaction on Data Engineering (TKDE), 33(3): 951-963, 2021.

  • Handling Concept Drift via Model Reuse. [PDF, official version, code, bibtex]
    Peng Zhao, Le-Wen Cai, and Zhi-Hua Zhou.
    Machine Learning (Special Issue of the ACML 2019 Journal Track), 109(3): 533-568, 2020.

Technical Notes

  • Non-stationary Linear Bandits Revisited. [PDF, arXiv]
    Peng Zhao and Lijun Zhang. Technical Note, 2021.



[go back]