Research

Preprints

Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization. [PDF, arXiv]
Sijia Chen, Yu-Jie Zhang, Wei-Wei Tu, Peng Zhao, and Lijun Zhang.

Conference Papers

Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition. [PDF, arXiv, bibtex]
Long-Fei Li, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024), Valencia, Spain, 2024. Page: to appear.
Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation. [PDF, bibtex]
Long-Fei Li, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, 2024. Page: 13572-13580.
Universal Online Learning with Gradient Variations: A Multi-layer Online Ensemble Approach. [PDF, arXiv, bibtex] (Spotlight)
Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 37682-37715.
Dynamic Regret of Adversarial Linear Mixture MDPs. [PDF, bibtex]
Long-Fei Li, Peng Zhao, and Zhi-Hua Zhou.
In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 60685-60711.
Adapting to Continuous Covariate Shift via Online Density Ratio Estimation. [PDF, arXiv, bibtex]
Yu-Jie Zhang, Zhen-Yu Zhang, Peng Zhao, and Masashi Sugiyama.
In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 29074-29113.
Stochastic Approximation Approaches to Group Distributionally Robust Optimization. [PDF, arXiv, bibtex]
Lijun Zhang, Peng Zhao, Zhen-Hua Zhuang, Tianbao Yang, and Zhi-Hua Zhou.
In: Advances in Neural Information Processing Systems 36 (NeurIPS 2023), New Orleans, Louisiana, 2023. Page: 52490-52522.
Handling New Class in Online Label Shift. [PDF, bibtex]
Yu-Yang Qian*, Yong Bai*, Zhen-Yu Zhang, Peng Zhao, and Zhi-Hua Zhou. (* indicates equal contribution)
In: Proceedings of the 23rd IEEE International Conference on Data Mining (ICDM 2023), Shanghai, China, 2023. Page: 1283-1288.
Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization. [PDF, long version, arXiv, bibtex]
Sijia Chen, Wei-Wei Tu, Peng Zhao, and Lijun Zhang.
In: Proceedings of the 40th International Conference on Machine Learning (ICML 2023), Hawaii, Honolulu, 2023. Page: 5002-5035.
Fast Rates in Time-Varying Strongly Monotone Games. [PDF, bibtex]
Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 40th International Conference on Machine Learning (ICML 2023), Hawaii, Honolulu, 2023. Page: 39138-39164.
Revisiting Weighted Strategy for Non-stationary Parametric Bandits. [PDF, arXiv, bibtex]
Jing Wang, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023), Valencia, Spain, 2023. Page: 7913-7942.
Beyond Performative Prediction: Open-environment Learning with Presence of Corruptions. [PDF, bibtex]
Jia-Wei Shan, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023), Valencia, Spain, 2023. Page: 7981-7998.
Efficient Methods for Non-stationary Online Learning. [PDF, long version, arXiv, bibtex] (Oral)
Peng Zhao, Yan-Feng Xie, Lijun Zhang, and Zhi-Hua Zhou.
In: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), New Orleans, Louisiana, 2022. Page: 11573-11585.
Adapting to Online Label Shift with Provable Guarantees. [PDF, arXiv, code, bibtex]
Yong Bai*, Yu-Jie Zhang*, Peng Zhao, Masashi Sugiyama, and Zhi-Hua Zhou. (* indicates equal contribution)
In: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), New Orleans, Louisiana, 2022. Page: 29960-29974.
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits. [PDF, arXiv, bibtex]
Haipeng Luo, Mengxiao Zhang, Peng Zhao, and Zhi-Hua Zhou. (alphabetical order)
In: Proceedings of the 35th Annual Conference on Learning Theory (COLT 2022), London, UK, 2022. Page: 3635-3684.
Adaptive Bandit Convex Optimization with Heterogeneous Curvature. [PDF, arXiv, bibtex]
Haipeng Luo, Mengxiao Zhang, and Peng Zhao. (alphabetical order)
In: Proceedings of the 35th Annual Conference on Learning Theory (COLT 2022), London, UK, 2022. Page: 1576-1612.
No-Regret Learning in Time-Varying Zero-Sum Games. [PDF, arXiv, bibtex]
Mengxiao Zhang*, Peng Zhao*, Haipeng Luo, and Zhi-Hua Zhou. (* indicates equal contribution)
In: Proceedings of the 39th International Conference on Machine Learning (ICML 2022), Baltimore, Maryland, 2022. Page: 26772-26808.
Dynamic Regret of Online Markov Decision Processes. [PDF, arXiv, full version, bibtex]
Peng Zhao, Long-Fei Li, and Zhi-Hua Zhou.
In: Proceedings of the 39th International Conference on Machine Learning (ICML 2022), Baltimore, Maryland, 2022. Page: 26865-26894.
Non-stationary Online Learning with Memory and Non-stochastic Control. [PDF, arXiv, bibtex]
Peng Zhao, Yu-Xiang Wang, and Zhi-Hua Zhou.
In: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022), online, 2022. Page: 2101-2133.
♣ Journal version [PDF] published at JMLR.
Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits. [PDF, arXiv, bibtex]
Youming Tao*, Yulian Wu*, Peng Zhao, and Di Wang. (* indicates equal contribution)
In: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS 2022), online, 2022. Page: 1546-1574.
Improved Analysis for Dynamic Regret of Strongly Convex and Smooth Functions. [PDF, arXiv, bibtex]
Peng Zhao and Lijun Zhang.
In: Proceedings of the 3rd Conference on Learning for Dynamics and Control (L4DC 2021), online, 2021. Page: 48-59.
Exploratory Machine Learning with Unknown Unknowns. [PDF, code, bibtex]
Peng Zhao, Yu-Jie Zhang, and Zhi-Hua Zhou.
In: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), online, 2021. Page: 10999-11006.
♣ Journal version [PDF] published at Artificial Intelligence.
Towards Enabling Learnware to Handle Unseen Jobs. [PDF, code, bibtex]
Yu-Jie Zhang, Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), online, 2021. Page: 10964-10972.
Storage Fit Learning with Feature Evolvable Streams. [PDF, code, bibtex]
Bo-Jian Hou, Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), online, 2021. Page: 7729-7736.
Dynamic Regret of Convex and Smooth Functions. [PDF, arXiv, bibtex]
Peng Zhao, Yu-Jie Zhang, Lijun Zhang, and Zhi-Hua Zhou.
In: Advances in Neural Information Processing Systems 33 (NeurIPS 2020), Vancouver, Canada, 2020. Page: 12510-12520.
♣ Journal version [PDF] finally got published at JMLR (with an almost three-year review period..).
An Unbiased Risk Estimator for Learning with Augmented Classes. [PDF, arXiv, code, bibtex]
Yu-Jie Zhang, Peng Zhao, Lanjihong Ma, and Zhi-Hua Zhou.
In: Advances in Neural Information Processing Systems 33 (NeurIPS 2020), Vancouver, Canada, 2020. Page: 10247-10258.
Learning with Feature and Distribution Evolvable Streams. [PDF, code, bibtex]
Zhen-Yu Zhang, Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
In: Proceedings of the 37th International Conference on Machine Learning (ICML 2020), Vienna, Austria, 2020. Page: 11317-11327.
A Simple Online Algorithm for Competing with Dynamic Comparators. [PDF, bibtex]
Yu-Jie Zhang, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI 2020), Toronto, Canada, 2020. Page: 390-399.
Bandit Convex Optimization in Non-stationary Environments. [PDF, journal, arXiv, bibtex]
Peng Zhao, Guanghui Wang, Lijun Zhang, and Zhi-Hua Zhou.
In: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020), Palermo, Italy, 2020. Page: 1508-1518.
♣ Journal version [PDF] published at JMLR.
A Simple Approach for Non-stationary Linear Bandits. [PDF, arXiv version2, errata, bibtex]
Peng Zhao, Lijun Zhang, Yuan Jiang, and Zhi-Hua Zhou.
In: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020), Palermo, Italy, 2020. Page: 746-755.
♣ A correct and self-contained version [PDF] is updated at arXiv version2.
Optimal Margin Distribution Learning in Dynamic Environments. [PDF, bibtex]
Teng Zhang, Peng Zhao, and Hai Jin.
In: Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI 2020), New York, NY, 2020. Page: 6821-6828.
Nearest Neighbor Ensembles: An Effective Method for Difficult Problems in Streaming Classification with Emerging New Classes. [PDF, code, bibtex]
Xin-Qiang Cai, Peng Zhao, Kai Ming Ting, Xin Mu, and Yuan Jiang.
In: Proceedings of the 19th International Conference on Data Mining (ICDM 2019), Beijing, China, 2019. Page: 970-975.
Learning from Incomplete and Inaccurate Supervision. [PDF, code, bibtex]
Zhen-Yu Zhang, Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
In: Proceedings of the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2019), Anchorage, AL, 2019. Page: 1017-1025.
♣ Journal version [PDF] published at IEEE TKDE.
Improving Deep Forest by Confidence Screening. [PDF, code, bibtex]
Ming Pang, Kai Ming Ting, Peng Zhao, and Zhi-Hua Zhou.
In: Proceedings of the 18th IEEE International Conference on Data Mining (ICDM 2018), Singapore, 2018. Page: 1194-1199.
♣ Journal version [PDF] published at IEEE TKDE.
Label Distribution Learning by Optimal Transport. [PDF, supp, code, bibtex]
Peng Zhao and Zhi-Hua Zhou.
In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), New Orleans, Louisiana, 2018. Page: 4506-4513.
Dual Set Multi-Label Learning. [PDF, supp, code, bibtex]
Chong Liu, Peng Zhao, Sheng-Jun Huang, Yuan Jiang, and Zhi-Hua Zhou.
In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), New Orleans, Louisiana, 2018. Page: 3635-3642.
Multi-View Matrix Completion for Clustering with Side Information. [PDF, code, bibtex]
Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
In: Proceedings of the 21st Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2017), LNCS, Jeju, Korea, 2017. Page: 403-415.

Journal Papers

Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization. [PDF, arXiv, bibtex]
Peng Zhao, Yu-Jie Zhang, Lijun Zhang, and Zhi-Hua Zhou.
Journal of Machine Learning Research (JMLR), 25(98):1−52, 2024.
Learning with Asynchronous Labels. [PDF, code, bibtex]
Yu-Yang Qian, Zhen-Yu Zhang, Peng Zhao, and Zhi-Hua Zhou.
ACM Transactions on Knowledge Discovery from Data (TKDD), in press, 2024.
Exploratory Machine Learning with Unknown Unknowns. [PDF, code, bibtex]
Peng Zhao, Jia-Wei Shan, Yu-Jie Zhang, and Zhi-Hua Zhou.
Artificial Intelligence (AIJ), to appear, 2024.
Online Non-stochastic Control with Partial Feedback. [PDF, bibtex]
Yu-Hu Yan, Peng Zhao, and Zhi-Hua Zhou.
Journal of Machine Learning Research (JMLR), 24(273):1−50, 2023.
Non-stationary Online Learning with Memory and Non-stochastic Control. [PDF, arXiv, bibtex]
Peng Zhao, Yu-Hu Yan, Yu-Xiang Wang, and Zhi-Hua Zhou.
Journal of Machine Learning Research (JMLR), 24(206):1−70, 2023.
Learning from Incomplete and Inaccurate Supervision. [PDF, official version, code, bibtex]
Zhen-Yu Zhang, Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou.
IEEE Transactions on Knowledge and Data Engineering (TKDE), 34(12), 5854-5868, 2022.
Improving Deep Forest by Screening. [PDF, official version, bibtex]
Ming Pang, Kai Ming Ting, Peng Zhao, and Zhi-Hua Zhou.
IEEE Transactions on Knowledge and Data Engineering (TKDE), 34(9), 4298-4312, 2022.
Bandit Convex Optimization in Non-stationary Environments. [PDF, arXiv, bibtex]
Peng Zhao, Guanghui Wang, Lijun Zhang, and Zhi-Hua Zhou.
Journal of Machine Learning Research (JMLR), 22(125):1−45, 2021.
基于决策树模型重用的分布变化流数据学习. [PDF]
赵鹏, 周志华.
中国科学：信息科学, 2021, 51(1): 1-12.（封面文章）
Distribution-Free One-Pass Learning. [PDF, official version, code, bibtex]
Peng Zhao, Xinqiang Wang, Siyu Xie, Lei Guo, and Zhi-Hua Zhou.
IEEE Transaction on Data Engineering (TKDE), 33(3): 951-963, 2021.
Handling Concept Drift via Model Reuse. [PDF, official version, code, bibtex]
Peng Zhao, Le-Wen Cai, and Zhi-Hua Zhou.
Machine Learning (Special Issue of the ACML 2019 Journal Track), 109(3): 533-568, 2020.

Technical Notes

Non-stationary Linear Bandits Revisited. [PDF, arXiv]
Peng Zhao and Lijun Zhang. Technical Note, 2021.

Publications

Preprints

Conference Papers

Journal Papers

Technical Notes