Menu
Hui Sun
Nanjing University
LAMDA Group

Biography

I am a Ph.D. student at School of Artificial Intelligence in Nanjing University , and a member of LAMDA Group , led by Prof. Zhi-Hua Zhou .

I received my B.Sc. from Software College at Jilin University in 2019. I then joined Nanjing University as a M.Sc. student (direct admission) and graduated in 2022. From 2022 to 2023, I worked as an algorithm engineer at Shopee . In 2023, I returned to LAMDA to pursue a full-time Ph.D.

Jilin University
B.Sc. in SE
Sep 2015 - Jun 2019
Nanjing University
M.Sc. in CS
Sep 2019 - Jun 2022
Nanjing University
Ph.D. in CS
Sep 2023 - Present
Oct 2018 - Mar 2019
ByteDance
Intern
Search Algorithm
Jun 2021 - Sep 2021
AliExpress
Intern
Recommendation Algo.
Jul 2022 - Aug 2023
Shopee
Full-time
Search Algorithm
Sep 2023 - Present
Alibaba
Intern
LLM Researcher

Research Interests

I am interested in machine learning and data mining, especially the following:

Transfer Learning & Semi-supervised Learning

Multimodal & Code LLMs

Search Engine & Recomendation System

Publications

Preprint

P1
Preprint

Loading...

Hui Sun, Zheng Xie, Hao-Yuan He, and Ming Li

P2
Preprint

Loading...

Xin-Ye Li, Yali Du, Hui Sun, and Ming Li

Journal Papers

J1
TSE 2025 CCF-A SCI

Post-Incorporating Code Structural Knowledge into Pretrained Models via ICL for Code Translation

Yali Du†, Hui Sun† (Equal Contribution), and Ming Li

IEEE Transactions on Software Engineering, 2025

J2
TORS 2024

Learning Personalizable Clustered Embedding for Recommender Systems

Yizhou Chen, Guangda Huzhang, Anxiang Zeng, Qingtao Yu, Hui Sun, Heng-Yi Li, Jingyi Li, Yabo Ni, Han Yu, and Zhiming Zhou

ACM Transactions on Recommender Systems, 2024

J3
SCIS 2023 CCF-A SCI

Enhancing Unsupervised Domain Adaptation by Exploiting the Conceptual Consistency of Multiple Self-supervised Tasks

Hui Sun, and Ming Li

SCIENCE CHINA Information Sciences, 2023, 66: 142101

Conference Papers

C1
AAAI 2026 CCF-A

Dynamic-Static Synergistic Selection Method for Candidate Code Solutions with Generated Test Cases

Ren-Biao Liu, Jiang-Tian Xue, Chao-Zeng Ma, Hui Sun, Xin-Ye Li, and Ming Li

The 40th AAAI Conference on Artificial Intelligence, 2026

C2
AAAI 2026 CCF-A

ARBench: Algorithmic Reasoner or API Alchemist? Evaluating Code-Generating LLMs beyond API Calls

Ren-Biao Liu, Chao-Zeng Ma, An-Qi Li, Hui Sun, Xin-Ye Li, and Ming Li

The 40th AAAI Conference on Artificial Intelligence, 2026

C3
Tech Report

Ovis2.5 Technical Report

Ovis Team (Contributor: Hui Sun), Alibaba Group

Preprint, arXiv: 2508.11737

C4
ICCV 2025 CCF-A

MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs

Hui Sun, Shiyin Lu, Huanyu Wang, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, Ming Li

IEEE/CVF International Conference on Computer Vision, 2025

C5
ICML 2025 CCF-A

Revisiting Chain-of-Thought in Code Generation: Do Language Models Need to Learn Reasoning before Coding?

Ren-Biao Liu, An-Qi Li, Chao-Ding Yang, Hui Sun, and Ming Li

The 42nd International Conference on Machine Learning, 2025

C6
ASE 2024 CCF-A

A Joint Learning Model with Variational Interaction for Multilingual Program Translation

Yali Du, Hui Sun, and Ming Li

IEEE/ACM 46th International Conference on Automated Software Engineering, 2024

C7
ICML 2024 CCF-A

Ambiguity-Aware Abductive Learning

Hao-Yuan He, Hui Sun, Zheng Xie, and Ming Li

The 41st International Conference on Machine Learning, 2024

C8
AAAI 2023 CCF-A

Cooperative and Adversarial Learning: Co-Enhancing Discriminability and Transferability in Domain Adaptation

Hui Sun, Zheng Xie, Xin-Ye Li, and Ming Li

The 37th AAAI Conference on Artificial Intelligence, 2023

C9
AAAI 2023 CCF-A

Semi-Supervised Learning with Support Isolation by Small-Paced Self-Training

Zheng Xie, Hui Sun, and Ming Li

The 37th AAAI Conference on Artificial Intelligence, 2023

C10
WWW 2023 CCF-A

Clustered Embedding Learning for Recommender Systems

Yizhou Chen, Guangda Huzhang, Anxiang Zeng, Qingtao Yu, Hui Sun, Heng-Yi Li, Jingyi Li, Yabo Ni, Han Yu, and Zhiming Zhou

The World Wide Web Conference, 2023

Patents

PT1
已授权

一种面向环境变化的无监督迁移学习图像分类算法

Ming Li (黎铭), Hui Sun (孙辉), Zhi-Hua Zhou (周志华)

Patent No. 202210461879.4

Awards & Honors

Value Star Awards

Top 4% (8 out of 200+)

Shopee · Dec 2022

Artificial Intelligence Scholarship

50 recipients university-wide per year

Nanjing University · Oct 2019

China Collegiate Computing Contest (CCCC)

Outstanding Winner (Highest Honor)

Jilin Province · Mar 2018

ACM-ICPC Asia Regional Contest

Silver Medal

Nanning · Dec 2017

ACM-ICPC Asia EC-Final

Bronze Medal

Shanghai · Dec 2017

Jilin Province Collegiate Programming Contest

Gold Medal

Changchun · May 2017

Northeast Collegiate Programming Contest

Gold Medal

Changchun · Sep 2016

Work Experience

Alibaba AIDC - Ovis Team

LLM Researcher | Multimodal LLM Research

Sep 2023 - Present

As a key developer, I contributed to Ovis V1.6, V2.0, and V2.5. Primarily responsible for developing video understanding capabilities from scratch (including data preparation, algorithm design, and model training), contributing to the SFT for the Ovis GUI Agent with a focus on optimizing GUI & general grounding, and collecting trajectory data for the agent's web-based operations.

#1

Ovis 2.5

<40B params

#2

Ovis 2.0

OpenCompass

ICCV25

MDP3

First-author

Tech Report

Ovis2.5

Contributor

Shopee (Sea Limited)

Algorithm Engineer (Full-time) | Ranking for E-commerce Search

Jul 2022 - Aug 2023

Primarily responsible for maintaining and optimizing the foundational E-commerce search ranking model, initially used in fine-ranking and extended to recall, coarse-ranking, and long-tail scenarios. Key work includes:

Algorithm Improvements:
+0.6%

CEL - CTR AUC

Embedding Learning

+0.5%

PLE - CTR AUC

Multi-task

+0.2%

AutoDis - CTR

Numeric feature

+0.3%

AutoDis - CVR

Numeric feature

Model Infastruceture Maintenance: deconstructing neural networks into configurable blocks for faster convergence and sharing user-side computations to improve speed.
2+m →1week

Data to Convergence

Module-wise Model Resue

+105%

Training Speed

37→76 samples/(cpu·s)

+232%

Inference Speed

95.5ms→28.8ms

10x

CEL Speed

Real-world impl.

AliExpress (Alibaba Group)

Algorithm Engineer (Intern) | Ranking for E-commerce Recommendation

Jun 2021 - Sep 2021

Designed a country-specific multitask fine-ranking model for multinational e-commerce platform. During the internship, reproduced 14 papers and finally adopted PLE, ESMM, and Gradient Block to optimize the multitask framework. Designed country-level PLE in the top network to capture specific information of the top 5 key countries, and used DCN-v2 in the bottom network to achieve higher-order feature crossover for country-differentiated features.

14

Reproduced Papers

+0.50%

CTR AUC

+0.78%

CTR GAUC

+0.53%

L2P AUC

+1.17%

L2P GAUC

ByteDance

Search Algorithm Engineer (Intern) | Toutiao Series Apps

Oct 2018 - Mar 2019

Mainly responsible for the official user search in ByteDance's "Toutiao" series apps (Toutiao, Xigua Video, DongCheDi, etc.), including the vertical search results under the user search tab, and the user card result under the all web search and video search tab.

+30%

CTR of query

15%→45%

+60%

Card Recall Ratio

User cards

+5%

Top 3 CTR

User cards

Education

Nanjing University (南京大学)

M.Sc. in Computer Science and Technology, Schoolar of Artificial Intelligence

Recommended admission (without entrance exam); Ranked 1st in coding test during interview.

2019 - 2022

Jilin University (吉林大学)

B.Sc. in Software Engineering (Excellent Engineer Program)

GPA: 3.7/4.0 (Top 5%) · Exchange: ITMO University, Russia (2017)

2015 - 2019

Teaching Assistant

Artificial Intelligence Guidance (With Prof. Ming Li; Fall 2025)

Data Mining for Complex Data Objects (With Prof. Ming Li; Fall 2021)

Introduction to Data Mining (With Prof. Ming Li; Spring 2020)

© 2025 Hui Sun (孙辉). Last updated: November 2025