Publication

* indicates equal contribution or alphabetical ordering.

Google Scholar

CoRT: Code-integrated Reasoning within Thinking

Chengpeng Li*, Zhengyang Tang*, Ziniu Li*, Mingfeng Xue, Keqin Bao, Tian Ding, Ruoyu Sun, Benyou Wang, Xiang Wang, Junyang Lin, Dayiheng Liu

arXiv:2506.09820

Quality-Diversity Red-Teaming: Automated Generation of High-Quality and Diverse Attackers for Large Language Models

Ren-Jian Wang, Ke Xue, Zeyu Qin, Ziniu Li, Sheng Tang, Hao-Tian Li, Shengcai Liu, Chao Qian

arXiv:2506.07121

Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO

Peter Chen, Xiaopeng Li, Ziniu Li, Xi Chen, Tianyi Lin

arXiv:2505.11595

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

Xueyao Zhang, Yuancheng Wang, Chaoren Wang, Ziniu Li, Zhuo Chen, Zhizheng Wu

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Controlling Large Language Model with Latent Actions

Chengxing Jia, Ziniu Li, Pengyuan Wang, Yi-Chen Li, Zhenyu Hou, Yuxiao Dong, Yang Yu

The 42nd International Conference on Machine Learning (ICML), 2025

Adam-mini: Use Fewer Learning Rates To Gain More

Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Diederik P. Kingma, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun

The 13th International Conference on Learning Representations (ICLR), 2025

Preserving Diversity in Supervised Fine-tuning of Large Language Models

Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Zhi-Quan Luo, Ruoyu Sun

The 13th International Conference on Learning Representations (ICLR), 2025
Best Paper Runner-up at NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning
(Previously titled "Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity")

Understanding and Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention

Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu

The 13th International Conference on Learning Representations (ICLR), 2025

Enabling Scalable Oversight via Self-Evolving Critic

Zhengyang Tang*, Ziniu Li*, Zhenyang Xiao*, Tian Ding, Ruoyu Sun, Benyou Wang, Dayiheng Liu, Fei Huang, Tianyu Liu, Bowen Yu, Junyang Lin

arXiv:2501.05727

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Zhengyang Tang*, Ziniu Li*, Zhenyang Xiao*, Tian Ding, Ruoyu Sun, Benyou Wang, Dayiheng Liu, Fei Huang, Tianyu Liu, Bowen Yu, Junyang Lin

arXiv:2501.14492

Pruning for Robust Concept Erasing in Diffusion Models

Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu

NeurIPS Workshop on Safe Generative AI, 2024

Unlocking Black-Box Prompt Tuning Efficiency via Zeroth-Order Optimization

Heshen Zhan, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun

The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Findings), 2024

Sensing Jamming Strategy from Limited Observations: An Imitation Learning Perspective

Youlin Fan, Bo Jiu, Wenqiang Pu, Ziniu Li, Kang Li, Hongwei Liu

IEEE Transactions on Signal Processing (TSP)

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo

The 41st International Conference on Machine Learning (ICML), 2024

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

Jiancong Xiao, Ziniu Li, Xingyu Xie, Emily Getzen, Cong Fang, Qi Long, Weijie J. Su

arXiv:2405.16455

Why Transformers Need Adam: A Hessian Perspective

Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo

Conference on Neural Information Processing Systems (NeurIPS) 38, 2024

When is RL better than DPO in RLHF? A Representation and Optimization Perspective

Ziniu Li*, Tian Xu*, Yang Yu

The 12th International Conference on Learning Representations (ICLR) (Tiny Paper Track), 2024
(Oral presentation, with an early version at arXiv:2312.10584)

Imitation Learning from Imperfection: Theoretical Justifications and Algorithms

Ziniu Li*, Tian Xu*, Zeyu Qin, Yang Yu, Zhi-Quan Luo

Conference on Neural Information Processing Systems (NeurIPS) 37, 2023
(Spotlight presentation)

Provably Efficient Adversarial Imitation Learning with Unknown Transitions

Tian Xu*, Ziniu Li*, Yang Yu, Zhi-Quan Luo

The 39th Conference on Uncertainty in Artificial Intelligence (UAI), 2023
(Oral presentation, with an early version at arXiv:2106.10424v2)

Deploying Offline Reinforcement Learning with Human Feedback

Ziniu Li, Ke Xu, Liu Liu, Lanqing Li, Deheng Ye, Peilin Zhao

arXiv:2303.07046

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis

Tian Xu*, Ziniu Li*, Yang Yu, Zhi-Quan Luo

arXiv:2208.01899
(The early version of this work is at arXiv:2106.10424v3)

Rethinking ValueDice: Does It Really Improve Performance?

Ziniu Li*, Tian Xu*, Yang Yu, Zhi-Quan Luo

The 10th International Conference on Learning Representations (ICLR) (Blog Track), 2022

HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning

Ziniu Li, Yingru Li, Yushun Zhang, Tong Zhang, Zhi-Quan Luo

The 10th International Conference on Learning Representations (ICLR), 2022
(Oral presentation at Workshop on Ecological Theory of Reinforcement Learning at NeurIPS, 2021)

A Concise Introduction to Imitation Learning (In Chinese)

Tian Xu, Ziniu Li, Yang Yu

Available online

Error Bounds of Imitating Policies and Environments for Reinforcement Learning

Tian Xu, Ziniu Li, Yang Yu

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Error Bounds of Imitating Policies and Environments

Tian Xu, Ziniu Li, Yang Yu

Conference on Neural Information Processing Systems 34 (NeurIPS), 2020

Efficient Exploration by Novelty-pursuit

Ziniu Li*, Xiong-Hui Chen*

The 2nd International Conference on Distributed Artificial Intelligence (DAI), 2020

Self-Guided Evolution Strategies with Historical Estimated Gradients

Fei-yu Liu, Ziniu Li, Chao Qian

The 29th International Joint Conference on Artificial Intelligence (IJCAI), 2020