[2025-10] 大语言模型强化学习方法 @ CNCC 2025 Tutorial
[2025-03] LLM Post-Training: Formulation and Algorithms @ Peking University
[2025-03] Preserving Output Diversity in Supervised Fine-Tuning of Large Language Models @ Xi'an Jiaotong University