A Collection of My Publications

* Equal contribution · ✉ Corresponding author

Technical Report
Seedance 2.0: Advancing Video Generation for World Complexity
Team Seedance
arXiv, 2026
PDF

ICLR 2026
Reading Images Like Texts: Sequential Image Understanding in Vision-Language Models
Yueyan Li*, Chenggong Zhao, Zeyuan Zhang, Caixia Yuan✉, Xiaojie Wang
International Conference on Learning Representations (ICLR), 2026
PDF · Code

arXiv
Sparse Model Diffing via Dynamic Circuits
Yueyan Li*, Wenhao Gao, Caixia Yuan✉, Xiaojie Wang
arXiv, 2026
PDF · Code

Technical Report
AutoGLM: Autonomous Foundation Agents for GUIs
Team AutoGLM
arXiv, 2024
PDF · Code

EMNLP 2024
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Yifan Xu*, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Zhao Wenyi, Jie Tang, Yuxiao Dong✉
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
PDF · Code

Technical Report
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Team GLM
arXiv, 2024
PDF · Code

ICML 2026
EntroKV: An Entropy-aware Memory Manager for KV Cache Compression
Wenhao Gao*, Haoran Cao, Yueyan Li, Caixia Yuan, Xiaojie Wang✉
ICML, 2026
Code

January 23, 2026 · 1 min · 180 words · Sirius

Thinking and Reasoning

A curated overview of LLM reasoning, post-training, reinforcement learning, and related resources.

June 30, 2025 · 4 min · 1628 words · Sirius

LLM Agents

A running collection of notes, papers, and benchmarks around LLM agents, agentic RL, GUI agents, and deep research systems.

June 28, 2025 · 2 min · 634 words · Sirius

Interpretability (& other areas) for Multimodal Models

A curated reading list on multimodal interpretability, information flow, diffusion models, and related research threads.

February 25, 2025 · 10 min · 4893 words · Sirius

Possible Research Areas in Mechanistic Interpretability

An overview of mechanistic interpretability directions, including circuit discovery, SAEs, steering vectors, and model diffing.

September 6, 2024 · 7 min · 3414 words · Sirius

Exploring Emotional Features in GPT2-Small

🎶 Code in this post can be found in the Jupyter notebook in my “saeExploration” repo.

Find features that reflect positive emotions

To find the features related to a specific emotion, I write five sentences containing keywords for that emotion. For example, for happy emotions I have:

    prompt_happy = [
        "I'll be on a vacation tomorrow and I'm so happy.",
        "My mom brings home a new puppy and I'm so happy.",
        "I'm so glad I got the job I wanted.",
        "I feel so happy when I'm with my friends.",
        "I'm so happy I got the promotion I wanted.",
    ]

I choose to look for features that reflect happiness and sadness. I also wonder whether the feature that reflects excitement is related to the one that reflects happiness (the two are semantically similar, at least). ...

August 29, 2024 · 6 min · 1114 words · Sirius

A Brief Introduction to Mechanistic Interpretability Research

⚠️ Warning: This post was written when I first delved into this area, and it has not been updated in a long time, so it may contain many errors. I’m still interested in interpretability and its applications, and I’ll write something new and interesting later ~

💡 This post is accompanied by another post, which covers specific content in this area. ...

August 28, 2024 · 16 min · 3210 words · Sirius