Thinking and Reasoning

The Purpose of This Blog

Thinking models are extremely popular nowadays. I first delved into this area in September 2023. Later I gradually lost track of it, until DeepSeek came along. I want to keep collecting information about LLM reasoning and share my thoughts here.

Thinking Models

text-based explicit reasoning
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
- GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
- Skywork Open Reasoner 1 Technical Report

implicit reasoning
- (Coconut) Training Large Language Models to Reason in a Continuous Latent Space

others
- ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

blogs
- A top-down, in-depth reading of DeepSeek-R1, with many details
- MLA (1): Learning and thoroughly understanding the DeepSeek MLA algorithm from the code perspective
- Understanding thinking models (LLM-based reasoning models) from scratch: O1, DeepSeek R1, Kimi K1.5

overthinking ...

June 30, 2025 · 3 min · 1225 words · Sirius