Archive
All writing
| Date | Title |
|---|---|
| Dec 14, 2025 | How we inference (2025) After a year working at an inference engine start up, I have witnessed the great evolution of inference optimization in 2025. This blog concludes several famous breakthroughs. |
| Nov 28, 2025 | Daily Vibe Coding Share Recently, I have been invited to share some of my experiences with vibe coding. AI has profoundly changed the way I think and how I work. It is a great pleasure to witness such a transformation whe… |
| Mar 9, 2025 | RAG & Agent Share Last week I shared some my exp of RAG and Agent for all members in the company. Here's a desensitized copy of my notes. Though something I cannot share here, but these are enough for starters to le… |
| Feb 14, 2025 | 5 Years invention Patent My first-author invention patent just got approved after 5 years! Though technology has evolved since then, I'm still deeply thankful for this milestone. Looking back at those nights spent writing … |
| Feb 6, 2025 | DS-R1 & GRPO Code DSR1 paper https://arxiv.org/pdf/2501.12948 As we have read the GRPO algo in previous blog https://lihaorui.com/2025/02/05/deepseek-math-reading/ here we continue read deepseek r1's training pipeline. |
| Feb 5, 2025 | Deepseek Math Reading [](https://yaih.dawn.ee/image/SPdl) [](https://yaih.dawn.ee/image/SJKt) [](https://yaih.dawn.ee/image/Sqw7) **Proximal Policy Optimization (PPO)** is an actor-critic RL algorithm widely used in the… |
| Jan 25, 2025 | Prompt Attack Defense This blog is a note of Google's prompt attack and defense presentation at 2025 Google Cloud Export Summit Shenzhen. Video Source: 【提示词注入防御最佳实践】 https://www.bilibili.com/video/BV1DLwEeaEDa |
| Jan 24, 2025 | Swarm Code Reading Source code: https://github.com/openai/swarm My opensourced demo of how to modify and use swarm: https://github.com/haoruilee/huggingface-swarm |
| Jan 24, 2025 | Multi Agent RL & Web3 *Update at 2025.02.08: I write an implement of this idea https://github.com/haoruilee/Principal-Agent-Contract* Recently I read this paper *Principal-Agent Reinforcement Learning: Orchestrating AI … |
| Jan 22, 2025 | Tensor Parallelism and NCCL Recommend Reading: https://uvadlc-notebooks.readthedocs.io/en/latest/tutorial_notebooks/scaling/JAX/tensor_parallel_simple.html **Tensor Parallelism** optimizes the computation of large matrix oper… |
| Jan 22, 2025 | From RL to RLHF Note: This blog is a small part of these sources, recommend read them all. $1 categorizes techniques for aligning large language models (LLMs) into **four main themes** with subtopics: |
| Jan 21, 2025 | Invest Harvey Index **"Rich is having money (or assets) you haven't spent."** Over the past year, I've been exploring steady ways to grow wealth SLOWLY BUT SURELY. After countless trials, I've finally developed what I… |
| Jan 6, 2025 | 一天很漫長 但十年卻很短暫 Sam Altman 是Y Combinator這家創投公司的CEO,他最近剛滿三十歲。朋友問他這十年來他所經歷的,有什麼可以給別人建議的…他就洋洋灑灑寫了這個清單,我看完深深有感觸,就翻譯了一下… I turned 30 last week and a friend asked me if I’d figured out any life advice in the past decade… |
| Dec 23, 2024 | Qwen QwQ Guid In the rapidly evolving landscape of artificial intelligence, language models have transcended traditional boundaries, enabling applications that were once deemed impossible. Among these, QwQ https… |
| Dec 8, 2024 | Paper reading Online Learning Interesting online learning paper that solve the whole problem using pure math. Nice intro to the whole field. Data stream $D=\left\{\left(x_t, y_t\right) \mid t \in\{1,2, \ldots, T\}\right\}$, $x_… |
| Dec 6, 2024 | Bitcoin Whitepaper Summary *2024.12.06, write at the day bitcoin price hit $100,000* FTAV posts between June 2011 and today may have communicated the idea that bitcoin is a negative-sum game being played on a protocol that’s… |
| Oct 25, 2024 | Mathematical Models and Life Choices There are many interesting mathematical models in the world that can, to some extent, offer us a lot of insights. Chinese Version: https://zhuanlan.zhihu.com/p/871793375 |
| Jul 16, 2024 | Why GPT cannt compare Cuz this is a virus meme on Chinese social media, this post is first wirtten in Chinese. 最近有大量的群聊在讨论为什么GPT无法正确地比较9.11和9.9 可以看到无论是微软的Copilot,还是GPT4o,或者调用GPT API的外部网站,都无法正确地比较9.11和9.9的大小,最好玩的是即使程序已经明… |
| Jul 14, 2024 | Real Meaning of Rank Here, we first discuss a mathematical question that has long puzzled students of engineering and even physics: What is the real meaning of area, and how is it generalized to higher dimensions? |
| Jun 26, 2024 | Threads and Processes ***Robert Love on Quora:*** Here is the analogy I use in *Linux Kernel Development.* **Processes are the abstraction of running programs** : A binary image, virtualized memory, various kernel resou… |
| Jun 16, 2024 | Read AlexNet in 2024 After 12 years, reading Alexnet again can still give us insights. This work comes from UoT, Alex Krizhevsky is first author, his others famous paper: |
| Feb 17, 2021 | About this site This is my resume site, read `` to jump. If the webpages load slowly, consider switching the server: In order to improve the speed of access, this site uses a large number of .webp format pictures,… |
| Feb 16, 2021 | acknowledgement 其中为加速网页打开速度,对MIc-Theme改动为$1 |