Blog

2026

Agentic RL

Published:

Playing with agentic RL.

Parallel Training

Published:

Notes on playing with the UltraScale Playbook.

DEF file in EDA

Published:

An in-depth look at DEF files in EDA and their role in the design process.

NAS Setup Notes

Published:

I used a junk machine as the host, along with hard drives retired from a server. These are notes on the setup process.

Implementing a Simple eval Function

Published:

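In this post, we explore how to implement a simple eval function that parses and evaluates basic mathematical expressions. We represent expressions as an abstract syntax tree (AST), build the AST with a recursive descent parser, and then implement an evaluation function that computes the expression's value.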

Studying the ZeRO Papers

Published:

Studying the ZeRO series of papers to gain a deeper understanding of optimization techniques in distributed training.

NCCL Basics

Published:

An introduction to NVIDIA Collective Communications Library (NCCL) for efficient multi-GPU communication.

Cpp Tricks

Published:

A collection of useful C++ tricks and tips for developers.

2025

ScaLAPACK

Published:

How to use ScaLAPACK for parallel linear algebra computations on distributed-memory systems.

MPI

Published:

Distributed-memory programming with MPI.

Why Is Qiskit Estimator Fast?

Published:

Discover the speed advantages of Qiskit Estimator for quantum computing tasks. Learn how it optimizes performance and efficiency in quantum simulations.

CPU Arch Optimizations

Published:

An in-depth look at CPU architecture optimizations and their impact on performance.

HIF

Published:

Hierarchical interpolative factorization.

LLM

Published:

Learning a bit about large language models.

GPU human docs

Published:

A collection of human-readable GPU documentation.